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COMPOSITIONS, KITS, AND METHODS FOR 
IDENTIFICATION, ASSESSMENT, PREVENTION, AND THERAPY OF 

OVARIAN CANCER 

5 RELATED APPLICATIONS 

The present application claims priority to U.S. provisional patent application 
serial no. 60/191,031 filed on March 21, 2000, U.S. provisional patent application serial 
no. 60/207,124, filed on May 25, 2000, U.S. provisional patent application serial no. 
60/21 1,940, filed on June 15, 2000, U.S. provisional patent application serial no. 
10 60/216,820, filed on July 7, 2000, U.S. provisional patent application serial no. 

60/220,661, filed on July 25, 2000, and U.S. provisional patent application serial no. 
60/257,672, filed on December 21, 2000, all of which are expressly incorporated by 
reference. 



1 5 FIELD OF THE INVENTION 

The field of the invention is ovarian cancer, including diagnosis, 
characterization, management, and therapy of ovarian cancer. 

BACKGROUND OF THE INVENTION 
20 Ovarian cancer is responsible for significant morbidity and mortality in 

populations around the world. Ovarian cancer is classified, on the basis of clinical and 
pathological features, in three groups, namely epithelial ovarian cancer (EOC; >90% of 
ovarian cancer in Western countries), germ cell tumors {circa 2-3% of ovarian cancer), 
and stromal ovarian cancer (circa 5% of ovarian cancer; Ozols et ai, 1997, Cancer 
25 Principles and Practice of Oncology, 5th ed., DeVita et a/., Eds. pp. 1502). Relative to 
EOC, germ cell tumors and stromal ovarian cancers are more easily detected and treated 
at an early stage, translating into higher/better survival rates for patients afflicted with 
these two types of ovarian cancer. 

There are numerous types of ovarian tumors, some of which are benign, and 
30 others of which are malignant. Treatment (including non-treatment) options and 

predictions of patient outcome depend on accurate classification of the ovarian cancer. 
Ovarian cancers are named according to the type of cells from which the cancer is 
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derived and whether the ovarian cancer is benign or malignant. Recognized histological 
tumor types include, for example, serous, mucinous, endometrioid, and clear cell 
tumors. In addition, ovarian cancers are classified according to recognized grade and 
stage scales. 

5 In grade I, the tumor tissue is well differentiated from normal ovarian tissue. In 

grade II, tumor tissue is moderately well differentiated. In grade III, the tumor tissue is 
poorly differentiated from normal tissue, and this grade correlates with a less favorable 
prognosis than grades I and II. Stage I is generally confined within the capsule 
surrounding one (stage I A) or both (stage IB) ovaries, although in some stage I (i.e. 

10 stage IC) cancers, malignant cells may be detected in ascites, in peritoneal rinse fluid, or 
on the surface of the ovaries. Stage II involves extension or metastasis of the tumor 
from one or both ovaries to other pelvic structures. In stage IIA, the tumor extends or 
has metastasized to the uterus, the fallopian tubes, or both. Stage IIB involves extension 
of the tumor to the pelvis. Stage IIC is stage IIA or IIB in which malignant cells may be 

1 5 detected in ascites, in peritoneal rinse fluid, or on the surface of the ovaries. In stage III, 
the tumor comprises at least one malignant extension to the small bowel or the 
omentum, has formed extrapelvic peritoneal implants of microscopic (stage IIIA) or 
macroscopic (< 2 centimeter diameter, stage IIIB; > 2 centimeter diameter, stage IIIC) 
size, or has metastasized to a retroperitoneal or inguinal lymph node (an alternate 

20 indicator of stage IIIC). In stage IV, distant (i.e. non-peritoneal) metastases of the tumor 
can be detected. 

The durations of the various stages of ovarian cancer are not presently known, 
but are believed to be at least about a year each (Richart et ah, 1969, Am. J. Obstet. 
Gynecol 105:386). Prognosis declines with increasing stage designation. For example, 

25 5-year survival rates for patients diagnosed with stage I, II, III, and IV ovarian cancer 
are 80%, 57%, 25%, and 8%, respectively. 

Despite being the third most prevalent gynecological cancer, ovarian cancer is 
the leading cause of death among those afflicted with gynecological cancers. The 
disproportionate mortality of ovarian cancer is attributable to a substantial absence of 

30 symptoms among those afflicted with early-stage ovarian cancer and to difficulty 

diagnosing ovarian cancer at an early stage. Patients afflicted with ovarian cancer most 
often present with non-specific complaints, such as abnormal vaginal bleeding, 
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gastrointestinal symptoms, urinary tract symptoms, lower abdominal pain, and 
generalized abdominal distension. These patients rarely present with paraneoplastic 
symptoms or with symptoms which clearly indicate their affliction. Presently, less than 
about 40% of patients afflicted with ovarian cancer present with stage I or stage II. 
5 Management of ovarian cancer would be significantly enhanced if the disease could be 
detected at an earlier stage, when treatments are much more generally efficacious. 

Ovarian cancer may be diagnosed, in part, by collecting a routine medical history 
from a patient and by performing physical examination, x-ray examination, and 
chemical and hematological studies on the patient. Hematological tests which may be 

10 indicative of ovarian cancer in a patient include analyses of serum levels of proteins 
designated CA125 and DF3 and plasma levels of lysophosphatidic acid (LP A). 
Palpation of the ovaries and ultrasound techniques (particularly including endovaginal 
ultrasound and color Doppler flow ultrasound techniques) can aid detection of ovarian 
tumors and differentiation of ovarian cancer from benign ovarian cysts. However, a 

1 5 definitive diagnosis of ovarian cancer typically requires performing exploratory 
laparotomy of the patient. 

Potential tests for the detection of ovarian cancer (e.g., screening, reflex or 
monitoring) may be characterized by a number of factors. The "sensitivity" of an assay 
refers to the probability that the test will yield a positive result in an individual afflicted 

20 with ovarian cancer. The "specificity" of an assay refers to the probability that the test 
will yield a negative result in an individual not afflicted with ovarian cancer. The 
"positive predictive value" (PPV) of an assay is the ratio of true positive results (i.e. 
positive assay results for patients afflicted with ovarian cancer) to all positive results 
(i.e. positive assay results for patients afflicted with ovarian cancer + positive assay 

25 results for patients not afflicted with ovarian cancer). It has been estimated that in order 
for an assay to be an appropriate population-wide screening tool for ovarian cancer the 
assay must have a PPV of at least about 10% (Rosenthal et al, 1998, Sent. Oncol 
25:315-325). It would thus be desirable for a screening assay for detecting ovarian 
cancer in patients to have a high sensitivity and a high PPV. Monitoring and reflex tests 

30 would also require appropriate specifications. 
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Owing to the cost, limited sensitivity, and limited specificity of known methods 
of detecting ovarian cancer, screening is not presently performed for the general 
population. In addition, the need to perform laparotomy in order to diagnose ovarian 
cancer in patients who screen positive for indications of ovarian cancer limits the 
5 desirability of population-wide screening, such that a PPV even greater than 1 0% would 
be desirable. 

Prior use of serum CA125 level as a diagnostic marker for ovarian cancer 
indicated that this method exhibited insufficient specificity for use as a general 
screening method. Use of a refined algorithm for interpreting CA125 levels in serial 

1 0 retrospective samples obtained from patients improved the specificity of the method 
without shifting detection of ovarian cancer to an earlier stage (Skakes, 1995, Cancer 
76:2004). Screening for LPA to detect gynecological cancers including ovarian cancer 
exhibited a sensitivity of about 96% and a specificity of about 89%. However, CA125- 
based screening methods and LPA-based screening methods are hampered by the 

15 presence of CA125 and LPA, respectively, in the serum of patients afflicted with 

conditions other than ovarian cancer. For example, serum CA125 levels are known to 
be associated with menstruation, pregnancy, gastrointestinal and hepatic conditions such 
as colitis and cirrhosis, pericarditis, renal disease, and various non-ovarian malignancies. 
Serum LPA is known, for example, to be affected by the presence of non-ovarian 

20 gynecological malignancies. A screening method having a greater specificity for 

ovarian cancer than the current screening methods for CA125 and LPA could provide a 
population- wide screening for early stage ovarian cancer. 

Presently greater than about 60% of ovarian cancers diagnosed in patients are 
stage III or stage IV cancers. Treatment at these stages is largely limited to 

25 cytoreductive surgery (when feasible) and chemotherapy, both of which aim to slow the 
spread and development of metastasized tumor. Substantially all late stage ovarian 
cancer patients currently undergo combination chemotherapy as primary treatment, 
usually a combination of a platinum compound and a taxane. Median survival for 
responding patients is about one year. Combination chemotherapy involving agents 

30 such as doxorubicin, cyclophosphamide, cisplatin, hexamethylmelamine, paclitaxel, and 
methotrexate may improve survival rates in these groups, relative to single-agent 
therapies. Various recently-developed chemotherapeutic agents and treatment regimens 
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have also demonstrated usefulness for treatment of advanced ovarian cancer. For 
example, use of the topoisomerase I inhibitor topectan, use of amifostine to minimize 
chemotherapeutic side effects, and use of intraperitoneal chemotherapy for patients 
having peritoneally implanted tumors have demonstrated at least limited utility. 
5 Presently, however, the 5-year survival rate for patients afflicted with stage HI ovarian 
cancer is 25%, and the survival rate for patients afflicted with stage IV ovarian cancer is 
8%. 

In summary, the earlier ovarian cancer is detected, the aggressiveness of 
therapeutic intervention and the side effects associated with therapeutic intervention are 
10 minimized. More importantly, the earlier the cancer is detected, the survival rate and 
quality of life of ovarian cancer patients is enhanced. Thus, a pressing need exists for 
methods of detecting ovarian cancer as early as possible. There also exists a need for 
methods of detecting recurrence of ovarian cancer as well as methods for predicting and 
monitoring the efficacy of treatment. The present invention satisfies these needs. 

15 

SUMMARY OF THE INVENTION 
The invention relates to novel genes associated with ovarian cancer as well as 
methods of assessing whether a patient is afflicted with ovarian cancer. This method 
comprises the step of comparing the level of expression of a marker in a patientsample, 

20 wherein the marker is listed in Tables 1 -2, and the normal level of expression of the 
marker in a control, e.g., a sample from a patient without ovarian cancer. A significant 
difference between the level of expression of the marker in the patient sample and the 
normal level is an indication that the patient is afflicted with ovarian cancer. Preferably, 
a protein corresponding to the marker is a secreted protein. Alternatively, the marker 

25 can correspond to a protein having an extracellular portion, to one which is normally 
expressed in ovarian tissue at a detectable level, or both. 

In one method, the marker(s) are preferably selected such that the positive 
predictive value of the method is at least about 10%. Also preferred are embodiments of 
the method wherein the marker is over- or under-expressed by at least two-fold in at 

30 least about 20% of stage I ovarian cancer patients, stage II ovarian cancer patients, stage 
III ovarian cancer patients, stage IV ovarian cancer patients, grade I ovarian cancer 
patients, grade II ovarian cancer patients, grade III ovarian cancer patients, epithelial 
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ovarian cancer patients, stromal ovarian cancer patients, genn cell ovarian cancer 
patients, malignant ovarian cancer patients, benign ovarian patients, serous neoplasm 
ovarian cancer patients, mucinous neoplasm ovarian cancer patients, endometrioid 
neoplasm ovarian cancer patients and/or clear cell neoplasm ovarian cancer patients. 
5 In one embodiment of the methods of the present invention, the patient sample is 

an ovary-associated body fluid. Such fluids include, for example, blood fluids, lymph, 
ascitic fluids, gynecological fluids, cystic fluids, urine, and fluids collected by peritoneal 
rinsing. In another embodiment, the sample comprises cells obtained from the patient. • 
In this embodiment, the cells may be found in a fluid selected from the group consisting 

10 of a fluid collected by peritoneal rinsing, a fluid collected by uterine rinsing, a uterine 
fluid, a uterine exudate, a pleural fluid, and an ovarian exudate. In another embodiment, 
the patient sample is in vivo. 

In accordance with the methods of the present invention, the level of expression 
of the marker in a sample can be assessed, for example, by detecting the presence in the 

15 sample of: 

• a protein corresponding to the marker or fragment of the protein (e.g. 
using a reagent, such as an antibody, an antibody derivative, or an antibody 
fragment, which binds specifically with the protein) 

• a transcribed polynucleotide (e.g. an mRNA or a cDNA), or fragment 
20 thereof, having at least a portion with which the marker is substantially 

homologous (e.g. by contacting a mixture of transcribed polynucleotides 
obtained from the sample with a substrate having one or more of the markers 
listed in Tables 1-2 fixed thereto at selected positions) 

• a transcribed polynucleotide or fragment thereof, wherein the 

25 polynucleotide anneals with the marker under stringent hybridization 

conditions. 

• a metabolite which is produced directly (i.e., catalyzed) or indirectly by a 
protein corresponding to the marker 

The methods of the present invention are particularly useful for patients with an 
30 identified pelvic mass or symptoms associated with ovarian cancer. The methods of the 
present invention can also be of particular use with patients having an enhanced risk of 
developing ovarian cancer (e.g., patients having a familial history of ovarian cancer, 
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patients identified as having a mutant oncogene, and patients at least about 50 years of 
age). The methods of the present invention may further be of particular use in 
monitoring the efficacy of treatment of an ovarian cancer patient (e.g. the efficacy of 
chemotherapy). 

5 The methods of the present invention may be performed using a plurality (e.g. 2, 

3, 5, or 10 or more) of markers. According to a method involving a plurality of markers, 
the level of expression in the sample of each of a plurality of markers independently 
selected from the markers listed in Tables 1-2 is compared with the normal level of 
expression of each of the plurality of markers in samples of the same type obtained from 
1 0 control humans not afflicted with ovarian cancer. The markers of Tables 1 -2 may also 
be used in combination with known ovarian cancer markers in the methods of the 
present invention. 

In a preferred method of assessing whether a patient is afflicted with ovarian 
cancer (e.g., new detection ("screening"), detection of recurrence, reflex testing), the 
1 5 method comprises comparing: 

a) the level of expression of a marker in a patient sample, wherein at least 
one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control non-ovarian 
cancer sample. 

20 A significant difference between the level of expression of the marker in the patient 
sample and the normal level is an indication that the patient is afflicted with ovarian 
cancer. 

The methods of the present invention further include a method of assessing the 
efficacy of a test compound for inhibiting ovarian cancer in a patient This method 
25 comprises comparing: 

a) expression of a marker in a first sample obtained from the patient and 
maintained in the presence of the test compound, wherein the marker is 
selected from the group consisting of the markers listed in Tables 1-2, 
and 

30 b) expression of the marker in a second sample obtained from the patient 

and maintained in the absence of the test compound. 
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A significant difference between the level of expression of the marker in the first 
sample, relative to the second sample, is an indication that the test compound is 
efficacious for inhibiting ovarian cancer in the patient. For example, the first and 
second samples can be portions of a single sample obtained from the patient or portions 
5 of pooled samples obtained from the patient. 

The invention further relates to a method of assessing the efficacy of a therapy 
for inhibiting ovarian cancer in a patient. This method comprises comparing: 

a) expression of a marker in a first sample obtained from the patient prior 
to providing at least a portion of the therapy to the patient, wherein the 

10 marker is selected from the group consisting of the markers listed in 

Tables 1-2, and 

b) expression of the marker in a second sample obtained from the patient 
following provision of the portion of the therapy. 

A significant difference between the level of expression of the marker in the second 
1 5 sample, relative to the first sample, is an indication that the therapy is efficacious for 

inhibiting ovarian cancer in the patient. 

It will be appreciated that in these methods the "therapy" may be any traditional 

therapy for treating ovarian cancer including, but not limited to, chemotherapy, radiation 

therapy and surgical removal of tissue, e.g., an ovarian tumor. Thus, the methods of the 
20 invention may be used to evaluate a patient before, during and after thereapy, for 

example, to evaluate the reduction in tumor burden. 

The present invention therefore further comprises a method for monitoring the 

progression of ovarian cancer in a patient, the method comprising: 

a) detecting in a patient sample at a first time point, the expression of a marker, 
25 wherein the marker is selected from the group consisting of the markers listed in Tables 

1-2; 

b) repeating step a) at a subsequent time point in time; and 

c) comparing the level of expression detected in steps a) and b), and therefrom 
monitoring the progression of ovarian cancer in the patient. 

30 The invention also includes a method of selecting a composition for inhibiting 

ovarian cancer in a patient. This method comprises the steps of: 

a) obtaining a sample comprising cancer cells from the patient; 
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b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

5 d) selecting one of the test compositions which alters the level of 

expression of the marker in the aliquot containing that test composition, 
relative to other test compositions. 
In addition, the invention includes a method of inhibiting ovarian cancer in a 
patient. This method comprises the steps of: 
0 a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker listed in Tables 1-2 in each of the 
aliquots; and 

5 d) administering to the patient at least one of the test compositions which 

alters the level of expression of the marker in the aliquot containing that 
test composition, relative to other test compositions. 
The invention also includes a kit for assessing whether a patient is afflicted with 
ovarian cancer. This kit comprises reagents for assessing expression of a marker listed 
0 in Tables 1-2. 

In another aspect, the invention relates to a kit for assessing the suitability of 
each of a plurality of compounds for inhibiting an ovarian cancer in a patient. The kit 
comprises a reagent for assessing expression of a marker listed in Tables 1-2, and may 
also comprise a plurality of compounds. 

5 In another aspect, the invention relates to a kit for assessing the presence of 

ovarian cancer cells. This kit comprises an antibody, wherein the antibody binds 
specifically with a protein corresponding to a marker listed in Tables 1-2. The kit may 
also comprise a plurality of antibodies, wherein the plurality binds specifically with a 
protein corresponding to a different marker listed in Tables 1-2. 

0 The invention also includes a kit for assessing the presence of ovarian cancer 

cells, wherein the kit comprises a nucleic acid probe. The probe binds specifically with 
a transcribed polynucleotide corresponding to a marker listed in Tables 1-2. The kit 
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may also comprise a plurality of probes, wherein each of the probes binds specifically 
with a transcribed polynucleotide corresponding to a different marker listed in Tables 1- 
2. 

The invention further relates to a method of making an isolated hybridoma which 
5 produces an antibody useful for assessing whether a patient is afflicted with ovarian 
cancer. The method comprises isolating a protein corresponding to a marker listed in 
Tables 1-2, immunizing a mammal using the isolated protein, isolating splenocytes from 
the immunized mammal, fusing the isolated splenocytes with an immortalized cell line 
to form hybridomas, and screening individual hybridomas for production of an antibody 
10 which specifically binds with the protein to isolate the hybridoma. The invention also 
includes an antibody produced by this method. 

The invention further includes a method of assessing the ovarian carcinogenic 
potential of a test compound. This method comprises the steps of: 

a) maintaining separate aliquots of ovarian cells in the presence and 
1 5 absence of the test compound; and 

b) comparing expression of a marker in each of the aliquots. 
The marker is selected from those listed in Tables 1-2. A significantly altered 
level of expression of the marker in the aliquot maintained in the presence of (or 
exposed to) the test compound, relative to the aliquot maintained in the absence of the 
20 test compound, is an indication that the test compound possesses ovarian carcinogenic 
potential. 

Additionally, the invention includes a kit for assessing the ovarian carcinogenic 
potential of a test compound. The kit comprises ovarian cells and a reagent for 
assessing expression of a marker in each of the aliquots. The marker is selected from 

25 those listed in Tables 1-2. 

The invention further relates to a method of treating a patient afflicted with 
ovarian cancer or at risk of developing ovarian cancer. This method comprises 
enhancing expression of a marker listed in Tables 1-2 or providing to cells of the patient 
a protein corresponding to a marker listed in Tables 1-2, wherein the marker is 

30 underexpressed in patients afflicted with ovarian cancer. The protein can be provided to 
the cells, for example, by providing a vector comprising a polynucleotide encoding the 
protein to the cells. 
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The invention includes another method of treating a patient afflicted with ovarian 
cancer or at risk of developing ovarian cancer. This method comprises inhibiting 
expression or overexpression of a marker listed in Tables 1-2 by, e.g., providing to cells 
of the patient an antisense oligonucleotide complementary to a polynucleotide 
5 corresponding to a marker listed in Tables 1-2, wherein the marker is overexpressed in 
patients afflicted with ovarian cancer. 

It will be appreciated that the methods and kits of the present invention may also 
include known cancer markers including known ovarian cancer markers. It will further 
be appreciated that the methods and kits may be used to identify cancers other than 
10 ovarian cancer. 

DETAILED DESCRIPTION OF THE INVENTION 
The invention relates to newly discovered genes associated with the cancerous 
state of ovarian cells. It has been discovered that the level of expression of individual 
15 genes, also referred to as markers, and combinations of these genes, correlates with the 
presence of ovarian cancer in a patient. Methods are provided for detecting the presence 
of ovarian cancer in a sample, the absence of ovarian cancer in a sample, the stage of an 
ovarian cancer, and with other characteristics of ovarian cancer that are relevant to 
prevention, diagnosis, characterization, and therapy of ovarian cancer in a patient. 

20 

Definitions 

As used herein, each of the following terms has the meaning associated with it in 
this section. 

The articles "a" and "an" are used herein to refer to one or to more than one (i.e. 
25 to at least one) of the grammatical object of the article. By way of example, "an 
element" means one element or more than one element. 

A "marker" is a naturally-occurring polymer corresponding to at least one of the 
novel nucleic acids listed in Tables 1-2. For example, markers include, without 
limitation, sense and anti-sense strands of genomic DNA (i.e. including any introns 
30 occurring therein), RNA generated by transcription of genomic DNA (i.e. prior to 
splicing), RNA generated by splicing of RNA transcribed from genomic DNA, and 
proteins generated by translation of spliced RNA (i.e. including proteins both before and 
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after cleavage of normally cleaved regions such as transmembrane signal sequences). 
As used herein, "marker" may also include a cDNA made by reverse transcription of an 
RNA generated by transcription of genomic DNA (including spliced RNA). 

The term "probe" refers to any molecule which is capable of selectively binding 
5 to a specifically intended target molecule, for example a marker of the invention. 
Probes can be either synthesized by one skilled in the art, or derived from appropriate 
biological preparations. For purposes of detection of the target molecule, probes may be 
specifically designed to be labeled, as described herein. Examples of molecules that can 
be utilized as probes include, but are not limited to, RNA, DNA, proteins, antibodies, 
10 and organic monomers. 

An "ovary-associated" body fluid is a fluid which, when in the body of a patient, 
contacts or passes through ovarian cells or into which cells or proteins shed from 
ovarian cells e.g. ovarian epithelium, are capable of passing. Exemplary ovary- 
associated body fluids include blood fluids, lymph, ascites, gynecological fluids, cystic 
1 5 fluid, urine, and fluids collected by peritoneal rinsing. 

The "normal" level of expression of a marker is the level of expression of the 
marker in ovarian cells of a patient, e.g. a human, not afflicted with ovarian cancer. 

"Over-expression" and "under-expression" of a marker refer to expression of the 
marker of a patient at a greater or lesser level, respectively, than normal level of 
20 expression of the marker {e.g. at least two-fold greater or lesser level). 

As used herein, the term "promoter/regulatory sequence" means a nucleic acid 
sequence which is required for expression of a gene product operably linked to the 
promoter/regulatory sequence. In some instances, this sequence may be the core 
promoter sequence and in other instances, this sequence may also include an enhancer 
25 sequence and other regulatory elements which are required for expression of the gene 
product. The promoter/regulatory sequence may, for example, be one which expresses 
the gene product in a tissue-specific manner. 

A "constitutive" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
30 product to be produced in a living human cell under most or all physiological conditions 
of the cell. 
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An "inducible" promoter is a nucleotide sequence which, when operably linked 
with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only when an inducer which 
corresponds to the promoter is present in the cell. 
5 A "tissue-specific" promoter is a nucleotide sequence which, when operably 

linked with a polynucleotide which encodes or specifies a gene product, causes the gene 
product to be produced in a living human cell substantially only if the cell is a cell of the 
tissue type corresponding to the promoter. 

A "transcribed polynucleotide" is a polynucleotide (e.g. an RNA, a cDNA, or an 

1 0 analog of one of an RNA or cDNA) which is complementary to or homologous with all 
or a portion of a mature RNA made by transcription of a genomic DNA corresponding 
to a marker of the invention and normal post-transcriptional processing (e.g. splicing), if 
any, of the transcript. 

"Complementary" refers to the broad concept of sequence complementarity 

1 5 between regions of two nucleic acid strands or between two regions of the same nucleic 
acid strand. It is known that an adenine residue of a first nucleic acid region is capable 
of forming specific hydrogen bonds ("base pairing") with a residue of a second nucleic z 
acid region which is antiparallel to the first region if the residue is thymine or uracil. 
Similarly, it is known that a cytosine residue of a first nucleic acid strand is capable of - 

20 base pairing with a residue of a second nucleic acid strand which is antiparallel to the 
first strand if the residue is guanine. A first region of a nucleic acid is complementary to 
a second region of the same or a different nucleic acid if, when the two regions are 
arranged in an antiparallel fashion, at least one nucleotide residue of the first region is 
capable of base pairing with a residue of the second region. Preferably, the first region 

25 comprises a first portion and the second region comprises a second portion, whereby, 
when the first and second portions are arranged in an antiparallel fashion, at least about 
50%, and preferably at least about 75%, at least about 90%, or at least about 95% of the 
nucleotide residues of the first portion are capable of base pairing with nucleotide 
residues in the second portion. More preferably, all nucleotide residues of the first 

30 portion are capable of base pairing with nucleotide residues in the second portion. 



WO 01/070979 PCTAJS01/09126 

-14- 

"Homologous" as used herein, refers to nucleotide sequence similarity between 
two regions of the same nucleic acid strand or between regions of two different nucleic 
acid strands. When a nucleotide residue position in both regions is occupied by the 
same nucleotide residue, then the regions are homologous at that position. A first region 
5 is homologous to a second region if at least one nucleotide residue position of each 
region is occupied by the same residue. Homology between two regions is expressed in 
terms of the proportion of nucleotide residue positions of the two regions that are 
occupied by the same nucleotide residue. By way of example, a region having the 
nucleotide sequence 5'-ATTGCC-3 f and a region having the nucleotide sequence 5 1 - 

10 TATGGC-3' share 50% homology. Preferably, the first region comprises a first portion 
and the second region comprises a second portion, whereby, at least about 50%, and 
preferably at least about 75%, at least about 90%, or at least about 95% of the nucleotide 
residue positions of each of the portions are occupied by the same nucleotide residue. 
More preferably, all nucleotide residue positions of each of the portions are occupied by 

1 5 the same nucleotide residue . 

A marker is "fixed" to a substrate if it is covalently or non-covalently associated 
with the substrate such the substrate can be rinsed with a fluid (e.g. standard saline 
citrate, pH 7.4) without a substantial fraction of the marker dissociating from the 
substrate. 

20 As used herein, a "naturally-occurring" nucleic acid molecule refers to an RNA 

or DNA molecule having a nucleotide sequence that occurs in nature (e.g. encodes a 
natural protein). 

Expression of a marker in a patient is "significantly" higher or lower than the 
normal level of expression of a marker if the level of expression of the marker is greater 

25 or less, respectively, than the normal level by an amount greater than the standard error 
of the assay employed to assess expression, and preferably at least twice, and more 
preferably three, four, five or ten times that amount. Alternately, expression of the 
marker in the patient can be considered "significantly" higher or lower than the normal 
level of expression if the level of expression is at least about two, and preferably at least 

30 about three, four, or five times, higher or lower, respectively, than the normal level of 
expression of the marker. 
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Ovarian cancer is "inhibited" if at least one symptom of the cancer is alleviated, 
terminated, slowed, or prevented. As used herein, ovarian cancer is also "inhibited" if 
recurrence or metastasis of the cancer is reduced, slowed, delayed, or prevented. 

A kit is any manufacture (e.g. a package or container) comprising at least one 
5 reagent, e.g. a probe, for specifically detecting a marker of the invention, the 

manufacture being promoted, distributed, or sold as a unit for performing the methods of 
the present invention. 



Description 

1 0 The present invention is based, in part, on identification of novel markers which 

are over-expressed in ovarian cancer cells as compared to their expression in normal (i.e. 
non- cancerous) ovarian cells. The markers of the invention correspond to DNA, RNA, 
and polypeptide molecules which can be detected in one or both of normal and 
cancerous ovarian cells. The enhanced expression of one or more of these markers in 

1 5 ovarian cells is herein correlated with the cancerous state of the tissue. The invention 
thus includes compositions, kits, and methods for assessing the cancerous state of 
ovarian cells (e.g. cells obtained from a human, cultured human cells, archived or 
preserved human cells and in vivo cells). 

The compositions, kits, and methods of the invention have the following uses, 

20 among others: 

1) assessing whether a patient is afflicted with ovarian cancer; 

2) assessing the stage of ovarian cancer in a human patient; 

3) assessing the grade of ovarian cancer in a patient; 

4) assessing the benign or malignant nature of ovarian cancer in a patient; 
25 5) assessing the histological type of neoplasm (e.g. serous, mucinous, 

endometroid, or clear cell neoplasm) associated with ovarian 
cancer in a patient; 

6) making an isolated hybridoma which produces an antibody useful for 
assessing whether a patient is afflicted with ovarian cancer; 
30 7) assessing the presence of ovarian cancer cells; 

8) assessing the efficacy of one or more test compounds for inhibiting 
ovarian cancer in a patient; 
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9) assessing the efficacy of a therapy for inhibiting ovarian cancer in a 

patient; 

1 0) monitoring the progression of ovarian cancer in a patient; 

1 1 ) selecting a composition or therapy for inhibiting ovarian cancer in a 
5 patient; 

12) treating a patient afflicted with ovarian cancer; 

1 3) inhibiting ovarian cancer in a patient; 

14) assessing the ovarian carcinogenic potential of a test compound; 
and 

10 15) inhibiting an ovarian cancer in a patient at risk for developing 

ovarian cancer. 



The invention thus includes a method of assessing whether a patient is afflicted 
with ovarian cancer. This method comprises comparing the level of expression of a 

3 5 marker in a patient sample and the normal level of expression of the marker in a control, 
e.g., a non-ovarian cancer sample. A significant difference between the level of 
expression of the marker in the patient sample and the normal level is an indication that 
the patient is afflicted with ovarian cancer. The marker is selected from the group 
consisting of the markers listed in Tables 1-2. 

20 The polynucleotides set forth in Tables 1-2 represent previously unidentified 

nucleotide sequences. These nucleotide sequences were identified through subtracted 
library experiments described herein. Also provided by this invention are 
polynucleotides that correspond to the polynucleotides of Tables 1-2. In one 
embodiment, these polynucleotides are obtained by identification of a larger fragment or 

25 full-length coding sequence of these polynucleotides. Gene delivery vehicles, host cells, 
compositions and databases (all described herein) containing these polynucleotides are 
also provided by this invention. 

Any marker or combination of markers listed in Tables 1-2, as well as any 
known markers in combination with the markers set forth in Tables 1 -2, may be used in 

30 the compositions, kits, and methods of the present invention. In general, it is preferable 
to use markers for which the difference between the level of expression of the marker in 
ovarian cancer cells and the level of expression of the same marker in normal ovarian 
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cells is as great as possible. Although this difference can be as small as the limit of 
detection of the method for assessing expression of the marker, it is preferred that the 
difference be at least greater than the standard error of the assessment method, and 
preferably a difference of at least 2-, 3-, 4-, 5-, 6-, 7-, 8-, 9-, 10-, 15-, 20-, 25-, 100-, 
5 500-, 1 000-fold or greater. 

It is recognized that certain markers correspond to proteins which are secreted 
from ovarian cells (i.e. one or both of normal and cancerous cells) to the extracellular 
space surrounding the cells. These markers are preferably used in certain embodiments 
of the compositions, kits, and methods of the invention, owing to the fact that the protein 

10 corresponding to each of these markers can be detected in an ovary-associated body 
fluid sample, which may be more easily collected from a human patient than a tissue 
biopsy sample. In addition, preferred in vivo techniques for detection of a protein 
corresponding to a marker of the invention include introducing into a subject a labeled 
antibody directed against the protein. For example, the antibody can be labeled with a 

15 radioactive marker whose presence and location in a subject can be detected by standard 
imaging techniques. 

It is a simple matter for the skilled artisan to determine whether any particular 
marker corresponds to a secreted protein. In order to make this determination, the 
protein corresponding to a marker is expressed in a test cell (e.g. a cell of an ovarian cell 

20 line), extracellular fluid is collected, and the presence or absence of the protein in the 
extracellular fluid is assessed (e.g. using a labeled antibody which binds specifically 
with the protein). 

The following is an example of a method which can be used to detect secretion 
of a protein corresponding to a marker of the invention. About 8 x 10 5 293T cells are 

25 incubated at 37°C in wells containing growth medium (Dulbecco's modified Eagle's 
medium {DMEM} supplemented with 1 0% fetal bovine serum) under a 5% (v/v) C0 2 , 
95% air atmosphere to about 60-70% confluence. The cells are then transfected using a 
standard transfection mixture comprising 2 micrograms of DNA comprising an 
expression vector encoding the protein and 10 microliters of LipofectAMINE™ 

30 (GIBCO/BRL Catalog no. 1 8342-0 1 2) per well. The transfection mixture is maintained 
for about 5 hours, and then replaced with fresh growth medium and maintained in an air 
atmosphere. Each well is gently rinsed twice with DMEM which does not contain 
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methionine or cysteine (DMEM-MC; ICN Catalog no. 16-424- 54). About 1 milliliter 

35 

of DMEM-MC and about 50 rrucrocunes of Trans- S™ reagent (ICN Catalog no. 
5 1006) are added to each well. The wells are maintained under the 5% C0 2 atmosphere 
described above and incubated at 37°C for a selected period. Following incubation, 150 
5 microliters of conditioned medium is removed and centrifiiged to remove floating cells 
and debris. The presence of the protein in the supernatant is an indication that the 
protein is secreted. 

Examples of ovary-associated body fluids include blood fluids (e.g. whole blood, 
blood serum, blood having platelets removed therefrom, etc.), lymph, ascitic fluids, 

10 gynecological fluids {e.g. ovarian, fallopian, and uterine secretions, menses, vaginal 
douching fluids, fluids used to rinse cervical cell samples, etc.), cystic fluid, urine, and 
fluids collected by peritoneal rinsing (e.g. fluids applied and collected during 
laparoscopy or fluids instilled into and withdrawn from the peritoneal cavity of a human 
patient). In these embodiments, the level of expression of the. marker can be assessed by 

1 5 assessing the amount (e.g. absolute amount or concentration) of the marker in an ovary- * 
associated body fluid obtained from a patient. The fluid can, of course, be subjected to a 
variety of well-known post-collection preparative and storage techniques (e.g. storage, 
freezing, ultrafiltration, concentration, evaporation, centrifugation, etc.) prior to 
assessing the amount of the marker in the fluid. 

20 Many ovary-associated body fluids (i.e. usually excluding urine) can have 

ovarian cells, e.g. ovarian epithelium, therein, particularly when the ovarian cells are 
cancerous, and, more particularly, when the ovarian cancer is metastasizing. Cell- 
containing fluids which can contain ovarian cancer cells include, but are not limited to, 
peritoneal ascites, fluids collected by peritoneal rinsing, fluids collected by uterine 

25 rinsing, uterine fluids such as uterine exudate and menses, pleural fluid, and ovarian 
exudates. Thus, the compositions, kits, and methods of the invention can be used to 
detect expression of markers corresponding to proteins having at least one portion which 
is displayed on the surface of cells which express it. Examples of such proteins are 
indicated in the Tables herein. Although not every protein having at least one cell- 

30 surface portion is indicated in the Tables, it is a simple matter for the skilled artisan to 
determine whether the protein corresponding to any particular marker comprises a cell- 
surface protein. For example, immunological methods may be used to detect such 
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proteins on whole cells, or well known computer-based sequence analysis methods (e.g. 
the SIGNALP program; Nielsen et al, 1997, Protein Engineering 10:1-6) may be used 
to predict the presence of at least one extracellular domain (i.e. including both secreted 
proteins and proteins having at least one cell-surface domain). Expression of a marker 
5 corresponding to a protein having at least one portion which is displayed on the surface 
of a cell which expresses it may be detected without necessarily lysing the cell (e.g. 
using a labeled antibody which binds specifically with a cell-surface domain of the 
protein). 

Expression of a marker of the invention may be assessed by any of a wide 

10 variety of well known methods for detecting expression of a transcribed molecule or 
protein. Non-limiting examples of such methods include immunological methods for 
detection of secreted, cell-surface, cytoplasmic, or nuclear proteins, protein purification 
methods, protein function or activity assays, nucleic acid hybridization methods, nucleic 
acid reverse transcription methods, and nucleic acid amplification methods. 

15 In a preferred embodiment, expression of a marker is assessed using an antibody 

(e.g. a radio-labeled, chromophore-labeled, fluorophore-labeled, or enzyme-labeled 
antibody), an antibody derivative (e.g. an antibody conjugated with a substrate or with 
the protein or ligand of a protein-ligand pair {e.g. biotin-streptavidin} ), or an antibody 
fragment (e.g. a single-chain antibody, an isolated antibody hypervariable domain, etc.) 

20 which binds specifically with a protein corresponding to the marker, such as the protein 
encoded by the open reading frame corresponding to the marker or such a protein which 
has undergone all or a portion of its normal post-translational modification. 

In another preferred embodiment, expression of a marker is assessed by 
preparing mRNA/cDNA (i.e. a transcribed polynucleotide) from cells in a patient 

25 sample, and by hybridizing the mKNA/cDNA with a reference polynucleotide which is a 
complement of a polynucleotide comprising the marker, and fragments thereof. cDNA 
can, optionally, be amplified using any of a variety of polymerase chain reaction 
methods prior to hybridization with the reference polynucleotide; preferably, it is not 
amplified. Expression of one or more markers can likewise be detected using 

30 quantitative PCR to assess the level of expression of the markers). Alternatively, any 
of the many known methods of detecting mutations or variants (e.g. single nucleotide 
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polymorphisms, deletions, etc.) of a marker of the invention may be used to detect 
occurrence of a marker in a patient. 

In a related embodiment, a mixture of transcribed polynucleotides obtained from 
the sample is contacted with a substrate having fixed thereto a polynucleotide 
5 complementary to or homologous with at least a portion (e.g. at least 7, 10, 15, 20, 25, 
30, 40, 50, 100, 500, or more nucleotide residues) of a marker of the invention. If 
polynucleotides complementary to or homologous with are differentially detectable on 
the substrate (e.g. detectable using different chromophores or fluorophores, or fixed to 
different selected positions), then the levels of expression of a plurality of markers can 

10 be assessed simultaneously using a single substrate (e.g. a "gene chip" microarray of 
polynucleotides fixed at selected positions). When a method of assessing marker 
expression is used which involves hybridization of one nucleic acid with another, it is 
preferred that the hybridization be performed under stringent hybridization conditions. 

Because the compositions, kits, and methods of the invention rely on detection of 

1 5 a difference in expression levels of one or more markers of the invention, it is preferable 
that the level of expression of the marker is significantly greater than the minimum 
detection limit of the method used to assess expression in at least one of normal ovarian 
cells and cancerous ovarian cells. 

It is understood that by routine screening of additional patient samples using one 

20 or more of the markers of the invention, it will be realized that certain of the markers are 
over- or under-expressed in cancers of various types, including specific ovarian cancers, 
as well as other cancers such as breast cancer, cervical cancer, etc. For example, it will 
be confirmed that some of the markers of the invention are over- or under-expressed in 
most (i.e. 50% or more) or substantially all (i.e. 80% or more) of ovarian cancer. 

25 Furthermore, it will be confirmed that certain of the markers of the invention are 
associated with ovarian cancer of various stages (i.e. stage I, II, III, and IV ovarian 
cancers, as well as subclassifications IA, IB, IC, IIA, IIB, IIC, IIIA, IIIB, and IIIC, using 
the FIGO Stage Grouping system for primary carcinoma of the ovary; 1987, Am. J. 
Obstet. Gynecol 156:236), of various histologic subtypes (e.g. serous, mucinous, 

30 endometroid, and clear cell subtypes, as well as subclassifications and alternate 
classifications adenocarcinoma, papillary adenocarcinoma, papillary 
cystadenocarcinoma, surface papillary carcinoma, malignant adenofibroma, 
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cystadenofibroma, adenocarcinoma, cystadenocarcinoma, adenoacanthoma, 
endometrioid stromal sarcoma, mesodermal (MUllerian) mixed tumor, mesonephroid 
tumor, malignant carcinoma, Brenner tumor, mixed epithelial tumor, and 
undifferentiated carcinoma, using the WHO/FIGO system for classification of malignant 
5 ovarian tumors; Scully, Atlas of Tumor Pathology, 3d series, Washington DC), and 
various grades (i.e. grade I {well differentiated} , grade II {moderately well 
differentiated}, and grade III {poorly differentiated from surrounding normal tissue} ). 
In addition, as a greater number of patient samples are assessed for expression of the 
markers of the invention and the outcomes of the individual patients from whom the 

10 samples were obtained are correlated, it will also be confirmed that altered expression of 
certain of the markers of the invention are strongly correlated with malignant cancers 
and that altered expression of other markers of the invention are strongly correlated with 
benign tumors. The compositions, kits, and methods of the invention are thus useful for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 

1 5 nature of ovarian cancer in patients. In addition, these compositions, kits, and methods 
can be used to detect and differentiate epithelial, stromal, and germ cell ovarian cancers. 

When the compositions, kits, and methods of the invention are used for 
characterizing one or more of the stage, grade, histological type, and benign/malignant 
nature of ovarian cancer in a patient, it is preferred that the marker or panel of markers 

20 of the invention is selected such that a positive result is obtained in at least about 20%, 
and preferably at least about 40%, 60%, or 80%, and more preferably in substantially all 
patients afflicted with an ovarian cancer of the corresponding stage, grade, histological 
type, or benign/malignant nature. Preferably, the marker or panel of markers of the 
invention is selected such that a PPV of greater than about 10% is obtained for the 

25 general population (more preferably coupled with an assay specificity greater than 
99.5%). 

When a plurality of markers of the invention are used in the compositions, kits, 
and methods of the invention, the level of expression of each marker in a patient sample 
can be compared with the normal level of expression of each of the plurality of markers 
30 in non-cancerous samples of the same type, either in a single reaction mixture (i.e. using 
reagents, such as different fluorescent probes, for each marker) or in individual reaction 
mixtures corresponding to one or more of the markers. In one embodiment, a 
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significantly enhanced level of expression of more than one of the plurality of markers 
in the sample, relative to the corresponding normal levels, is an indication that the 
patient is afflicted with ovarian cancer. In another embodiment, a significantly lower 
level of expression in the sample of each of the plurality of markers, relative to the 
5 corresponding normal levels, is an indication that the patient is afflicted with ovarian 
cancer. In yet another embodiment, a significantly enhanced level of expression of one 
or more marks and a significantly lower level of expression of one or more markers in a 
sample relative to the corresponding normal levels, is an indication that the patient is 
afflicted with ovarian cancer. When a plurality of markers is used, it is preferred that 2, 

10 3, 4, 5, 8, 10, 12, 15, 20, 30, or 50 or more individual markers be used, wherein fewer 
markers are preferred. 

In order to maximize the sensitivity of the compositions, kits, and methods of the 
invention (Le. by interference attributable to cells of non-ovarian origin in a patient 
sample), it is preferable that the marker of the invention used therein be a marker which 

15 has a restricted tissue distribution, e.g., normally not expressed in a non-epithelial tissue, 
and more preferably a marker which is normally not expressed in a non-ovarian tissue. 

Only a small number of markers are known to be associated with ovarian cancers 
(e.g. AKT2, Ki-RAS, ERBB2, c-MYC, RBI, and TP53; Lynch, supra). These markers are 
not, of course, included among the markers of the invention, although they may be used 

20 together with one or more markers of the invention in a panel of markers, for example. 
It is well known that certain types of genes, such as oncogenes, tumor suppressor genes, 
growth factor-like genes, protease-like genes, and protein kinase-like genes are often 
involved with development of cancers of various types. Thus, among the markers of the 
invention, use of those which correspond to proteins which resemble known proteins 

25 encoded by known oncogenes and tumor suppressor genes, and those which correspond 
to proteins which resemble growth factors, proteases, and protein kinases are preferred. 

Known oncogenes and tumor suppressor genes include, for example, abl, abr, 
akt2, ape, bcUa, bcl2J3, bcl3, bcr, brcal, brca2, cbl, ccndl, cdc42, cdk4, crk- II, 
csflrlfins, dbl, dec, dpc4lsmad4, e-cad, e2fllrbap, egfr/erbb-1, elkl, elk3, eph, erg, etsl, 

30 ets2,fer,f^lsrc2,jlillergb2,fos,fpslfes,fral,fra2,fyn, hck, hek, her2/erbb- 2/neu, 
her3/erbb-3, her4lerbb-4, hrasl, hst2, hstfl, igfbp2, ink4a, ink4b, int2lfgf3,jun,junb, 
jund, kip2, kit, kras2a, kras2b, Ick, lyn, mas, max, mcc, mdm2 l met, mlhl, mmplO, mos, 
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msh2, msh3, msh6, myb, myba, mybb, myc, mycll, mycn, nfl, nf2, nme2, nras 9 p53 9 
pdgfb,phb, piml,pmsl,pms2,ptc,pten, rafl, rapla, rbl, rel, ret, rosl, ski, srcl, tall, 
tgfbr2, tgfb3, tgfbr3 t thral, thrb, tiaml, timp3 ) tjpl, tp53 t trk, vav, vhl, vil2, wafl, wntl, 
wnt2, wtl, and yesl (Hesketh, 1997, In: The Oncogene and Tumour Suppressor Gene 
5 Facts Book, 2nd Ed., Academic Press; Fishel et al, 1994, Science 266:1403-1405). 

Known growth factors include platelet-derived growth factor alpha, platelet- 
derived growth factor beta (simian sarcoma viral {v-sis} oncogene homolog), 
thrombopoietin (myeloproliferative leukemia virus oncogene ligand, megakaryocyte 
growth and development factor), erythropoietin, B cell growth factor, macrophage 

10 stimulating factor 1 (hepatocyte growth factor-like protein), hepatocyte growth factor 
(hepapoietin A), insulin-like growth factor 1 (somatomedia C), hepatoma-derived 
growth factor, amphiregulin (schwannoma-derived growth factor), bone morphogenetic 
proteins 1, 2, 3, 3 beta, and 4, bone morphogenetic protein 7 (osteogenic protein 1), bone 
morphogenetic protein 8 (osteogenic protein 2), connective tissue growth factor, 

15 connective tissue activation peptide 3, epidermal growth factor (EGF), teratocarcinoma- 
derived growth factor 1, endothelin, endothelin 2, endothelin 3, stromal cell-derived 
factor 1 , vascular endothelial growth factor (VEGF), VEGF-B, VEGF-C, placental 
growth factor (vascular endothelial growth factor-related protein), transforming growth . * 
factor alpha, transforming growth factor beta 1 and its precursors, transforming growth 

20 factor beta 2 and its precursors, fibroblast growth factor 1 (acidic), fibroblast growth 
factor 2 (basic), fibroblast growth factor 5 and its precursors, fibroblast growth factor 6 
and its precursors, fibroblast growth factor 7 (keratinocyte growth factor), fibroblast 
growth factor 8 (androgen-induced), fibroblast growth factor 9 (glia-activating factor), 
pleiotrophin (heparin binding growth factor 8, neurite growth-promoting factor 1), 

25 brain-derived neurotrophic factor, and recombinant glial growth factor 2. 

Known proteases include interleukin-1 beta convertase and its precursors, Mch6 
and its precursors, Mch2 isoform alpha, Mch4, Cpp32 isoform alpha, Lice2 gamma 
cysteine protease, Ich-lS, Ich-IL, Ich-2 and its precursors, TY protease, matrix 
metalloproteinase 1 (interstitial collagenase), matrix metalloproteinase 2 (gelatinase A, 

30 72kD gelatinase, 72kD type IV collagenase), matrix metalloproteinase 7 (matrilysin), 
matrix metalloproteinase 8 (neutrophil collagenase), matrix metalloproteinase 12 
(macrophage elastase), matrix metalloproteinase 13 (collagenase 3), metallopeptidase 1, 
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cysteine-rich metalloprotease (disintegrin) and its precursors, subtilisin-like protease Pc8 
and its precursors, chymotrypsin, snake venom-like protease, cathepsin 1, cathepsin D 
(lysosomal aspartyl protease), stromelysin, aminopeptidase N, plasminogen, tissue 
plasminogen activator, plasminogen activator inhibitor type II, and urokinase-type 
5 plasminogen activator. 

Known protein kinases include DAP kinase, serine/threonine protein kinases 
NIK, PK428, Krs-2, SAK, and EMK, interferon-inducible double stranded RNA 
dependent protein kinase, FAST kinase, AIM1, IPLl-like midbody-associated protein 
kinase- 1, NIMA-like protein kinase 1 (NLK1), the cyclin-dependent kinases (cdkl-10), 

1 0 checkpoint kinase Chkl , Nek3 protein kinase, BMK1 beta kinase, Clkl , Clk2, Clk3, 
extracellular signal-regulated kinases 1, 3, and 6, cdc28 protein kinase 1, cdc28 protein 
kinase 2, pLK, Mytl, c-Jun N-terminal kinase 2, Cam kinase 1, the MAP kinases, 
insulin-stimulated protein kinase 1, beta-adrenergic receptor kinase 2, ribosomal protein 
S6 kinase, kinase suppressor of ras-1 (KSR1), putative serine/threonine protein kinase 

15 Prk, PkB kinase, cAMP-dependent protein kinase, cGMP-dependent protein kinase, type 
II cGMP-dependent protein kinase, protein kinases Dyrk2, Dyrk3, and Dyrk4, Rho- 
associated coiled-coil containing protein kinase pl60ROCK, protein tyrosine kinase t- 
Rorl, Ste20-related kinases, cell adhesion kinase beta, protein kinase 3, stress-activated 
protein kinase 4, protein kinase Zpk, serine kinase hPAK65, dual specificity mitogen- 

20 activated protein kinases 1 and 2, casein kinase I gamma 2, p21 -activated protein kinase 
Pakl, lipid-activated protein kinase PRK2, focal adhesion kinase, dual-specificity 
tyrosine-phosphorylation regulated kinase, myosin light chain kinase, serine kinases 
SRPK2, TESK1, and VRK2, B lymphocyte serine/threonine protein kinase, stress- 
activated protein kinases JNK1 and JNK2, phosphorylase kinase, protein tyrosine kinase 

25 Tec, Jak2 kinase, protein kinase Ndr, MEK kinase 3, SHB adaptor protein (a Src 
homology. 2 protein), agammagiobulinaemia protein-tyrosine kinase (Atk), protein 
kinase ATR, guanylate kinase 1, thrombopoeitin receptor and its precursors, DAG 
kinase epsilon, and kinases encoded by oncogenes or viral oncogenes such as v-fgr 
(Gardner-Rasheed), v-abl (Abelson murine leukemia viral oncogene homolog 1), v-arg 

30 (Abelson murine leukemia viral oncogene homolog, Abelson-related gene), v-fes and v- 
fps (feline sarcoma viral oncogene and Fujinami avian sarcoma viral oncogene 
homologs), proto-oncogene o-cot, oncogene pim-\, and oncogene masl. 
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It is recognized that the compositions, kits, and methods of the invention will be 
of particular utility to patients having an enhanced risk of developing ovarian cancer and 
their medical advisors. Patients recognized as having an enhanced risk of developing 
ovarian cancer include, for example, patients having a familial history of ovarian cancer, 
5 patients identified as having a mutant oncogene (i.e. at least one allele), and patients of 
advancing age (i.e. women older than about 50 or 60 years). 

The level of expression of a marker in normal (i.e. non-cancerous) human 
ovarian tissue can be assessed in a variety of ways. In one embodiment, this normal 
level of expression is assessed by assessing the level of expression of the marker in a 

10 portion of ovarian cells which appears to be non-cancerous and by comparing this 
normal level of expression with the level of expression in a portion of the ovarian cells 
which is suspected of being cancerous. For example, when laparoscopy or other 
medical procedure, reveals the presence of a lump on one portion of a patient's ovary, 
but not on another portion of the same ovary or on the other ovary, the normal level of 

15 expression of a marker may be assessed using one or both or the non-affected ovary and 
a non-affected portion of the affected ovary, and this normal level of expression may be 
compared with the level of expression of the same marker in an affected portion (i.e. the 
lump) of the affected ovary. Alternately, and particularly as further information 
becomes available as a result of routine performance of the methods described herein, 

20 population-average values for normal expression of the markers of the invention may be 
used. In other embodiments, the 'normal 1 level of expression of a marker may be 
determined by Assessing expression of the marker in a patient sample obtained from a 
non-cancer-afflicted patient, from a patient sample obtained from a patient before the 
suspected onset of ovarian cancer in the patient, from archived patient samples, and the 

25 like. 

The invention includes compositions, kits, and methods for assessing the 
presence of ovarian cancer cells in a sample (e.g. an archived tissue sample or a sample 
obtained from a patient). These compositions, kits, and methods are substantially the 
same as those described above, except that, where necessary, the compositions, kits, and 
30 methods are adapted for use with samples other than patient samples. For example, 
when the sample to be used is a parafinized, archived human tissue sample, it can be 
necessary to adjust the ratio of compounds in the compositions of the invention, in the 
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kits of the invention, or the methods used to assess levels of marker expression in the 
sample. Such methods are well known in the art and within the skill of the ordinary 
artisan. 

The invention includes a kit for assessing the presence of ovarian cancer cells 
5 (e.g. in a sample such as a patient sample). The kit comprises a plurality of reagents, 
each of which is capable of binding specifically with a nucleic acid or polypeptide 
corresponding to a marker of the invention. Suitable reagents for binding with a 
polypeptide corresponding to a marker of the invention include antibodies, antibody 
derivatives, antibody fragments, and the like. Suitable reagents for binding with a 
10 nucleic acid (e.g. a genomic DNA, an mRNA, a spliced mRNA, a cDNA, or the like) 
include complementary nucleic acids. For example, the nucleic acid reagents may 
include oligonucleotides (labeled or non-labeled) fixed to a substrate, labeled 
oligonucleotides not bound with a substrate, pairs of PCR primers, molecular beacon 
probes, and the like. 

1 5 The kit of the invention may optionally comprise additional components useful 

for performing the methods of the invention. By way of example, the kit may comprise 
fluids (e.g. SSC buffer) suitable for annealing complementary nucleic acids or for 
binding an antibody with a protein with which it specifically binds, one or more sample 
compartments, an instructional material which describes performance of a method of the 

20 invention, a sample of normal ovarian cells, a sample of ovarian cancer cells, and the 



The invention also includes a method of making an isolated hybridoma which 
produces an antibody useful for assessing whether patient is afflicted with an ovarian 
cancer. In this method, a protein corresponding to a marker of the invention is isolated 

25 (e.g. by purification from a cell in which it is expressed or by transcription and 
translation of a nucleic acid encoding the protein in vivo or in vitro using known 
methods). A vertebrate, preferably a mammal such as a mouse, rat, rabbit, or sheep, is 
immunized using the isolated protein. The vertebrate may optionally (and preferably) be 
immunized at least one additional time with the isolated protein, so that the vertebrate 

30 exhibits a robust immune response to the protein. Splenocytes are isolated from the 
immunized vertebrate and fused with an immortalized cell line to form hybridomas, 
using any of a variety of methods well known in the art. Hybridomas formed in this 



like. 
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manner are then screened using standard methods to identify one or more hybridomas 
which produce an antibody which specifically binds with the protein. The invention 
also includes hybridomas made by this method and antibodies made using such 
•hybridomas. 

5 The invention also includes a method of assessing the efficacy of a test 

compound for inhibiting ovarian cancer cells. As described above, differences in the 
level of expression of the markers of the invention correlate with the cancerous state of 
ovarian cells. Although it is recognized that changes in the levels of expression of 
certain of the markers of the invention likely result from the cancerous state of ovarian 

1 0 cells, it is likewise recognized that changes in the levels of expression of other of the 
markers of the invention induce, maintain, and promote the cancerous state of those 
cells. Thus, compounds which inhibit an ovarian cancer in a patient will cause the level 
of expression of one or more of the markers of the invention to change to a level nearer 
the normal level of expression for that marker (i.e. the level of expression for the marker 

1 5 in non-cancerous ovarian cells). 

This method thus comprises comparing expression of a marker in a first ovarian 
cell sample and maintained in the presence of the test compound and expression of the 
marker in a second ovarian cell sample and maintained in the absence of the test 
compound. A significant alteration in the level of expression of a marker listed in 

20 Tables 1-2 is an indication that the test compound inhibits ovarian cancer. The ovarian 
cell samples may, for example, be aliquots of a single sample of normal ovarian cells 
obtained from a patient, pooled samples of normal ovarian cells obtained from a patient, 
cells of a normal ovarian cell line, aliquots of a single sample of ovarian cancer cells 
obtained from a patient, pooled samples of ovarian cancer cells obtained from a patient, 

25 cells of an ovarian cancer cell line, or the like. In one embodiment, the samples are 
ovarian cancer cells obtained from a patient and a plurality of compounds known to be 
effective for inhibiting various ovarian cancers are tested in order to identify the 
compound which is likely to best inhibit the ovarian cancer in the patient. 

This method may likewise be used to assess the efficacy of a therapy for 

30 inhibiting ovarian cancer in a patient. In this method, the level of expression of one or 
more markers of the invention in a pair of samples (one subjected to the therapy, the 
other not subjected to the therapy) is assessed. As with the method of assessing the 
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efBcacy of test compounds, if the therapy induces a significant alteration in the level of 
expression of a marker listed in Tables 1-2 then the therapy is efficacious for inhibiting 
ovarian cancer. As above, if samples from a selected patient are used in this method, 
then alternative therapies can be assessed in vitro in order to select a therapy most likely 
5 to be efficacious for inhibiting ovarian cancer in the patient. 

As described herein, ovarian cancer in patients is associated with an altereration 
in the level of expression of one or more markers listed in Tables 1-2. While, as 
discussed above, some of these changes in expression level result from occurrence of the 
ovarian cancer, others of these changes induce, maintain, and promote the cancerous 

10 state of ovarian cancer cells. Thus, ovarian cancer characterized by an increase in the 
level of expression of one or more markers listed in either or both of Tables 1-2 can be 
inhibited by inhibiting expression of those markers. 

Expression of a marker listed in Tables 1-2 can be inhibited in a number of ways 
generally known in the art. For example, an antisense oligonucleotide can be provided 

15 to the ovarian cancer cells in order to inhibit transcription, translation, or both, of the 
marker(s). Alternately, a polynucleotide encoding an antibody, an antibody derivative, 
or an antibody fragment, and operably linked with an appropriate promoter/regulator 
region, can be provided to the cell in order to generate intracellular antibodies which 
will inhibit the function or activity of the protein corresponding to the marker(s). Using 

20 the methods described herein, a variety of molecules, particularly including molecules 
sufficiently small that they are able to cross the cell membrane, can be screened in order 
to identify molecules which inhibit expression of the marker(s). The compound so 
identified can be provided to the patient in order to inhibit expression of the marker(s) in 
the ovarian cancer cells of the patient. 

25 Expression of a marker listed in Tables 1-2 can be enhanced in a number of ways 

generally known in the art. For example, a polynucleotide encoding the marker and 
operably linked with an appropriate promoter/regulator region can be provided to 
ovarian cancer cells of the patient in order to induce enhanced expression of the protein 
(and mRNA) corresponding to the marker therein. Alternatively, if the protein is 

30 capable of crossing the cell membrane, inserting itself in the cell membrane, or is 

normally a secreted protein, then expression of the protein can be enhanced by providing 




WO 01/070979 PCTAJS01/09126 

-29- 

the protein (e.g. directly or by way of the bloodstream or another ovary-associated fluid) 
to ovarian cancer cells in the patient. 

As described above, the cancerous state of human ovarian cells is correlated with 
changes in the levels of expression of the markers of the invention. The invention 
5 includes a method for assessing the human ovarian cell carcinogenic potential of a test 
compound. This method comprises maintaining separate aliquots of human ovarian 
cells in the presence and absence of the test compound. Expression of a marker of the 
invention in each of the aliquots is compared. A significant alteration in the level of 
expression of a marker listed in Tables 1-2 in the aliquot maintained in the presence of 

10 the test compound (relative to the aliquot maintained in the absence of the test 
compound) is an indication that the test compound possesses human ovarian cell 
carcinogenic potential. The relative carcinogenic potentials of various test compounds 
can be assessed by comparing the degree of enhancement or inhibition of the level of 
expression of the relevant markers, by comparing the number of markers for which the 

1 5 level of expression is enhanced or inhibited, or by comparing both. 

Various aspects of the invention are described in further detail in the following 
subsections. 

I. Isolated Nucleic Acid Molecules 

20 One aspect of the invention pertains to novel isolated nucleic acid molecules that 

correspond to a marker of the invention, including nucleic acids which encode a 
polypeptide corresponding to a marker of the invention or a portion of such a 
polypeptide. Isolated nucleic acids of the invention also include nucleic acid molecules 
sufficient for use as hybridization probes to identify nucleic acid molecules that 

25 correspond to a marker of the invention, including nucleic acids which encode a 

polypeptide corresponding to a marker of the invention, and fragments of such nucleic 
acid molecules, e.g., those suitable for use as PCR primers for the amplification or 
mutation of nucleic acid molecules. As used herein, the term "nucleic acid molecule" is 
intended to include DNA molecules (e.g., cDNA or genomic DNA) and RNA molecules 

30 (e.g., mKNA) and analogs of the DNA or RNA generated using nucleotide analogs. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is 
double-stranded DNA. 
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An "isolated" nucleic acid molecule is one which is separated from other nucleic 
acid molecules which are present in the natural source of the nucleic acid molecule. 
Preferably, an "isolated" nucleic acid molecule is free of sequences (preferably protein- 
encoding sequences) which naturally flank the nucleic acid sequences located at the 
5 5 ! and 3' ends of the nucleic acid) in the genomic DNA of the organism from which the 
nucleic acid is derived. For example, in various embodiments, the isolated nucleic acid 
molecule can contain less than about 5 kB, 4 kB, 3 kB, 2 kB, 1 kB, 0.5 kB or 0.1 kB of 
nucleotide sequences which naturally flank the nucleic acid molecule in genomic DNA 
of the cell from which the nucleic acid is derived. Moreover, an "isolated" nucleic acid 

10 molecule, such as a cDNA molecule, can be substantially free of other cellular material, 
or culture medium when produced by recombinant techniques, or substantially free of 
chemical precursors or other chemicals when chemically synthesized. 

A nucleic acid molecule of the present invention can be isolated using standard 
molecular biology techniques and the sequence information in the database records 

1 5 described herein. Using all or a portion of such nucleic acid sequences, nucleic acid 
molecules of the invention can be isolated using standard hybridization and cloning 
techniques (e.g., as described in Sambrook et at., ed., Molecular Cloning: A Laboratory 
Manual, 2nd ed, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
1989). 

20 A nucleic acid molecule of the invention can be amplified using cDNA, mRNA, 

or genomic DNA as a template and appropriate oligonucleotide primers according to 
standard PCR amplification techniques. The nucleic acid so amplified can be cloned 
into an appropriate vector and characterized by DNA sequence analysis. Furthermore, 
oligonucleotides corresponding to all or a portion of a nucleic acid molecule of the 

25 invention can be prepared by standard synthetic techniques, e.g., using an automated 
DNA synthesizer. 

In another preferred embodiment, an isolated nucleic acid molecule of the 
invention comprises a nucleic acid molecule which has a nucleotide sequence 
complementary to the nucleotide sequence of a nucleic acid corresponding to a marker 
30 of the invention or to the nucleotide sequence of a nucleic acid encoding a protein which 
corresponds to a marker of the invention. A nucleic acid molecule which is 
complementary to a given nucleotide sequence is one which is sufficiently 
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complementary to the given nucleotide sequence that it can hybridize to the given 
nucleotide sequence thereby forming a stable duplex. 

Moreover, a nucleic acid molecule of the invention can comprise only a portion 
•of a nucleic acid sequence, wherein the full length nucleic acid sequence comprises a 
5 marker of the invention or which encodes a polypeptide corresponding to a marker of 
the invention. Such nucleic acids can be used, for example, as a probe or primer. The 
probe/primer typically is used as one or more substantially purified oligonucleotides. 
The oligonucleotide typically comprises a region of nucleotide sequence that hybridizes 
under stringent conditions to at least about 7, preferably about 15, more preferably about 

10 25, 50, 75, 100, 125, 150, 175, 200, 250, 300, 350, or 400 or more consecutive 
nucleotides of a nucleic acid of the invention. 

Probes based on the sequence of a nucleic acid molecule of the invention can be 
used to detect transcripts or genomic sequences corresponding to one or more markers 
of the invention. The probe comprises a label group attached thereto, e.g., a 

15 radioisotope, a fluorescent compound, an enzyme, or an enzyme co-factor. Such probes 
can be used as part of a diagnostic test kit for identifying cells or tissues which mis- 
express the protein, such as by measuring levels of a nucleic acid molecule encoding the 
protein in a sample of cells from a subject, e.g., detecting mRNA levels or determining 
whether a gene encoding the protein has been mutated or deleted. 

20 The invention further encompasses nucleic acid molecules that differ, due to 

degeneracy of the genetic code, from the nucleotide sequence of nucleic acids encoding 
a protein which corresponds to a marker of the invention, and thus encode the same 
protein. 



25 polymorphisms that lead to changes in the amino acid sequence can exist within a 

population (e.g. , the human population). Such genetic polymorphisms can exist among 
individuals within a population due to natural allelic variation. An allele is one of a 
group of genes which occur alternatively at a given genetic locus. In addition, it will be 
appreciated that DNA polymorphisms that affect RNA expression levels can also exist 

30 that may affect the overall expression level of that gene (e.g., by affecting regulation or 
degradation). 



It will be appreciated by those skilled in the art that DNA sequence 
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As used herein, the phrase "allelic variant" refers to a nucleotide sequence which 
occurs at a given locus or to a polypeptide encoded by the nucleotide sequence. 

As used herein, the terms "gene" and "recombinant gene" refer to nucleic acid 
molecules comprising an open reading frame encoding a polypeptide corresponding to a 
5 marker of the invention. Such natural allelic variations can typically result in 1-5% 
variance in the nucleotide sequence of a given gene. Alternative alleles can be identified 
by sequencing the gene of interest in a number of different individuals. This can be 
readily carried out by using hybridization probes to identify the same genetic locus in a 
variety of individuals. Any and all such nucleotide variations and resulting amino acid 

1 0 polymorphisms or variations that are the result of natural allelic variation and that do not 
alter the functional activity are intended to be within the scope of the invention. 

In another embodiment, an isolated nucleic acid molecule of the invention is at 
least 7, 15, 20, 25, 30, 40, 60, 80, 100, 150, 200, 250, 300, 350, 400, 450, 550, 650, 700, 
800, 900, 1000, 1200, 1400, 1600, 1800, 2000, 2200, 2400, 2600, 2800, 3000, 3500, 

15 4000, 4500, or more nucleotides in length and hybridizes under stringent conditions to a 
nucleic acid corresponding to a marker of the invention or to a nucleic acid encoding a 
protein corresponding to a marker of the invention. As used herein, the term "hybridizes 
under stringent conditions" is intended to describe conditions for hybridization and 
washing under which nucleotide sequences at least 75% (80%, 85%, preferably 90%) 

20 identical to each other typically remain hybridized to each other. Such stringent 

conditions are known to those skilled in the art and can be found in sections 6.3.1-6.3.6 
of Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989). A 
preferred, non-limiting example of stringent hybridization conditions for annealing two 
single-stranded DNA each of which is at least about 100 bases in length and/or for 

25 annealing a single-stranded DNA and a single-stranded RNA each of which is at least 
about 100 bases in length, are hybridization in 6X sodium chloride/sodium citrate (SSC) 
at about 45°C, followed by one or more washes in 0.2X SSC, 0.1% SDS at 50-65°C. 
Further preferred hybridization conditions are taught in Lockhart, et al., Nature 
Biotechnology, Volume 14, 1996 August: 1675- 1680; Breslauer, et al, Proc. Natl. Acad. 

30 Sci. USA, Volume 83, 1 986 June: 3746-3750; Van Ness, et al, Nucleic Acids Research, 
Volume 19, No. 19, 1991 September: 5143-5151; McGraw,^ al, BioTechniques, 
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Volume 8, No. 6 1990: 674-678; and Milner, et al, Nature Biotechnology, Volume 15, 
1997 June: 537-541, all expressly incorporated by reference. 

In addition to naturally-occurring allelic variants of a nucleic acid molecule of 
the invention that can exist in the population, the skilled artisan will further appreciate 
5 that sequence changes can be introduced by mutation thereby leading to changes in the 
amino acid sequence of the encoded protein, without altering the biological activity of 
the protein encoded thereby. For example, one can make nucleotide substitutions 
leading to amino acid substitutions at "non-essential" amino acid residues. A "non- 
essential" amino acid residue is a residue that can be altered from the wild-type 

10 sequence without altering the biological activity, whereas an "essential" amino acid 
residue is required for biological activity. For example, amino acid residues that are not 
conserved or only semi-conserved among homologs of various species may be non- 
essential for activity and thus would be likely targets for alteration. Alternatively, amino 
acid residues that are conserved among the homologs of various species (e.g., murine 

1 5 and human) may be essential for activity and thus would not be likely targets for 
alteration. 

Accordingly, another aspect of the invention pertains to nucleic acid molecules 
encoding a polypeptide of the invention that contain changes in amino acid residues that 
are not essential for activity. Such polypeptides differ in amino acid sequence from the 
20 naturally-occurring proteins which correspond to the markers of the invention, yet retain 
biological activity. In one embodiment, such a protein has an amino acid sequence that 
is at least about 40% identical, 50%, 60%, 70%, 80%, 90%, 95%, or 98% identical to the 
amino acid sequence of one of the proteins which correspond to the markers of the 
invention. 

25 An isolated nucleic acid molecule encoding a variant protein can be created by 

introducing one or more nucleotide substitutions, additions or deletions into the 
nucleotide sequence of nucleic acids of the invention, such that one or more amino acid 
residue substitutions, additions, or deletions are introduced into the encoded protein. 
Mutations can be introduced by standard techniques, such as site-directed mutagenesis 

30 and PCR-mediated mutagenesis. Preferably, conservative amino acid substitutions are 
made at one or more predicted non-essential amino acid residues. A "conservative 
amino acid substitution" is one in which the amino acid residue is replaced with an 
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amino acid residue having a similar side chain. Families of amino acid residues having 
similar side chains have been defined in the art. These families include amino acids 
with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic 
acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, 
5 serine, threonine, tyrosine, cysteine), non-polar side chains {e.g., alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains 
(e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, 
phenylalanine, tryptophan, histidine). Alternatively, mutations can be introduced 
randomly along all or part of the coding sequence, such as by saturation mutagenesis, 

10 and the resultant mutants can be screened for biological activity to identify mutants that 
retain activity. Following mutagenesis, the encoded protein can be expressed 
recombinantly and the activity of the protein can be determined. 

The present invention encompasses antisense nucleic acid molecules, i.e., 
molecules which are complementary to a sense nucleic acid of the invention, e.g., 

15 complementary to the coding strand of a double-stranded cDNA molecule. 

corresponding to a marker of the invention or complementary to an mRNA sequence 
corresponding to a marker of the invention. Accordingly, an antisense nucleic acid of 
the invention can hydrogen bond to (i.e. anneal with) a sense nucleic acid of the 
invention. The antisense nucleic acid can be complementary to an entire coding strand, 

20 or to only a portion thereof, e.g., all or part of the protein coding region (or open reading 
frame). An antisense nucleic acid molecule can also be antisense to all or part of a non- 
coding region of the coding strand of a nucleotide sequence encoding a polypeptide of 
the invention. The non-coding regions ("5* and 3' untranslated regions") are the 5 r and 3' 
sequences which flank the coding region and are not translated into amino acids. 

25 An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 35, 

40, 45, or 50 or more nucleotides in length. An antisense nucleic acid of the invention 
can be constructed using chemical synthesis and enzymatic ligation reactions using 
procedures known in the art. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or 

30 variously modified nucleotides designed to increase the biological stability of the 
molecules or to increase the physical stability of the duplex formed between the 
antisense and sense nucleic acids, e.g., phosphorothioate derivatives and acridine 
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substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
5 carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 
D-mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio- 

10 N6-isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5- 
methyluracil, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl- 
2-thiouracil, 3 -(3 -amino-3 -N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. 
Alternatively, the antisense nucleic acid can be produced biologically using an 

1 5 expression vector into which a nucleic acid has been sub-cloned in an antisense 

orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an antisense 
orientation to a target nucleic acid of interest, described further in the following 
subsection). 

The antisense nucleic acid molecules of the invention are typically administered 
20 to a subject or generated in situ such that they hybridize with or bind to cellular mRNA 
and/or genomic DNA encoding a polypeptide corresponding to a selected marker of the 
invention to thereby inhibit expression of the marker, e.g., by inhibiting transcription 
and/or translation. The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense 
25 nucleic acid molecule which binds to DNA duplexes, through specific interactions in the 
major groove of the double helix. Examples of a route of administration of antisense 
nucleic acid molecules of the invention includes direct injection at a tissue site or 
infusion of the antisense nucleic acid into an ovary-associated body fluid. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then 
30 administered systemically. For example, for systemic administration, antisense 
molecules can be modified such that they specifically bind to receptors or antigens 
expressed on a selected cell surface, e.g., by linking the antisense nucleic acid molecules 
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to peptides or antibodies which bind to cell surface receptors or antigens. The antisense 
nucleic acid molecules can also be delivered to cells using the vectors described herein. 
To achieve sufficient intracellular concentrations of the antisense molecules, vector 
• constructs in which the antisense nucleic acid molecule is placed under the control of a 
5 strong pol II or pol III promoter are preferred. 

An antisense nucleic acid molecule of the invention can be an a-anomeric 
nucleic acid molecule. An a-anomeric nucleic acid molecule forms specific double- 
stranded hybrids with complementary RNA in which, contrary to the usual a-units, the 
strands run parallel to each other (Gaultier et al, 1987, Nucleic Acids Res. 15:6625- 

10 6641). The antisense nucleic acid molecule can also comprise a 2'-o- 

methylribonucleotide Qnoueetal, 1987, Nucleic Acids. Res. 15:6131-6148) or a 
chimeric RNA-DNA analogue (Inoue et al, 1987, FEBSLett. 215:327-330). 

The invention also encompasses ribozymes. Ribozymes are catalytic RNA 
molecules with ribonuclease activity which are capable of cleaving a single-stranded 

1 5 nucleic acid, such as an mRNA, to which they have a complementary region. Thus, 
ribozymes (e.g., hammerhead ribozymes as described in Haselhoff and Gerlach, 1988, 
Nature 334:585-591) can be used to catalytically cleave mRNA transcripts to thereby 
inhibit translation of the protein encoded by the mRNA. A ribozyme having specificity 
for a nucleic acid molecule encoding a polypeptide corresponding to a marker of the 

20 invention can be designed based upon the nucleotide sequence of a cDNA 

corresponding to the marker. For example, a derivative of a Tetrahymena L-19 IVS 
RNA can be constructed in which the nucleotide sequence of the active site is 
complementary to the nucleotide sequence to be cleaved (see Cech et al U.S. Patent No. 
4,987,071; and Cech et al U.S. Patent No. 5,1 16,742). Alternatively, an mRNA 

25 encoding a polypeptide of the invention can be used to select a catalytic RNA having a 
specific ribonuclease activity from a pool of RNA molecules (see, e.g., Bartel and 
Szostak, 1993, Science 261:1411-1418). 

The invention also encompasses nucleic acid molecules which form triple helical 
structures. For example, expression of a polypeptide of the invention can be inhibited 

30 by targeting nucleotide sequences complementary to the regulatory region of the gene 
encoding the polypeptide {e.g., the promoter and/or enhancer) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally Helene 
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(1991) Anticancer Drug Des. 6(6):569-84; Helene (1992) Ann. NY. Acad Sci. 660:27- 
36; and Maher (1992) Bioassays 14(12):807-15. 

In various embodiments, the nucleic acid molecules of the invention can be 
•modified at the base moiety, sugar moiety or phosphate backbone to improve, e.g., the 
5 stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of the nucleic acids can be modified to generate peptide nucleic 
acids (see Hyrup et al, 1996, Bioorganic & Medicinal Chemistry 4(1): 5-23). As used 
herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid mimics, e.g., 
DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 

10 pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 
backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA 
under conditions of low ionic strength. The synthesis of PNA oligomers can be 
performed using standard solid phase peptide synthesis protocols as described in Hyrup 
etal (1996), supra; Perry-O'Keefeetal. (1996) Proc. Natl Acad Sci. USA 93:14670- 

15 675. 

PNAs can be used in therapeutic and diagnostic applications. For example, 
PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting 
replication. PNAs can also be used, e.g., in the analysis of single base pair mutations in 

20 a gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used . 
in combination with other enzymes, e.g., SI nucleases (Hyrup (1996), supra; or as 
probes or primers for DNA sequence and hybridization (Hyrup, 1996, supra; Perry- 
WKeefeetal, 1996, Proc. Natl. Acad Sci. USA 93:14670-675). 

In another embodiment, PNAs can be modified, e.g., to enhance their stability or 

25 cellular uptake, by attaching lipophilic or other helper groups to PNA, by the formation 
of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug delivery 
known in the art. For example, PNA-DNA chimeras can be generated which can 
combine the advantageous properties of PNA and DNA. Such chimeras allow DNA 
recognition enzymes, e.g. , RNASE H and DNA polymerases, to interact with the DNA 

30 portion while the PNA portion would provide high binding affinity and specificity. 

PNA-DNA chimeras can be linked using linkers of appropriate lengths selected in terms 
of base stacking, number of bonds between the nucleobases, and orientation (Hyrup, 
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1996, supra). The synthesis of PNA-DNA chimeras can be performed as described in 
Hymp (1996), supra, and Finn etal (1996) Nucleic Acids Res. 24(17):3357-63. For 
example, a DNA chain can be synthesized on a solid support using standard 
phosphoramidite coupling chemistry and modified nucleoside analogs. Compounds 
5 such as 5'-(4-methoxytrityl)amino-5 r «deoxy-thymidine phosphoramidite can be used as a 
link between the PNA and the 5' end of DNA (Mag et al, 1989, Nucleic Acids Res. 
17:5973-88). PNA monomers are then coupled in a step-wise manner to produce a 
chimeric molecule with a 5' PNA segment and a 3' DNA segment (Finn et al, 1996, 
Nucleic Acids Res. 24(17):3357-63). Alternatively, chimeric molecules can be 

10 synthesized with a 5' DNA segment and a 3' PNA segment (Peterser et al, 1975, 
Bioorganic Med. Chem. Lett. 5:1119-11 124). 

In other embodiments, the oligonucleotide can include other appended groups 
such as peptides (e.g., for targeting host cell receptors in vivo), or agents facilitating 
transport across the cell membrane (see, e.g., Letsinger et al. } 1989, Proc. Natl Acad. 

15 Set USA 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. Sci. USA 84:648-652; 
PCT Publication No. WO 88/09810) or the blood-brain barrier (see, e.g., PCT 
Publication No. WO 89/10134). In addition, oligonucleotides can be modified with 
hybridization-triggered cleavage agents (see, e.g., Krol et al, 1988, Bio/Techniques 
6:958-976) or intercalating agents (see, e.g., Zon, 1988, Pharm. Res. 5:539-549). To 

20 this end, the oligonucleotide can be conjugated to another molecule, e.g., a peptide, 
hybridization triggered cross-linking agent, transport agent, hybridization-triggered 
cleavage agent, etc. 

The invention also includes molecular beacon nucleic acids having at least one 
region which is complementary to a nucleic acid of the invention, such that the 

25 molecular beacon is useful for quantitating the presence of the nucleic acid of the 

invention in a sample. A "molecular beacon" nucleic acid is a nucleic acid comprising a 
pair of complementary regions and having a fluorophore and a fluorescent quencher 
associated therewith. The fluorophore and quencher are associated with different 
portions of the nucleic acid in such an orientation that when the complementary regions 

30 are annealed with one another, fluorescence of the fluorophore is quenched by the 
quencher. When the complementary regions of the nucleic acid are not annealed with 
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one another, fluorescence of the fluorophore is quenched to a lesser degree. Molecular 
beacon nucleic acids are described, for example, in U.S. Patent 5,876,930. 

II. Isolated Proteins and Antibodies 
5 One aspect of the invention pertains to isolated proteins which correspond to 

individual markers of the invention, and biologically active portions thereof, as well as 
polypeptide fragments suitable for use as immunogens to raise antibodies directed 
against a polypeptide corresponding to a marker of the invention. In one embodiment, 
the native polypeptide corresponding to a marker can be isolated from cells or tissue 

1 0 sources by an appropriate purification scheme using standard protein purification 
techniques. In another embodiment, polypeptides corresponding to a marker of the 
invention are produced by recombinant DNA techniques. Alternative to recombinant 
expression, a polypeptide corresponding to a marker of the invention can be synthesized 
chemically using standard peptide synthesis techniques. 

15 An "isolated" or "purified" protein or biologically active portion thereof is 

substantially free of cellular material or other contaminating proteins from the cell or 
tissue source from which the protein is derived, or substantially free of chemical, 
precursors or other chemicals when chemically synthesized. The language 
"substantially free of cellular material" includes preparations of protein in which the 

20 protein is separated from cellular components of the cells from which it is isolated or 
recombinantly produced. Thus, protein that is substantially free of cellular material 
includes preparations of protein having less than about 30%, 20%, 10%, or 5% (by dry 
weight) of heterologous protein (also referred to herein as a "contaminating protein"). 
When the protein or biologically active portion thereof is recombinantly produced, it is 

25 also preferably substantially free of culture medium, Le. 9 culture medium represents less 
than about 20%, 10%, or 5% of the volume of the protein preparation. When the protein 
is produced by chemical synthesis, it is preferably substantially free of chemical 
precursors or other chemicals, i.e., it is separated from chemical precursors or other 
chemicals which are involved in the synthesis of the protein. Accordingly such 

30 preparations of the protein have less than about 30%, 20%, 1 0%, 5% (by dry weight) of 
chemical precursors or compounds other than the polypeptide of interest. 
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Biologically active portions of a polypeptide corresponding to a marker of the 
invention include polypeptides comprising amino acid sequences sufficiently identical to 
or derived from the amino acid sequence of the protein corresponding to the marker, 
which include fewer amino acids than the full length protein, and exhibit at least one 
5 activity of the corresponding full-length protein. Typically, biologically active portions 
comprise a domain or motif with at least one activity of the corresponding protein. A 
biologically active portion of a protein of the invention can be a polypeptide which is, 
for example, 10, 25, 50, 100 or more amino acids in length. Moreover, other 
biologically active portions, in which other regions of the protein are deleted, can be 

10 prepared by recombinant techniques and evaluated for one or more of the functional 
activities of the native form of a polypeptide of the invention. 

Preferred polypeptides are encoded by the nucleotide sequences of Tables 1-2. 
Other useful proteins are substantially identical (e.g., at least about 40%, preferably 
50%, 60%, 70%, 80%, 90%, 95%, or 99%) to one of these sequences and retain the 

1 5 functional activity of the protein of the corresponding naturally-occurring protein yet 
differ in amino acid sequence due to natural allelic variation or mutagenesis. 

To determine the percent identity of two amino acid sequences or of two nucleic 
acids, the sequences are aligned for optimal comparison purposes {e.g., gaps can be 
introduced in the sequence of a first amino acid or nucleic acid sequence for optimal 

20 alignment with a second amino or nucleic acid sequence). The amino acid residues or 
nucleotides at corresponding amino acid positions or nucleotide positions are then 
compared. When a position in the first sequence is occupied by the same amino acid 
residue or nucleotide as the corresponding position in the second sequence, then the 
molecules are identical at that position. The percent identity between the two sequences 

25 is a function of the number of identical positions shared by the sequences (i.e., % 

identity = # of identical positions/total # of positions (e.g., overlapping positions) xlOO). 
In one embodiment the two sequences are the same length. 

The determination of percent identity between two sequences can be 
accomplished using a mathematical algorithm. A preferred, non-limiting example of a 

30 mathematical algorithm utilized for the comparison of two sequences is the algorithm of 
Karlin and Altschul (1990) Proc. Natl Acad. Set USA 87:2264-2268, modified as in 
Karlin and Altschul (1993) Proc. Natl. Acad Sci. USA 90:5873-5877. Such an 
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algorithm is incorporated into the NBLAST and XBLAST programs of Altschul, et ah 
(1990) X Mol. Biol. 215:403-410. BLAST nucleotide searches can be performed with 
the NBLAST program, score = 100, wordlength = 12 to obtain nucleotide sequences 
homologous to a nucleic acid molecules of the invention. BLAST protein searches can 
5 be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino 
acid sequences homologous to a protein molecules of the invention. To obtain gapped 
alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et ah (1997) Nucleic Acids Res. 25:3389-3402. Alternatively, PSI-Blast can be 
used to perform an iterated search which detects distant relationships between 

1 0 molecules. When utilizing BLAST, Gapped BLAST, and PSI-Blast programs, the 
default parameters of the respective programs (e.g., XBLAST and NBLAST) can be 
used. See http://www.ncbi.nlm.nih.gov. Another preferred, non-limiting example of a 
mathematical algorithm utilized for the comparison of sequences is the algorithm of 
Myers and Miller, (1988) CABIOS 4:11-17. Such an algorithm is incorporated into the 

1 5 ALIGN program (version 2.0) which is part of the GCG sequence alignment software 
package. When utilizing the ALIGN program for comparing amino acid sequences, a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be 
used. Yet another useful algorithm for identifying regions of local sequence similarity 
and alignment is the FAST A algorithm as described in Pearson and Lipman (1988) 

20 Proc. Natl. Acad. Sci. USA 85:2444-2448. When using the FASTA algorithm for 

comparing nucleotide or amino acid sequences, a PAM120 weight residue table can, for 
example, be used with a £-tuple value of 2. 

The percent identity between two sequences can be determined using techniques 
similar to those described above, with or without allowing gaps. In calculating percent 

25 identity, only exact matches are counted. 

The invention also provides chimeric or fusion proteins corresponding to a 
marker of the invention. As used herein, a "chimeric protein' 1 or "fusion protein" 
comprises all or part (preferably a biologically active part) of a polypeptide 
corresponding to a marker of the invention operably linked to a heterologous 

30 polypeptide (j. e. , a polypeptide other than the polypeptide corresponding to the marker). 
Within the fusion protein, the term "operably linked" is intended to indicate that the 
polypeptide of the invention and the heterologous polypeptide are fused in-frame to each 
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other. The heterologous polypeptide can be fused to the ammo-terminus or the 
carboxyl-tenninus of the polypeptide of the invention. 

One useful fusion protein is a GST fusion protein in which a polypeptide 
corresponding to a marker of the invention is fused to the carboxyl terminus of GST 
5 sequences. Such fusion proteins can facilitate the purification of a recombinant 
polypeptide of the invention. 

In another embodiment, the fusion protein contains a heterologous signal 
sequence at its amino terminus. For example, the native signal sequence of a 
polypeptide corresponding to a marker of the invention can be removed and replaced 

10 with a signal sequence from another protein. For example, the gp67 secretory sequence 
of the baculovirus envelope protein can be used as a heterologous signal sequence 
(Ausubel et al, ed., Current Protocols in Molecular Biology, John Wiley & Sons, NY, 
1992). Other examples of eukaryotic heterologous signal sequences include the 
secretory sequences of melittin and human placental alkaline phosphatase (Stratagene; 

15 La Jolla, California). In yet another example, useful prokaryotic heterologous signal 
sequences include the phoA secretory signal (Sambrook et al, supra) and the protein A 
secretory signal (Pharmacia Biotech; Piscataway, New Jersey). 

In yet another embodiment, the fusion protein is an immunoglobulin fusion 
protein in which all or part of a polypeptide corresponding to a marker of the invention 

20 is fused to sequences derived from a member of the immunoglobulin protein family. 
The immunoglobulin fusion proteins of the invention can be incorporated into 
pharmaceutical compositions and administered to a subject to inhibit an interaction 
between a ligand (soluble or membrane-bound) and a protein on the surface of a cell 
(receptor), to thereby suppress signal transduction in vivo. The immunoglobulin fusion 

25 protein can be used to affect the bioavailability of a cognate ligand of a polypeptide of 
the invention. Inhibition of ligand/receptor interaction can be useful therapeutically, 
both for treating proliferative and differentiative disorders and for modulating (e.g. 
promoting or inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of 
the invention can be used as immunogens to produce antibodies directed against a 

30 polypeptide of the invention in a subject, to purify ligands and in screening assays to 
identify molecules which inhibit the interaction of receptors with ligands. 
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Chimeric and fusion proteins of the invention can be produced by standard 
recombinant DNA techniques. In another embodiment, the fusion gene can be 
synthesized by conventional techniques including automated DNA synthesizers. 
•Alternatively, PCR amplification of gene fragments can be carried out using anchor 
5 primers which give rise to complementary overhangs between two consecutive gene 
fragments which can subsequently be annealed and re-amplified to generate a chimeric 
gene sequence (see, e.g., Ausubel et al, supra). Moreover, many expression vectors are 
commercially available that already encode a fusion moiety (e.g., a GST polypeptide). 
A nucleic acid encoding a polypeptide of the invention can be cloned into such an 
1 0 expression vector such that the fusion moiety is linked in-frame to the polypeptide of the 
invention. 

A signal sequence can be used to facilitate secretion and isolation of the secreted 
protein or other proteins of interest. Signal sequences are typically characterized by a 
core of hydrophobic amino acids which are generally cleaved from the mature protein 

15 during secretion in one or more cleavage events. Such signal peptides contain 

processing sites that allow cleavage of the signal sequence from the mature proteins as 
they pass through the secretory pathway. Thus, the invention pertains to the described 
polypeptides having a signal sequence, as well as to polypeptides from which the signal 
sequence has been proteolytically cleaved (i.e. , the cleavage products). In one 

20 embodiment, a nucleic acid sequence encoding a signal sequence can be operably linked 
in an expression vector to a protein of interest, such as a protein which is ordinarily not 
secreted or is otherwise difficult to isolate. The signal sequence directs secretion of the 
protein, such as from a eukaryotic host into which the expression vector is transformed, 
and the signal sequence is subsequently or concurrently cleaved. The protein can then 

25 be readily purified from the extracellular medium by art recognized methods. 
Alternatively, the signal sequence can be linked to the protein of interest using a 
sequence which facilitates purification, such as with a GST domain. 

The present invention also pertains to variants of the polypeptides corresponding 
to individual markers of the invention. Such variants have an altered amino acid 

30 sequence which can function as either agonists (mimetics) or as antagonists. Variants 
can be generated by mutagenesis, e.g., discrete point mutation or truncation. An agonist 
can retain, substantially the same, or a subset, of the biological activities of the naturally 
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occurring form of the protein. An antagonist of a protein can inhibit one or more of the 
activities of the naturally occurring form of the protein by, for example, competitively . 
binding to a downstream or upstream member of a cellular signaling cascade which 
includes the protein of interest. Thus, specific biological effects can be elicited by 
5 treatment with a variant of limited function. Treatment of a subject with a variant 

having a subset of the biological activities of the naturally occurring form of the protein 
can have fewer side effects in a subject relative to treatment with the naturally occurring 
form of the protein. 

Variants of a protein of the invention which function as either agonists 

1 0 (mimetics) or as antagonists can be identified by screening combinatorial libraries of 
mutants, e.g., truncation mutants, of the protein of the invention for agonist or antagonist 
activity. In one embodiment, a variegated library of variants is generated by 
combinatorial mutagenesis at the nucleic acid level and is encoded by a variegated gene 
library. A variegated library of variants can be produced by, for example, enzymatically . 

15 ligating a mixture of synthetic oligonucleotides into gene sequences such that a 

degenerate set of potential protein sequences is expressible as individual polypeptides, 
or alternatively, as a set of larger fusion proteins (e.g., for phage display). There are a 
variety of methods which can be used to produce libraries of potential variants of the 
polypeptides of the invention from a degenerate oligonucleotide sequence. Methods for 

20 synthesizing degenerate oligonucleotides are known in the art (see, e.g., Narang, 1983, 
Tetrahedron 39:3; Itakura etal s 1984, 4nnu. Rev. Biochem. 53:323; Itakura etal, 1984, 
Science 198:1056; Ike et ctl, 1983 Nucleic Acid Res. 11:477). 

In addition, libraries of fragments of the coding sequence of a polypeptide 
corresponding to a marker of the invention can be used to generate a variegated 

25 population of polypeptides for screening and subsequent selection of variants. For 
example, a library of coding sequence fragments can be generated by treating a double 
stranded PCR fragment of the coding sequence of interest with a nuclease under 
conditions wherein nicking occurs only about once per molecule, denaturing the double 
stranded DNA, renaturing the DNA to form double stranded DNA which can include 

30 sense/antisense pairs from different nicked products, removing single stranded portions 
from reformed duplexes by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this method, an expression library can be 
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derived which encodes amino terminal and internal fragments of various sizes of the 
protein of interest. 

Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
5 libraries for gene products having a selected property. The most widely used 

techniques, which are amenable to high through-put analysis, for screening large gene 
libraries typically include cloning the gene library into replicable expression vectors, 
transforming appropriate cells with the resulting library of vectors, and expressing the 
combinatorial genes under conditions in which detection of a desired activity facilitates 

10 isolation of the vector encoding the gene whose product was detected. Recursive 

ensemble mutagenesis (REM), a technique which enhances the frequency of functional 
mutants in the libraries, can be used in combination with the screening assays to identify 
variants of a protein of the invention (Arkin and Yourvan, 1992, Proc. Natl Acad. Set 
USA 59:781 1-7815; Delgrave et al t 1993, Protein Engineering 6(3):327- 33 1). 

1 5 An isolated polypeptide corresponding to a marker of the invention, or a 

fragment thereof, can be used as an immunogen to generate antibodies using standard 
techniques for polyclonal and monoclonal antibody preparation. The full-length 
polypeptide or protein can be used or, alternatively, the invention provides antigenic 
peptide fragments for use as immunogens. The antigenic peptide of a protein of the 

20 invention comprises at least 8 (preferably 10, 15, 20, or 30 or more) amino acid residues 
of the amino acid sequence of one of the polypeptides of the invention, and encompasses 
an epitope of the protein such that an antibody raised against the peptide forms a specific 
immune complex with a marker of the invention to which the protein corresponds. 
Preferred epitopes encompassed by the antigenic peptide are regions that are located on 

25 the surface of the protein, e.g., hydrophilic regions. Hydrophobicity sequence analysis, 
hydrophilicity sequence analysis, or similar analyses can be used to identify hydrophilic 
regions. 

An immunogen typically is used to prepare antibodies by immunizing a suitable 
(i.e. immunocompetent) subject such as a rabbit, goat, mouse, or other mammal or 
30 vertebrate. An appropriate immunogenic preparation can contain, for example, 

recombinantly-expressed or chemically-synthesized polypeptide. The preparation can 
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further include an adjuvant, such as Freund's complete or incomplete adjuvant, or a 
similar immunostimulatory agent. 

Accordingly, another aspect of the invention pertains to antibodies directed 
against a polypeptide of the invention. The terms "antibody" and "antibody substance" 
5 as used interchangeably herein refer to immunoglobulin molecules and immunologically 
active portions of immunoglobulin molecules, Le. 9 molecules that contain an antigen 
binding site which specifically binds an antigen, such as a polypeptide of the invention, 
e.g., an epitope of a polypeptide of the invention. A molecule which specifically binds 
to a given polypeptide of the invention is a molecule which binds the polypeptide, but 

10 does not substantially bind other molecules in a sample, e.g., a biological sample, which 
naturally contains the polypeptide. Examples of immunologically active portions of 
immunoglobulin molecules include F(ab) and F(ab')2 fragments which can be generated 
by treating the antibody with an enzyme such as pepsin. The invention provides 
polyclonal and monoclonal antibodies. The term "monoclonal antibody" or 

1 5 "monoclonal antibody composition", as used herein, refers to a population of antibody 
molecules that contain only one species of an antigen binding site capable of 
immunoreacting with a particular epitope. 

Polyclonal antibodies can be prepared as described above by immunizing a 
suitable subject with a polypeptide of the invention as an immunogen. Preferred 

20 polyclonal antibody compositions are ones that have been selected for antibodies 

directed against a polypeptide or polypeptides of the invention. Particularly preferred 
polyclonal antibody preparations are ones that contain only antibodies directed against a 
polypeptide or polypeptides of the invention. Particularly preferred immunogen 
compositions are those that contain no other human proteins such as, for example, 

25 immunogen compositions made using a non-human host cell for recombinant expression 
of a polypeptide of the invention. In such a manner, the only human epitope or epitopes 
recognized by the resulting antibody compositions raised against this immunogen will 
be present as part of a polypeptide or polypeptides of the invention. 

The antibody titer in the immunized subject can be monitored over time by 

30 standard techniques, such as with an enzyme linked immunosorbent assay (ELISA) 
using immobilized polypeptide. If desired, the antibody molecules can be harvested or 
isolated from the subject (e.g., from the blood or serum of the subject) and further 
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purified by well-known techniques, such as protein A chromatography to obtain the IgG 
fraction. Alternatively, antibodies specific for a protein or polypeptide of the invention 
can be selected or (e.g., partially purified) or purified by, e.g., affinity chromatography. 
For example, a recombinantly expressed and purified (or partially purified) protein of 
5 the invention is produced as described herein, and covalently or non-covalently coupled 
to a solid support such as, for example, a chromatography column. The column can then 
be used to affinity purify antibodies specific for the proteins of the invention from a 
sample containing antibodies directed against a large number of different epitopes, 
thereby generating a substantially purified antibody composition, i.e., one that is 

1 0 substantially free of contaminating antibodies. By a substantially purified antibody 
composition is meant, in this context, that the antibody sample contains at most only 
30% (by dry weight) of contaminating antibodies directed against epitopes other than 
those of the desired protein or polypeptide of the invention, and preferably at most 20%, 
yet more preferably at most 10%, and most preferably at most 5% (by dry weight) of the 

1 5 sample is contaminating antibodies. A purified antibody composition means that at least 
99% of the antibodies in the composition are directed against the desired protein or 
polypeptide of the invention. 

At an appropriate time after immunization, e.g., when the specific antibody titers 
are highest, antibody-producing cells can be obtained from the subject and used to 

20 prepare monoclonal antibodies by standard techniques, such as the hybridoma technique 
originally described by Kohler and Milstein (1975) Nature 256:495-497, the human B 
cell hybridoma technique (see Kozbor et al, 1983, Immunol. Today 4:72), the EBV- 
hybridoma technique (see Cole et al, pp. 77-96 In Monoclonal Antibodies and Cancer 
Therapy, Alan R. Liss, Inc., 1985) or trioma techniques. The technology for producing 

25 hybridomas is well known (see generally Current Protocols in Immunology, Coligan et 
al ed., John Wiley & Sons, New York, 1994). Hybridoma cells producing a 
monoclonal antibody of the invention are detected by screening the hybridoma culture 
supernatants for antibodies that bind the polypeptide of interest, e.g., using a standard 
ELISA assay. 

3 0 Alternative to preparing monoclonal antibody-secreting hybridomas, a 

monoclonal antibody directed against a polypeptide of the invention can be identified 
and isolated by screening a recombinant combinatorial immunoglobulin library {e.g., an 
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antibody phage display library) with the polypeptide of interest. Kits for generating and 
screening phage display libraries are commercially available (e.g., the Pharmacia 
Recombinant Phage Antibody System, Catalog No. 27-9400-01 ; and the Stratagene 
SurfLAP Phage Display Kit, Catalog No. 240612). Additionally, examples of methods 
5 and reagents particularly amenable for use in generating and screening antibody display 
library can be found in, for example, U.S. Patent No. 5,223,409; PCT Publication No. 
WO 92/1 8619; PCT Publication No. WO 91/17271 ; PCT Publication No. WO 
92/20791; PCT Publication No. WO 92/15679; PCT Publication No. WO 93/01288; 
PCT Publication No. WO 92/01047; PCT Publication No. WO 92/09690; PCT 

10 Publication No. WO 90/02809; Fuchs et al (1991) Bio/Technology 9:1370-1372; Hay et 
al (1992) Hum, Antibod Hybridomas 3:81-85; Huse et al (1989) Science 246:1275- 
1281; Griffiths etal (1993) EMBOJ. 12:725-734. 

Additionally, recombinant antibodies, such as chimeric and humanized 
monoclonal antibodies, comprising both human and non-human portions, which can be 

1 5 made using standard recombinant DNA techniques, are within the scope of the 

invention. A chimeric antibody is a molecule in which different portions are derived 
from different animal species, such as those having a variable region derived from a 
murine mAb and a human immunoglobulin constant region. (See, e.g } Cabilly et al., 
U.S. Patent No. 4,816,567; and Boss et al., U.S. Patent No. 4,816,397, which are 

20 incorporated herein by reference in their entirety.) Humanized antibodies are antibody 
molecules from non-human species having one or more complementarily determining 
regions (CDRs) from the non-human species and a framework region from a human 
immunoglobulin molecule. (See, e.g.; Queen, U.S. Patent No. 5,585,089, which is 
incorporated herein by reference in its entirety.) Such chimeric and humanized 

25 monoclonal antibodies can be produced by recombinant DNA techniques known in the 
art, for example using methods described in PCT Publication No. WO 87/02671; 
European Patent Application 184,187; European Patent Application 171,496; European 
Patent Application 173,494; PCT Publication No. WO 86/01533; U.S. Patent No. 
4,816,567; European Patent Application 125,023; Better et al (1988) Science 240:1041- 

30 1043; Liu et al (1987) Proc. Natl Acad Sci. USA 84:3439-3443; Liu et al (1987) J. 
Immunol 139:3521- 3526; Sun etal (1987) Proa Natl Acad Set USA 84:214-218; 
Nishimura et al (1987) Cancer Res, 47:999-1005; Wood et al (1985) Nature 314:446- 
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449; and Shaw et al (1988)./ Natl Cancer Inst 80:1553-1559); Morrison (1985) 
Science 229:1202-1207; Oi et al (1986) Bio/Techniques 4:214; U.S. Patent 5,225,539; 
Jones etal (1986) Nature 321:552-525; Verhoeyan etal (1988) Science 239:1534; and 
Beidler et al (1988) J. Immunol 141:4053-4060. 
5 Antibodies of the invention may be used as therapeutic agents in treating 

cancers. In a preferred embodiment, completely human antibodies of the invention are 
used for therapeutic treatment of human cancer patients, particularly those having an 
ovarian cancer. Such antibodies can be produced, for example, using transgenic mice 
which are incapable of expressing endogenous immunoglobulin heavy and light chains 

10 genes, but which can express human heavy and light chain genes. The transgenic mice 
are immunized in the normal fashion with a selected antigen, e.g., all or a portion of a 
polypeptide corresponding to a marker of the invention. Monoclonal antibodies directed 
against the antigen can be obtained using conventional hybridoma technology. The 
human immunoglobulin transgenes harbored by the transgenic mice rearrange during B 

15 cell differentiation, and subsequently undergo class switching and somatic mutation. i 
Thus, using such a technique, it is possible to produce therapeutically useful IgG, IgA 
and IgE antibodies. For an overview of this technology for producing human antibodies, 
see Lonberg and Huszar (1995) Int. Rev. Immunol 13:65-93). For a detailed discussion 
of this technology for producing human antibodies and human monoclonal antibodies 

20 and protocols for producing such antibodies, see, e.g. , U.S. Patent 5,625,126; U.S. 
Patent 5,633,425; U.S. Patent 5,569,825; U.S. Patent 5,661,016; and U.S. Patent 
5,545,806. In addition, companies such as Abgenix, Inc. (Freemont, CA), can be 
engaged to provide human antibodies directed against a selected antigen using 
technology similar to that described above. 

25 Completely human antibodies which recognize a selected epitope can be 

generated using a technique referred to as "guided selection." In this approach a 
selected non-human monoclonal antibody, e.g., a murine antibody, is used to guide the 
selection of a completely human antibody recognizing the same epitope (Jespers et al, 
1994, Bio/technology 12:899-903). 

30 An antibody directed against a polypeptide corresponding to a marker of the 

invention (e.g., a monoclonal antibody) can be used to isolate the polypeptide by 
standard techniques, such as affinity chromatography or immunoprecipitation. 
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Moreover, such an antibody can be used to detect the marker (e.g., in a cellular lysate or 
cell supernatant) in order to evaluate the level and pattern of expression of the marker. 
The antibodies can also be used diagnostically to monitor protein levels in tissues or 
body fluids (e.g. in an ovary-associated body fluid) as part of a clinical testing 
5 procedure, e.g., to, for example, determine the efficacy of a given treatment regimen. 
Detection can be facilitated by coupling the antibody to a detectable substance. 
Examples of detectable substances include various enzymes, prosthetic groups, 
fluorescent materials, luminescent materials, bioluminescent materials, and radioactive 
materials. Examples of suitable enzymes include horseradish peroxidase, alkaline 
1 0 phosphatase, p-galactosidase, or acetylcholinesterase; examples of suitable prosthetic 
group complexes include streptavidin/biotin and avidin/biotin; examples of suitable 
fluorescent materials include umbelliferone, fluorescein, fluorescein isothiocyanate, 
rhodamine, dichlorotriazinylamine fluorescein, dansyl chloride or phycoerythrin; an 
example of a luminescent material includes luminol; examples of bioluminescent 

15 materials include luciferase, luciferin, and aequorin, and examples of suitable 

• , . , , * 2 K 131, 35 ~ 3 _ _ 
radioactive material include I, I, S or H. 

Further, an antibody (or fragment thereof) can be conjugated to a therapeutic 

moiety such as a cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin 

or cytotoxic agent includes any agent that is detrimental to cells. Examples include 

20 taxol, cytochalasin B, gramicidin D, ethidium bromide, emetine, mitomycin, etoposide, 
tenoposide, vincristine, vinblastine, colchicin, doxorubicin, daunorubicin, dihydroxy 
anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1-dehydrotestosterone, 
glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and analogs 
or homologs thereof Therapeutic agents include, but are not limited to, antimetabolites 

25 (e.g., methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil 
decarbazine), alkylating agents (e.g., mechlorethamine, thioepa chlorambucil, 
melphalan, carmustine (BSNU) and lomustine (CCNU), cyclothosphamide, busulfan, 
dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine platinum (II) 
(DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 

30 doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, 
mithramycin, and anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and 
vinblastine). 



WO 01/070979 



PCMJS01/09126 



-51 - 

The conjugates of the invention can be used for modifying a given biological 
response, the drug moiety is not to be construed as limited to classical chemical 
therapeutic agents. For example, the drug moiety may be a protein or polypeptide 
possessing a desired biological activity. Such proteins may include, for example, a toxin 
5 such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a protein such as 
tumor necrosis factor, .alpha.-interferon, .beta. -interferon, nerve growth factor, platelet 
derived growth factor, tissue plasminogen activator; or, biological response modifiers 
such as, for example, lymphokines, interleukin-1 ("IL-l"), interleukin-2 ("IL-2"), 
interleukin-6 ("IL-6"), granulocyte macrophase colony stimulating factor ("GM-CSF"), 

10 granulocyte colony stimulating factor ("G-CSF"), or other growth factors. 

Techniques for conjugating such therapeutic moiety to antibodies are well 
known, see. e.g., Arnon et al., "Monoclonal Antibodies For Immunotargeting Of Drugs 
In Cancer Therapy", in Monoclonal Antibodies And Cancer Therapy, Reisfeld et al. 
(eds.), pp. 243-56 (Alan R. Liss, Inc. 1985); Hellstrom et al., "Antibodies For Drug 

1 5 Delivery", in Controlled Drug Delivery (2nd Ed.), Robinson et al. (eds.), pp. 623-53 
(Marcel Dekker, Inc. 1987); Thorpe, "Antibody Carriers Of Cytotoxic Agents In Cancer 
Therapy: A Review", in Monoclonal Antibodies f 84: Biological And Clinical 
Applications, Pinchera et al. (eds.), pp. 475-506 (1985); "Analysis, Results, And Future 
Prospective Of The Therapeutic Use Of Radiolabeled Antibody In Cancer Therapy", in 

20 Monoclonal Antibodies For Cancer Detection And Therapy, Baldwin et al. (eds.), pp. 
303-16 (Academic Press 1985), and Thorpe et al., "The Preparation And Cytotoxic 
Properties Of Antibody-Toxin Conjugates", Immunol. Rev., 62:1 19-58 (1982). 

Alternatively, an antibody can be conjugated to a second antibody to form an 
antibody heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

25 Accordingly, in one aspect, the invention provides substantially purified 

antibodies or fragments thereof, and non-human antibodies or fragments thereof, which 
antibodies or fragments specifically bind to a polypeptide comprising an amino acid 
sequence selected from the group consisting of the amino acid sequences of the present 
invention, an amino acid sequence encoded by the cDNA of the present invention, a 

30 fragment of at least 1 5 amino acid residues of an amino acid sequence of the present 
invention, an amino acid sequence which is at least 95% identical to the amino acid 
sequence of the present invention (wherein the percent identity is determined using the 
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ALIGN program of the GCG software package with a PAM120 weight residue table, a 
gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence which is 
encoded by a nucleic acid molecule which hybridizes to a nucleic acid molecule 
consisting of the nucleic acid molecules of the present invention, or a complement 
thereof, under conditions of hybridization of 6X SSC at 45°C and washing in 0.2 X 
SSC, 0. 1% SDS at 65°C. In various embodiments, the substantially purified antibodies 
of the invention, or fragments thereof, can be human, non-human, chimeric and/or 
humanized antibodies. 

In another aspect, the invention provides non-human antibodies or fragments 
thereof, which antibodies or fragments specifically bind to a polypeptide comprising an 
amino acid sequence selected from the group consisting of: the amino acid sequence of 
the present invention, an amino acid sequence encoded by the cDNA of the present 
invention, a fragment of at least 15 amino acid residues of the amino acid sequence of 
the present invention, an amino acid sequence which is at least 95% identical to the 
amino acid sequence of the present invention (wherein the percent identity is determined 
using the ALIGN program of the GCG software package with a PAM120 weight residue ■* 
table, a gap length penalty of 12, and a gap penalty of 4) and an amino acid sequence V 
which is encoded by a nucleic acid molecule which hybridizes to a nucleic acid 
molecule consisting of the nucleic acid molecules of the present invention, or a 
complement thereof, under conditions of hybridization of 6X SSC at 45°C and washing 
in 0.2 X SSC, 0.1% SDS at 65°C. Such non-human antibodies can be goat, mouse, 
sheep, horse, chicken, rabbit, or rat antibodies. Alternatively, the non-human antibodies 
of the invention can be chimeric and/or humanized antibodies. In addition, the non- 
human antibodies of the invention can be polyclonal antibodies or monoclonal 
antibodies. 

In still a further aspect, the invention provides monoclonal antibodies or 
fragments thereof, which antibodies or fragments specifically bind to a polypeptide 
comprising an amino acid sequence selected from the group consisting of the amino acid 
sequences of the present invention, an amino acid sequence encoded by the cDNA of the 
present invention, a fragment of at least 15 amino acid residues of an amino acid 
sequence of the present invention, an amino acid sequence which is at least 95% 
identical to an amino acid sequence of the present invention (wherein the percent 
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identity is determined using the ALIGN program of the GCG software package with a 
PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4) and an 
amino acid sequence which is encoded by a nucleic acid molecule which hybridizes to a 
nucleic acid molecule consisting of the nucleic acid molecules of the present invention, 
5 or a complement thereof, under conditions of hybridization of 6X SSC at 45°C and 
washing in 0.2 X SSC, 0. 1 % SDS at 65°C. The monoclonal antibodies can be human, 
humanized, chimeric and/or non-human antibodies. 

The substantially purified antibodies or fragments thereof may specifically bind 
to a signal peptide, a secreted sequence, an extracellular domain, a transmembrane or a 

1 0 cytoplasmic domain or cytoplasmic membrane of a polypeptide of the invention. In a 
particularly preferred embodiment, the substantially purified antibodies or fragments 
thereof, the non-human antibodies or fragments thereof, and/or the monoclonal 
antibodies or fragments thereof, of the invention specifically bind to a secreted sequence 
or an extracellular domain of the amino acid sequences of the present invention. 

15 Any of the antibodies of the invention can be conjugated to a therapeutic moiety 

or to a detectable substance. Non-limiting examples of detectable substances that can be 
conjugated to the antibodies of the invention are an enzyme, a prosthetic group, a 
fluorescent material, a luminescent material, a bioluminescent material, and a 
radioactive material. 

20 The invention also provides a kit cQntaining an antibody of the invention 

conjugated to a detectable substance, and instructions for use. Still another aspect of the 
invention is a pharmaceutical composition comprising an antibody of the invention and 
a pharmaceutically acceptable carrier. In preferred embodiments, the pharmaceutical 
composition contains an antibody of the invention, a therapeutic moiety, and a 

25 pharmaceutically acceptable carrier. 

Still another aspect of the invention is a method of making an antibody that 
specifically recognizes a polypeptide of the present invention, the method comprising 
immunizing a mammal with a polypeptide. The polypeptide used as an immungen 
comprises an amino acid sequence selected from the group consisting of the amino acid 

30 sequence of the present invention, an amino acid sequence encoded by the cDNA of the 
nucleic acid molecules of the present invention, a fragment of at least 15 amino acid 
residues of the amino acid sequence of the present invention, an amino acid sequence 



WO 01/070979 PCTYUS01/09126 

-54- 

which is at least 95% identical to the amino acid sequence of the present invention 
(wherein the percent identity is determined using the ALIGN program of the GCG 
software package with a PAM120 weight residue table, a gap length penalty of 12, and a 
gap penalty of 4) and an amino acid sequence which is encoded by a nucleic acid 
5 molecule which hybridizes to a nucleic acid molecule consisting of the nucleic acid 
molecules of the present invention, or a complement thereof, under conditions of 
hybridization of 6X SSC at 45°C and washing in 0.2 X SSC, 0.1% SDS at 65°C. 

After immunization, a sample is collected from the mammal that contains an 
antibody that specifically recognizes the polypeptide. Preferably, the polypeptide is 
10 recombinantly produced using a non-human host cell. Optionally, the antibodies can be 
further purified from the sample using techniques well known to those of skill in the art. 
The method can further comprise producing a monoclonal antibody- producing cell from 
the cells of the mammal. Optionally, antibodies are collected from the 
antibody-producing cell. 

15 

III. Recombinant Expression Vectors and Host Cells 

Another aspect of the invention pertains to vectors, preferably expression 
vectors, containing a nucleic acid encoding a polypeptide corresponding to a marker of 
the invention (or a portion of such a polypeptide). As used herein, the term "vector" 

20 refers to a nucleic acid molecule capable of transporting another nucleic acid to which it 
has been linked. One type of vector is a "plasmid", which refers to a circular double 
stranded DNA loop into which additional DNA segments can be ligated. Another type 
of vector is a viral vector, wherein additional DNA segments can be ligated into the viral 
genome. Certain vectors are capable of autonomous replication in a host cell into which 

25 they are introduced (e.g., bacterial vectors having a bacterial origin of replication and 
episomal mammalian vectors). Other vectors (e.g., non-episomal mammalian vectors) 
are integrated into the genome of a host cell upon introduction into the host cell, and 
thereby are replicated along with the host genome. Moreover, certain vectors, namely 
expression vectors, are capable of directing the expression of genes to which they are 

30 operably linked. In general, expression vectors of utility in recombinant DNA 
techniques are often in the form of plasmids (vectors). However, the invention is 
intended to include such other forms of expression vectors, such as viral vectors (e.g., 
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replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
serve equivalent functions. 

The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell. This 
5 means that the recombinant expression vectors include one or more regulatory 
sequences, selected on the basis of the host cells to be used for expression, which is 
operably linked to the nucleic acid sequence to be expressed. Within a recombinant 
expression vector, "operably linked" is intended to mean that the nucleotide sequence of 
interest is linked to the regulatory sequence(s) in a manner which allows for expression 

10 of the nucleotide sequence (e.g. , in an in vitro transcription/translation system or in a 
host cell when the vector is introduced into the host cell). The term "regulatory 
sequence" is intended to include promoters, enhancers and other expression control 
elements {e.g., polyadenylation signals). Such regulatory sequences are described, for 
example, in Goeddel, Methods in Enzymology: Gene Expression Technology vol.185, 

15 Academic Press, San Diego, CA (1991). Regulatory sequences include those which 
direct constitutive expression of a nucleotide sequence in many types of host cell and 
those which direct expression of the nucleotide sequence only in certain host cells (e.g., 
tissue-specific regulatory sequences). It will be appreciated by those skilled in the art 
that the design of the expression vector can depend on such factors as the choice of the 

20 host cell to be transformed, the level of expression of protein desired, and the like. The 
expression vectors of the invention can be introduced into host cells to thereby produce 
proteins or peptides, including fusion proteins or peptides, encoded by nucleic acids as 
described herein. 

The recombinant expression vectors of the invention can be designed for 
25 expression of a polypeptide corresponding to a marker of the invention in prokaryotic 
(e.g., E. coli) or eukaryotic cells (e.g., insect cells {using baculovirus expression 
vectors}, yeast cells or mammalian cells). Suitable host cells are discussed further in 
Goeddel, supra. Alternatively, the recombinant expression vector can be transcribed 
and translated in vitro, for example using T7 promoter regulatory sequences and 17 
30 polymerase. 
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Expression of proteins in prokaryotes is most often carried out in E. coli with 
vectors containing constitutive or inducible promoters directing the expression of either 
fusion or non-fusion proteins. Fusion vectors add a number of amino acids to a protein 
encoded therein, usually to the amino terminus of the recombinant protein. Such fusion 
5 vectors typically serve three purposes: 1) to increase expression of recombinant protein; 
2) to increase the solubility of the recombinant protein; and 3) to aid in the purification 
of the recombinant protein by acting as a ligand in affinity purification. Often, in fusion 
expression vectors, a proteolytic cleavage site is introduced at the junction of the fusion 
moiety and the recombinant protein to enable separation of the recombinant protein 

10 from the fusion moiety subsequent to purification of the fusion protein. Such enzymes, 
and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. 
Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith and 
Johnson, 1988, Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) and 
pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), 

1 5 maltose E binding protein, or protein A, respectively, to the target recombinant protein. 
Examples of suitable inducible non-fusion E. coli expression vectors include 
pTrc (Amann et al y 1988, Gene 69:301-3 15) and pET 1 Id (Studier et al. 9 p. 60-89, In 
Gene Expression Technology: Methods in Enzymology vol.185, Academic Press, San 
Diego, CA, 1991). Target gene expression from the pTrc vector relies on host RNA 

20 polymerase transcription from a hybrid trp-lac fusion promoter. Target gene expression 
from the pET 1 Id vector relies on transcription from a T7 gnlO-lac fusion promoter 
mediated by a Co-expressed viral RNA polymerase (T7 gnl). This viral polymerase is 
supplied by host strains BL21(DE3) or HMS174(DE3) from a resident prophage 
harboring a T7 gnl gene under the transcriptional control of the lacUV 5 promoter. 

25 One strategy to maximize recombinant protein expression in E. coli is to express 

the protein in a host bacteria with an impaired capacity to proteolytically cleave the 
recombinant protein (Gottesman, p. 1 19-128, In Gene Expression Technology: Methods 
in Enzymology vol. 185, Academic Press, San Diego, CA, 1990. Another strategy is to 
alter the nucleic acid sequence of the nucleic acid to be inserted into an expression 

30 vector so that the individual codons for each amino acid are those preferentially utilized 
in E. coli (Wada et qL, 1992, Nucleic Acids Res. 20:2111-2118). Such alteration of 
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nucleic acid sequences of the invention can be carried out by standard DNA synthesis 
techniques. 

In another embodiment, the expression vector is a yeast expression vector. 
.Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari et 
5 al, 1987, EMBOJ. 6:229-234), pMFa (Kurjan and Herskowitz, 1982, Cell 30:933- 
943), pJRY88 (Schultz et al, 1987, Gene 54:1 13-123), pYES2 (Invitrogen Corporation, 
San Diego, CA), and pPicZ (Invitrogen Corp, San Diego, CA). 

Alternatively, the expression vector is a baculovirus expression vector. 
Baculovirus vectors available for expression of proteins in cultured insect cells {e.g., Sf 
1 0 9 cells) include the pAc series (Smith et al. , 1 983, Mol Cell Biol 3 :2 1 56-2 1 65) and the 
pVL series (Lucklowand Summers, 1989, Virology 170:31-39). 

In yet another embodiment, a nucleic acid of the invention is expressed in 
mammalian cells using a mammalian expression vector. Examples of mammalian 
expression vectors include pCDM8 (Seed, 1987, Nature 329:840) and pMT2PC 
1 5 (Kaufman et al, 1 987, EMBO J. 6:1 87-1 95). When used in mammalian cells, the 
expression vectors control functions are often provided by viral regulatory elements. 
For example, commonly used promoters are derived from polyoma, Adenovirus 2, 
cytomegalovirus and Simian Virus 40. For other suitable expression systems for both 
prokaryotic and eukaryotic cells see chapters 1 6 and 1 7 of Sambrook et al, supra. 
20 In another embodiment, the recombinant mammalian expression vector is 

capable of directing expression of the nucleic acid preferentially in a particular cell type 
(e.g., tissue-specific regulatory elements are used to express the nucleic acid). Tissue- 
specific regulatory elements are known in the art. Non-limiting examples of suitable 
tissue-specific promoters include the albumin promoter (liver-specific; Pinkert et al, 
1987, Genes Dev. 1 :268-277), lymphoid-specific promoters (Calame and Eaton, 1988, 
Adv. Immunol 43:235-275), in particular promoters of T cell receptors (Winoto and 
Baltimore, 1989, EMBO J. 8:729-733) and immunoglobulins (Banerji etal, 1983, Cell 
33:729-740; Queen and Baltimore, 1983, Cell 33:741-748), neuron-specific promoters 
(e.g., the neurofilament promoter; Byrne and Ruddle, 1989, Proc. Natl Acad. Sci. USA 
86:5473-5477), pancreas-specific promoters (Edlund et al, 1985, Science 230:912-916), 
and mammary gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 
4,873,316 and European Application Publication No. 264,166). Developmentally- 
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regulated promoters are also encompassed, for example the murine hox promoters 
(Kessel and Gruss, 1990, Science 249:374-379) and the cc-fetoprotein promoter (Camper 
-and Tilghman, 1989, Genes Dev. 3:537-546). 

The invention further provides a recombinant expression vector comprising a 
5 DNA molecule of the invention cloned into the expression vector in an antisense 

orientation. That is, the DNA molecule is operably linked to a regulatory sequence in a 
manner which allows for expression (by transcription of the DNA molecule) of an RNA 
molecule which is antisense to the mRNA encoding a polypeptide of the invention. 
Regulatory sequences operably linked to a nucleic acid cloned in the antisense 

1 0 orientation can be chosen which direct the continuous expression of the antisense RNA 
molecule in a variety of cell types, for instance viral promoters and/or enhancers, or 
regulatory sequences can be chosen which direct constitutive, tissue-specific or cell type 
specific expression of antisense RNA. The antisense expression vector can be in the 
form of a recombinant plasmid, phagemid, or attenuated virus in which antisense nucleic 

1 5 acids are produced under the control of a high efficiency regulatory region, the activity 
of which can be determined by the cell type into which the vector is introduced. For a 
discussion of the regulation of gene expression using antisense genes see Weintraub et 
al, 1986, Trends in Genetics, Vol. 1(1). 

Another aspect of the invention pertains to host cells into which a recombinant 

20 expression vector of the invention has been introduced. The terms "host cell M and 
"recombinant host cell" are used interchangeably herein. It is understood that such 
terms refer not only to the particular subject cell but to the progeny or potential progeny 
of such a cell. Because certain modifications may occur in succeeding generations due 
to either mutation or environmental influences, such progeny may not, in fact, be 

25 identical to the parent cell, but are still included within the scope of the term as used 
herein. 

A host cell can be any prokaryotic (e.g., E. coli) or eukaryotic cell (e.g., insect 
cells, yeast or mammalian cells). 

Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
30 conventional transformation or transfection techniques. As used herein, the terms 
"transformation" and "transfection" are intended to refer to a variety of art-recognized 
techniques for introducing foreign nucleic acid into a host cell, including calcium 
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phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, or electroporation. Suitable methods for transforming or transfecting host 
cells can be found in Sambrook, et al (supra), and other laboratory manuals. 

For stable transfection of mammalian cells, it is known that, depending upon the 
5 expression vector and transfection technique used, only a small fraction of cells may 
integrate the foreign DNA into their genome. In order to identify and select these 
integrants, a gene that encodes a selectable marker (e.g., for resistance to antibiotics) is 
generally introduced into the host cells along with the gene of interest. Preferred 
selectable markers include those which confer resistance to drugs, such as G418, 

10 hygromycin and methotrexate. Cells stably transfected with the introduced nucleic acid 
can be identified by drug selection (e.g., cells that have incorporated the selectable 
marker gene will survive, while the other cells die). 

A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce a polypeptide corresponding to a marker of the 

1 5 invention. Accordingly, the invention further provides methods for producing a 
polypeptide corresponding to a marker of the invention using the host cells of the 
invention. In one embodiment, the method comprises culturing the host cell of 
invention (into which a recombinant expression vector encoding a polypeptide of the 
invention has been introduced) in a suitable medium such that the marker is produced. 

20 In another embodiment, the method further comprises isolating the marker polypeptide 
from the medium or the host cell. 

The host cells of the invention can also be used to produce nonhuman transgenic 
animals. For example, in one embodiment, a host cell of the invention is a fertilized 
oocyte or an embryonic stem cell into which a sequences encoding a polypeptide 

25 corresponding to a marker of the invention have been introduced. Such host cells can 
then be used to create non-human transgenic animals in which exogenous sequences 
encoding a marker protein of the invention have been introduced into their genome or 
homologous recombinant animals in which endogenous gene(s) encoding a polypeptide 
corresponding to a marker of the invention sequences have been altered. Such animals 

30 are useful for studying the function and/or activity of the polypeptide corresponding to 
the marker and for identifying and/or evaluating modulators of polypeptide activity. As 
used herein, a "transgenic animal" is a non-human animal, preferably a mammal, more 
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preferably a rodent such as a rat or mouse, in which one or more of the cells of the 
animal includes a transgene. Other examples of transgenic animals include non-human 
primates, sheep, dogs, cows, goats, chickens, amphibians, etc. A transgene is exogenous 
DNA which is integrated into the genome of a cell from which a transgenic animal 
5 ■ develops and which remains in the genome of the mature animal, thereby directing the 
expression of an encoded gene product in one or more cell types or tissues of the 
transgenic animal. As used herein, an "homologous recombinant animaT is a non- 
human animal, preferably a mammal, more preferably a mouse, in which an endogenous 
gene has been altered by homologous recombination between the endogenous gene and 

10 an exogenous DNA molecule introduced into a cell of the animal, e.g., an embryonic 
cell of the animal, prior to development of the animal. 

A transgenic animal of the invention can be created by introducing a nucleic acid 
encoding a polypeptide corresponding to a marker of the invention into the male 
pronuclei of a fertilized oocyte, e.g., by microinjection, retroviral infection, and allowing 

15 the oocyte to develop in a pseudopregnant female foster animal. Intronic sequences and 
polyadenylation signals can also be included in the transgene to increase the efficiency 
of expression of the transgene. A tissue-specific regulatory sequence(s) can be operably 
linked to the transgene to direct expression of the polypeptide of the invention to 
particular cells. Methods for generating transgenic animals via embryo manipulation 

20 and microinjection, particularly animals such as mice, have become conventional in the 
art and are described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009, U.S. 
Patent No. 4,873,1 9 1 and in Hogan, Manipulating the Mouse Embryo, Cold Spring 
Harbor Laboratory Press, Cold Spring Harbor, N.Y., 1986. Similar methods are used for 
production of other transgenic animals. A transgenic founder animal can be identified 

25 based upon the presence of the transgene in its genome and/or expression of mRNA 
encoding the transgene in tissues or cells of the animals. A transgenic founder animal 
can then be used to breed additional animals carrying the transgene. Moreover, 
transgenic animals carrying the transgene can further be bred to other transgenic animals 
carrying other transgenes. 

30 To create an homologous recombinant animal, a vector is prepared which 

contains at least a portion of a gene encoding a polypeptide corresponding to a marker of 
the invention into which a deletion, addition or substitution has been introduced to 
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thereby alter, e.g., functionally disrupt, the gene. In a preferred embodiment, the vector 
is designed such that, upon homologous recombination, the endogenous gene is 
functionally disrupted (i.e., no longer encodes a functional protein; also referred to as a 
« "knock out 1 ' vector). Alternatively, the vector can be designed such that, upon 
5 homologous recombination, the endogenous gene is mutated or otherwise altered but 
still encodes functional protein (e.g., the upstream regulatory region can be altered to 
thereby alter the expression of the endogenous protein). In the homologous 
recombination vector, the altered portion of the gene is flanked at its 5 1 and 3 f ends by 
additional nucleic acid of the gene to allow for homologous recombination to occur 

10 between the exogenous gene carried by the vector and an endogenous gene in an 

embryonic stem cell. The additional flanking nucleic acid sequences are of sufficient 
length for successful homologous recombination with the endogenous gene. Typically, 
several kilobases of flanking DNA (both at the 5' and 3 1 ends) are included in the vector 
(see, e.g., Thomas and Capecchi, 1987, Cell 51:503 for a description of homologous 

15 recombination vectors). The vector is introduced into an embryonic stem cell line (e.g., 
by electroporation) and cells in which the introduced gene has homologously 
recombined with the endogenous gene are selected (see, e.g., Li et al, 1992, Cell 
69:915). The selected cells are then injected into a blastocyst of an animal (e.g., a 
mouse) to form aggregation chimeras (see, e.g., Bradley, Teratocarcinomas and 

20 Embryonic Stem Cells: A Practical Approach, Robertson, Ed., IRL, Oxford, 1987, pp. 
1 13-152). A chimeric embryo can then be implanted into a suitable pseudopregnant 
female foster animal and the embryo brought to term. Progeny harboring the 
homologously recombined DNA in their germ cells can be used to breed animals in 
which all cells of the animal contain the homologously recombined DNA by germline 

25 transmission of the transgene. Methods for constructing homologous recombination 
vectors and homologous recombinant animals are described further in Bradley (1991) 
Current Opinion in Bio/Technology 2:823-829 and in PCT Publication NOS. WO 
90/1 1354, WO 91/01 140, WO 92/0968, and WO 93/04169. 

In another embodiment, transgenic non-human animals can be produced which 

30 contain selected systems which allow for regulated expression of the transgene. One 
example of such a system is the cre/loxP recombinase system of bacteriophage PI. For 
a description of the cre/loxP recombinase system, see, e.g., Lakso et al (1992) Proc. 
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Natl Acad. Scl USA 89:6232-6236. Another example of a recombinase system is the 
FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al t 1991, Science 
251:135 1-1355). If a cre/loxP recombinase system is used to regulate expression of the 
•transgene, animals containing transgenes encoding both the Cre recombinase and a 
5 selected protein are required. Such animals can be provided through the construction of 
"double" transgenic animals, e.g., by mating two transgenic animals, one containing a 
transgene encoding a selected protein and the other containing a' transgene encoding a 
recombinase. 

Clones of the non-human transgenic animals described herein can also be 
10 produced according to the methods described in Wilmut et al (1997) Nature 385:810- 
813 and PCT Publication NOS. WO 97/07668 and WO 97/07669. 

IV. Pharmaceutical Compositions 

The nucleic acid molecules, polypeptides, and antibodies (also referred to herein 

15 as "active compounds") corresponding to a marker of the invention can be incorporated 
into pharmaceutical compositions suitable for administration. Such compositions 
typically comprise the nucleic acid molecule, protein, or antibody and a 
pharmaceutically acceptable carrier. As used herein the language "pharmaceutically 
acceptable carrier" is intended to include any and all solvents, dispersion media, 

20 coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 
and the like, compatible with pharmaceutical administration. The use of such media and 
agents for pharmaceutically active substances is well known in the art. Except insofar as 
any conventional media or agent is incompatible with the active compound, use thereof 
in the compositions is contemplated. Supplementary active compounds can also be 

25 incorporated into the compositions. 

The invention includes methods for preparing pharmaceutical compositions for 
modulating the expression or activity of a polypeptide or nucleic acid corresponding to a 
marker of the invention. Such methods comprise formulating a pharmaceutically 
acceptable carrier with an agent which modulates expression or activity of a polypeptide 

30 or nucleic acid corresponding to a marker of the invention. Such compositions can 
further include additional active agents. Thus, the invention further includes methods 
for preparing a pharmaceutical composition by formulating a pharmaceutically 
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acceptable carrier with an agent which modulates expression or activity of a polypeptide 
or nucleic acid corresponding to a marker of the invention and one or more additional 
active compounds. 

The invention also provides methods (also referred to herein as 
5 "screening assays") for identifying modulators, i.e., candidate or test compounds or 
agents (e.g., peptides, peptidomimetics, peptoids, small molecules or other drugs) which 
(a) bind to the marker, or (b) have a modulatory (e.g., stimulatory or inhibitory) effect 
on the activity of the marker or, more specifically, (c) have a modulatory effect on the 
interactions of the marker with one or more of its natural substrates (e.g., peptide, 
10 protein, hormone, co-factor, or nucleic acid), or (d) have a modulatory effect on the 

expression of the marker. Such assays typically comprise a reaction between the marker 
and one or more assay components. The other components may be either the test 
compound itself, or a combination of test compound and a natural binding partner of the 
marker. 

15 The test compounds of the present invention may be obtained from any available 

source, including systematic libraries of natural and/or synthetic compounds. Test 
compounds may also be obtained by any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; peptoid libraries 
(libraries of molecules having the functionalities of peptides, but with a novel, non- 
20 peptide backbone which are resistant to enzymatic degradation but which nevertheless 
remain bioactive; see, e.g., Zuckermann et al., 1994, J. Med. Chem. 37:2678-85); 
spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; the 'one-bead one-compound 1 library method; and 
synthetic library methods using affinity chromatography selection. The biological 
25 library and peptoid library approaches are limited to peptide libraries, while the other 
four approaches are applicable to peptide, non-peptide oligomer or small molecule 
libraries of compounds (Lam, 1997, Anticancer Drug Des. 12: 145). 

Examples of methods for the synthesis of molecular libraries can be found in the 
art, for example in: DeWitt et al (1993) Proc. Natl. Acad. Sci. U.S.A. 90:6909; Erb et 
30 al. (1994) Proc. Natl. Acad. Sci. USA 91:1 1422; Zuckermann et al. (1994). J. Med. 
Chem. 37:2678; Cho etal (1993) Science 261:1303; Carrell etal. (1994) Angew. Chem. 
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Int. Ed. Engl 33:2059; Carell etal. (1994) Angew. Chem. Int. Ed Engl 33:2061; and in 
Gallop etal (1994) J. Med Chem. 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten, 1992, 
Biotechniques 13:412-421), or on beads (Lam, 1991, Nature 354:82-84), chips (Fodor, 
5 1 993, Nature 364:555-556), bacteria and/or spores, (Ladner, USP 5,223,409), plasmids 
(Cull etal, 1992, Proc Natl Acad Sci USA 89:1865-1869) or on phage (Scott and Smith, 
1990, Science 249:386-390; Devlin, 1990, Science 249:404-406; Cwirla et al, 1990, 
Proc. Natl Acad Sci. 87:6378-6382; Felici, 1991, J. Mol Biol 222:301-310; Ladner, 
supra.). 

10 In one embodiment, the invention provides assays for screening candidate or test 

compounds which are substrates of a marker or biologically active portion thereof. In 
another embodiment, the invention provides assays for screening candidate or test 
compounds which bind to a marker or biologically active portion thereof. Determining 
the ability of the test compound to directly bind to a marker can be accomplished, for 

1 5 example, by coupling the compound with a radioisotope or enzymatic label such that 
binding of the compound to the marker can be determined by detecting the labeled 
marker compound in a complex. For example, compounds (e.g., marker substrates) can 
be labeled with 125 1, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope 
detected by direct counting of radioemission or by scintillation counting. Alternatively, 

20 assay components can be enzymatically labeled with, for example, horseradish 
peroxidase, alkaline phosphatase, or luciferase, and the enzymatic label detected by 
determination of conversion of an appropriate substrate to product. 

In another embodiment, the invention provides assays for screening candidate or 
test compounds which modulate the activity of a marker or a biologically active portion 

25 thereof. In all likelihood, the marker can, in vivo, interact with one or more molecules, 
such as but not limited to, peptides, proteins, hormones, cofactors and nucleic acids. For 
the purposes of this discussion, such cellular and extracellular molecules are referred to 
herein as "binding partners" or marker "substrate". 

One necessary embodiment of the invention in order to facilitate such screening 

30 is the use of the marker to identify its natural in vivo binding partners. There are many 
ways to accomplish this which are known to one skilled in the art. One example is the 
use of the marker protein as "bait protein" in a two-hybrid assay or three-hybrid assay 
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(see, eg., U.S. Patent No. 5,283,317; Zervos etal, 1993, Cell 72:223-232; Madura etal, 
1993, J. Biol Chem. 268:12046-12054; Bartel et al ,1993, Biotechniques 14:920-924; 
Iwabuchi et al, 1993 Oncogene 8:1693-1696; Brent WO94/10300) in order to identify 
other proteins which bind to or interact with the marker (binding partners) and, 
5 therefore, are possibly involved in the natural function of the marker. Such marker 
binding partners are also likely to be involved in the propagation of signals by the 
marker or downstream elements of a marker-mediated signaling pathway. Alternatively, 
such marker binding partners may also be found to be inhibitors of the marker. 

The two-hybrid system is based on the modular nature of most transcription 

10 factors, which consist of separable DNA-binding and activation domains. Briefly, the 
assay utilizes two different DNA constructs. In one construct, the gene that encodes a 
marker protein fused to a gene encoding the DNA binding domain of a known 
transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from a 
library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 

1 5 fused to a gene that codes for the activation domain of the known transcription factor. If 
the "bait" and the "prey" proteins are able to interact, in vivo, forming a marker- 
• dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 

20 the transcription factor. Expression of the reporter gene can be readily detected and cell 
colonies containing the functional transcription factor can be isolated and used to obtain 
the cloned gene which encodes the protein which interacts with the marker protein. 

In a further embodiment, assays may be devised through the use of the invention 
for the purpose of identifying compounds which modulate (e.g., affect either positively 

25 or negatively) interactions between a marker and its substrates and/or binding partners. 
Such compounds can include, but are not limited to, molecules such as antibodies, 
peptides, hormones, oligonucleotides, nucleic acids, and analogs thereof. Such 
compounds may also be obtained from any available source, including systematic 
libraries of natural and/or synthetic compounds. The preferred assay components for use 

30 in this embodiment is an ovarian cancer marker identified herein, the known binding 
partner and/or substrate of same, and the test compound. Test compounds can be 
supplied from any source. 



WO 01/070979 



PCTAJS01/09126 



-66- 

The basic principle of the assay systems used to identify compounds that 
interfere with the interaction between the marker and its binding partner involves 
preparing a reaction mixture containing the marker and its binding partner under 
conditions and for a time sufficient to allow the two products to interact and bind, thus 
5 forming a complex. In order to test an agent for inhibitory activity, the reaction mixture 
is prepared in the presence and absence of the test compound. The test compound can 
be initially included in the reaction mixture, or can be added at a time subsequent to the 
addition of the marker and its binding partner. Control reaction mixtures are incubated 
without the test compound or with a placebo. The formation of any complexes between 

10 the marker and its binding partner is then detected. The formation of a complex in the 
control reaction, but less or no such formation in the reaction mixture containing the test 
compound, indicates that the compound interferes with the interaction of the marker and 
its binding partner. Conversely, the formation of more complex in the presence of 
compound than in the control reaction indicates that the compound may enhance 

1 5 interaction of the marker and its binding partner. 

The assay for compounds that interfere with the interaction of the marker with its 
binding partner may be conducted in a heterogeneous or homogeneous format. 
Heterogeneous assays involve anchoring either the marker or its binding partner onto a 
solid phase and detecting complexes anchored to the solid phase at the end of the 

20 reaction. In homogeneous assays, the entire reaction is carried out in a liquid phase. In 
either approach, the order of addition of reactants can be varied to obtain different 
information about the compounds being tested. For example, test compounds that 
interfere with the interaction between the markers and the binding partners (e.g., by 
competition) can be identified by conducting the reaction in the presence of the test 

25 substance, i.e., by adding the test substance to the reaction mixture prior to or 

simultaneously with the marker and its interactive binding partner. Alternatively, test 
compounds that disrupt preformed complexes, e.g., compounds with higher binding 
constants that displace one of the components from the complex, can be tested by adding 
the test compound to the reaction mixture after complexes have been formed. The 

30 various formats are briefly described below. 
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In a heterogeneous assay system, either the marker or its binding partner is 
anchored onto a solid surface or matrix, while the other corresponding non-anchored 
component may be labeled, either directly or indirectly. In practice, microtitre plates are 
often utilized for this approach. The anchored species can be immobilized by a number 
5 of methods, either non-covalent or covalent, that are typically well known to one who 
practices the art. Non-covalent attachment can often be accomplished simply by coating 
the solid surface with a solution of the marker or its binding partner and drying. 
Alternatively, an immobilized antibody specific for the assay component to be anchored 
can be used for this purpose. Such surfaces can often be prepared in advance and stored. 

10 In related embodiments, a fusion protein can be provided which adds a domain 

that allows one or both of the assay components to be anchored to a matrix. For 
example, glutathione-S-transferase/marker fusion proteins or glutathione-S- 
transferase/binding partner can be adsorbed onto glutathione sepharose beads (Sigma 
Chemical, St. Louis, MO) or glutathione derivatized microtiter plates, which are then 

1 5 combined with the test compound or the test compound and either the non-adsorbed 
marker or its binding partner, and the mixture incubated under conditions conducive to 
complex formation (e.g., physiological conditions). Following incubation, the beads or 
microtiter plate wells are washed to remove any unbound assay components, the 
immobilized complex assessed either directly or indirectly, for example, as described 

20 above. Alternatively, the complexes can be dissociated from the matrix, and the level of 
marker binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 
screening assays of the invention. For example, either a marker or a marker binding 
partner can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 

25 marker protein or target molecules can be prepared from biotin-NHS (N-hydroxy- 

succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, 
Rockford, IL), and immobilized in the wells of streptavidin-coated 96 well plates (Pierce 
Chemical). In certain embodiments, the protein-immobilized surfaces can be prepared in 
advance and stored. 

30 In order to conduct the assay, the corresponding partner of the immobilized assay 

component is exposed to the coated surface with or without the test compound. After 
the reaction is complete, unreacted assay components are removed (e.g., by washing) 
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and any complexes formed will remain immobilized on the solid surface. The detection 
of complexes anchored on the solid surface can be accomplished in a number , of ways. 
Where the non-immobilized component is pre-labeled, the detection of label 
immobilized on the surface indicates that complexes were formed. Where the non- 
5 immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface; e.g., using a labeled antibody specific for the 
initially non-immobilized species (the antibody, in turn, can be directly labeled or 
indirectly labeled with, e.g., a labeled anti-Ig antibody). Depending upon the order of 
addition of reaction components, test compounds which modulate (inhibit or enhance) 

10 complex formation or which disrupt preformed complexes can be detected. 

In an alternate embodiment of the invention, a homogeneous assay may be used. 
This is typically a reaction, analogous to those mentioned above, which is conducted in 
a liquid phase in the presence or absence of the test compound. The formed complexes 
are then separated from unreacted components, and the amount of complex formed is 

15 determined. As mentioned for heterogeneous assay systems, the order of addition of 
reactants to the liquid phase can yield information about which test compounds 
modulate (inhibit or enhance) complex formation and which disrupt preformed 
complexes. 

In such a homogeneous assay, the reaction products may be separated from 
20 unreacted assay components by any of a number of standard techniques, including but 
not limited to: differential centrifiigation, chromatography, electrophoresis and 
immunoprecipitation. In differential centrifiigation, complexes of molecules may be 
separated from uncomplexed molecules through a series of centrifugal steps, due to the 
different sedimentation equilibria of complexes based on their different sizes and 
25 densities (see, for example, Rivas, G., and Minton, A.P., Trends Biochem Sci 1993 

Aug;18(8):284-7). Standard chromatographic techniques may also be utilized to separate 
complexed molecules from uncomplexed ones. For example, gel filtration 
chromatography separates molecules based on size, and through the utilization of an 
appropriate gel filtration resin in a column format, for example, the relatively larger 
30 complex may be separated from the relatively smaller uncomplexed components. 

Similarly, the relatively different charge properties of the complex as compared to the 
uncomplexed molecules may be exploited to differentially separate the complex from 
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the remaining individual reactants, for example through the use of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, 1998, JMol Recognit. 1 1:141-148; Hage and 
Tweed, 1997, J. Chromatogr. B. Biomed Set AppL, 699:499-525). Gel electrophoresis 
5 may also be employed to separate complexed molecules from unbound species (see, 
e.g., Ausubel et al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, 
New York. 1999). In this technique, protein or nucleic acid complexes are separated 
based on size or charge, for example. In order to maintain the binding interaction during 
the electrophoretic process, nondenaturing gels in the absence of reducing agent are 

10 typically preferred, but conditions appropriate to the particular interactants will be well 
known to one skilled in the art. Immunoprecipitation is another common technique 
utilized for the isolation of a protein-protein complex from solution (see, e.g., Ausubel et 
al (eds.), In: Current Protocols in Molecular Biology, J. Wiley & Sons, New York. 
1 999). In this technique, all proteins binding to an antibody specific to one of the 

15 binding molecules are precipitated from solution by conjugating the antibody to a 
polymer bead that may be readily collected by centrifugation. The bound assay 
components are released from the beads (through a specific proteolycis event or other 
technique well known in the art which will not disturb the protein-protein interaction in 
the complex), and a second immunoprecipitation step is performed, this time utilizing 

20 antibodies specific for the correspondingly different interacting assay component. In this 
manner, only formed complexes should remain attached to the beads. Variations in 
complex formation in both the presence and the absence of a test compound can be 
compared, thus offering information about the ability of the compound to modulate 
interactions between the marker and its binding partner. 

25 Also within the scope of the present invention are methods for direct detection of 

interactions between the marker and its natural binding partner and/or a test compound 
in a homogeneous or heterogeneous assay system without further sample manipulation. 
For example, the technique of fluorescence energy transfer may be utilized (see, e.g., 
Lakowicz et al, U.S. Patent No. 5,63 1 , 1 69; Stavrianopoulos et al, U.S. Patent No. 

30 4,868,103). Generally, this technique involves the addition of a fluorophore label on a 
first 'donor 5 molecule (e.g., marker or test compound) such that its emitted fluorescent 
energy will be absorbed by a fluorescent label on a second, 'acceptor 5 molecule (e.g., 
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marker or test compound), which in turn is able to fluoresce due to the absorbed energy. 
Alternately, the 'donor' protein molecule may simply utilize the natural fluorescent 
energy of tryptophan residues. Labels are chosen that emit different wavelengths of 
light, such that the 'acceptor' molecule label may be differentiated from that of the 
5 'donor'. Since the efficiency of energy transfer between the labels is related to the 
distance separating the molecules, spatial relationships between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent 
emission of the 'acceptor' molecule label in the assay should be maximal. An FET 
binding event can be conveniently measured through standard fluorometric detection 

10 means well known in the art (e.g., using a fluorimeter). A test substance which either 
enhances or hinders participation of one of the species in the preformed complex will 
result in the generation of a signal variant to that of background. In this way, test 
substances that modulate interactions between a marker and its binding partner can be 
identified in controlled assays. 

15 In another embodiment, modulators of marker expression are identified in a 

method wherein a cell is contacted with a candidate compound and the expression of 
mRNA or protein, corresponding to a marker in the cell, is determined. The level of 
expression of mRNA or protein in the presence of the candidate compound is compared 
to the level of expression of mRNA or protein in the absence of the candidate 

20 compound. The candidate compound can then be identified as a modulator of marker 
expression based on this comparison. For example, when expression of marker mRNA 
or protein is greater (statistically significantly greater) in the presence of the candidate 
compound than in its absence, the candidate compound is identified as a stimulator of 
marker mRNA or protein expression. Conversely, when expression of marker mRNA 

25 or protein is less (statistically significantly less) in the presence of the candidate 

compound than in its absence, the candidate compound is identified as an inhibitor of 
marker mRNA or protein expression. The level of marker mRNA or protein expression 
in the cells can be determined by methods described herein for detecting marker mRNA 
or protein. 

30 In another aspect, the invention pertains to a combination of two or more of the 

assays described herein. For example, a modulating agent can be identified using a cell- 
based or a cell free assay, and the ability of the agent to modulate the activity of a 



WO 01/070979 



PCT/US01/09126 



-71- 

marker protein can be further confirmed in vivo, e.g., in a whole animal model for 
cellular transformation and/or tumorigenesis. 

This invention further pertains to novel agents identified by the above-described 
screening assays. Accordingly, it is within the scope of this invention to further use an 
5 agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., an marker modulating agent, an antisense 
marker nucleic acid molecule, an marker-specific antibody, or an marker-binding 
partner) can be used in an animal model to determine the efficacy, toxicity, or side 
effects of treatment with such an agent. Alternatively, an agent identified as described 

1 0 herein can be used in an animal model to determine the mechanism of action of such an 
agent. Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herein. 

It is understood that appropriate doses of small molecule agents and protein or 
polypeptide agents depends upon a number of factors within the knowledge of the 

15 ordinarily skilled physician, veterinarian, or researcher. The dose(s) of these agents will 
vary, for example, depending upon the identity, size, and condition of the subject or 
sample being treated, further depending upon the route by which the composition is to 
be administered, if applicable, and the effect which the practitioner desires the agent to 
have upon the nucleic acid or polypeptide of the invention. Exemplary doses of a small 

20 molecule include milligram or microgram amounts per kilogram of subject or sample 
weight (e.g. about 1 microgram per kilogram to about 500 milligrams per kilogram, 
about 100 micrograms per kilogram to about 5 milligrams per kilogram, or about 1 
microgram per kilogram to about 50 micrograms per kilogram). Exemplary doses of a 
protein or polypeptide include gram, milligram or microgram amounts per kilogram of 

25 subject or sample weight (e.g. about 1 microgram per kilogram to about 5 grams per 
kilogram, about 100 micrograms per kilogram to about 500 milligrams per kilogram, or 
about 1 milligram per kilogram to about 50 milligrams per kilogram). It is furthermore 
understood that appropriate doses of one of these agents depend upon the potency of the 
agent with respect to the expression or activity to be modulated. Such appropriate doses 

30 can be determined using the assays described herein. When one or more of these agents 
is to be administered to an animal (e.g. a human) in order to modulate expression or 
activity of a polypeptide or nucleic acid of the invention, a physician, veterinarian, or 
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researcher can, for example, prescribe a relatively low dose at first, subsequently 
increasing the dose until an appropriate response is obtained. In addition, it is 
understood that the specific dose level for any particular animal subject will depend 
upon a variety of factors including the activity of the specific agent employed, the age, 
5 body weight, general health, gender, and diet of the subject, the time of administration, 
the route of administration, the rate of excretion, any drug combination, and the degree 
of expression or activity to be modulated. 

A pharmaceutical composition of the invention is formulated to be compatible 
with its intended route of administration. Examples of routes of administration include 

10 parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 

transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 

1 5 antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediamine-tetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 

20 enclosed in ampules, disposable syringes or multiple dose vials made of glass or plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous 
solutions (where water soluble) or dispersions and sterile powders for the 
extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, bacteriostatic 

25 water, Cremophor EL (BASF; Parsippany, NJ) or phosphate buffered saline (PBS). In 
all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It must be stable under the conditions of manufacture and storage 
and must be preserved against the contaminating action of microorganisms such as 
bacteria and fungi. The carrier can be a solvent or dispersion medium containing, for 

30 example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid 

polyethylene glycol, and the like), and suitable mixtures thereof. The proper fluidity can 
be maintained, for example, by the use of a coating such as lecithin, by the maintenance 
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of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 
5 for example, sugars, polyalcohols such as mannitol, sorbitol, or sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

Sterile injectable solutions can be prepared by incorporating the active 

10 compound (e.g., a polypeptide or antibody) in the required amount in an appropriate 
solvent with one or a combination of ingredients enumerated above, as required, 
followed by filtered sterilization. Generally, dispersions are prepared by incorporating 
the active compound into a sterile vehicle which contains a basic dispersion medium, 
and then incorporating the required other ingredients from those enumerated above. In 

15 the case of sterile powders for the preparation of sterile injectable solutions, the 

preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They 

20 can be enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

25 Pharmaceutical^ compatible binding agents, and/or adjuvant materials can be 

included as part of the composition. The tablets, pills, capsules, troches, and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder 
such as microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch 
or lactose, a disintegrating agent such as alginic acid, Primogel, or com starch; a 

30 lubricant such as magnesium stearate or Sterotes; a glidant such as colloidal silicon 
dioxide; a sweetening agent such as sucrose or saccharin; or a flavoring agent such as 
peppermint, methyl salicylate, or orange flavoring. 
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For administration by inhalation, the compounds are delivered in the form of an 
aerosol spray from a pressurized container or dispenser which contains a suitable 
propellant, e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
5 transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
10 compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

1 5 In one embodiment, the active compounds are prepared with carriers that will 

protect the compound against rapid elimination from the body, such as a controlled 
release formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 

20 Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 
Pharmaceuticals, Inc. Liposomal suspensions (including liposomes having monoclonal 
antibodies incorporated therein or thereon) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those skilled 

25 in the art, for example, as described in U.S. Patent No. 4,522,81 1 . 

It is especially advantageous to formulate oral or parenteral compositions in 
dosage unit form for ease of administration and uniformity of dosage. Dosage unit form 
as used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 

30 calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. The specification for the dosage unit forms of the invention are 
dictated by and directly dependent on the unique characteristics of the active compound 
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and the particular therapeutic effect to be achieved, and* the limitations inherent in the art 
of compounding such an active compound for the treatment of individuals. 

For antibodies, the preferred dosage is 0.1 mg/kg to 100 mg/kg of body weight 
• (generally 10 mg/kg to 20 mg/kg). If the antibody is to act in the brain, a dosage of 50 
5 mg/kg to 100 mg/kg is usually appropriate. Generally, partially human antibodies and 
fully human antibodies have a longer half-life within the human body than other 
antibodies. Accordingly, lower dosages and less frequent administration is often 
possible. Modifications such as lipidation can be used to stabilize antibodies and to 
enhance uptake and tissue penetration (e.g., into the ovarian epithelium). A method for 
10 lipidation of antibodies is described by Cruikshank et al (1997) J. Acquired Immune 
Deficiency Syndromes and Human Retrovirology 14:193. 

The nucleic acid molecules corresponding to a marker of the invention can be 
inserted into vectors and used as gene therapy vectors. Gene therapy vectors can be 
delivered to a subject by, for example, intravenous injection, local administration (U.S. 
15 Patent 5,328,470), or by stereotactic injection (see, e.g., Chen et al., 1994, Proc. Natl 
Acad. Sci. USA 91:3054-3057). The pharmaceutical preparation of the gene therapy 
vector can include the gene therapy vector in an acceptable diluent, or can comprise a 
slow release matrix in which the gene delivery vehicle is imbedded. Alternatively, 
where the complete gene delivery vector can be produced intact from recombinant cells, 
20 e.g. retroviral vectors, the pharmaceutical preparation can include one or more cells 
which produce the gene delivery system. 

The pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

25 V, Predictive Medicine 

The present invention pertains to the field of predicti ve medicine in which 
diagnostic assays, prognostic assays, pharmacogenomics, and monitoring clinical trails 
are used for prognostic (predictive) purposes to thereby treat an individual 
prophylactically. Accordingly, one aspect of the present invention relates to diagnostic 

30 assays for determining the level of expression of polypeptides or nucleic acids 

corresponding to one or more markers of the invention, in order to determine whether 
an individual is at risk of developing ovarian cancer. Such assays can be used for 
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prognostic or predictive purposes to thereby prophylactically treat an individual prior to 
the onset of the cancer. 

Yet another aspect of the invention pertains to monitoring the influence of agents 
(e.g., drugs or other compounds administered either to inhibit ovarian cancer or to treat 
5 or prevent any other disorder {i.e. in order to understand any ovarian carcinogenic 
effects that such treatment may have} ) on the expression or activity of a marker of the 
invention in clinical trials. These and other agents are described in further detail in the 
following sections. 

10 A. Diagnostic Assays 

An exemplary method for detecting the presence or absence of a polypeptide or 
nucleic acid corresponding to a marker of the invention in a biological sample involves 
obtaining a biological sample (e.g. an ovary-associated body fluid) from a test subject 
and contacting the biological sample with a compound or an agent capable of detecting 

15 the polypeptide or nucleic acid (e.g., mRNA, genomic DNA, or cDNA). The detection 
methods of the invention can thus be used to detect mRNA, protein, cDNA, or genomic 
DNA, for example, in a biological sample in vitro as well as in vivo. For example, in 
viti o techniques for detection of mRNA include Northern hybridizations and in situ 
hybridizations. In vitro techniques for detection of a polypeptide corresponding to a 

20 marker of the invention include enzyme linked immunosorbent assays (ELISAs), 

Western blots, immunoprecipitations and immunofluorescence. In vitro techniques for 
detection of genomic DNA include Southern hybridizations. Furthermore, in vivo 
techniques for detection of a polypeptide corresponding to a marker of the invention 
include introducing into a subject a labeled antibody directed against the polypeptide. 

25 For example, the antibody can be labeled with a radioactive marker whose presence and 
location in a subject can be detected by standard imaging techniques. 

A general principle of such diagnostic and prognostic assays involves preparing 
a sample or reaction mixture that may contain a marker, and a probe, under appropriate 
conditions and for a time sufficient to allow the marker and probe to interact and bind, 

30 thus forming a complex that can be removed and/or detected in the reaction mixture. 
These assays can be conducted in a variety of ways. 



WO 01/070979 



PCTYUS01/09126 



- 77 - 

For example, one method to conduct such an assay would involve anchoring the 
marker or probe onto a solid phase support, also referred to as a substrate, and detecting 
target marker/probe complexes anchored on the solid phase at the end of the reaction. 
In one embodiment of such a method, a sample from a subject, which is to be assayed 
5 for presence and/or concentration of marker, can be anchored onto a carrier or solid 
phase support. In another embodiment, the reverse situation is possible, in which the 
probe can be anchored to a solid phase and a sample from a subject can be allowed to 
react as an unanchored component of the assay. 

There are many established methods for anchoring assay components to a solid 
10 phase. These include, without limitation, marker or probe molecules which are 

immobilized through conjugation of biotin and streptavidin. Such biotinylated assay 
components can be prepared from biotin-NHS (N-hydroxy-succinimide) using 
techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In 
15 certain embodiments, the surfaces with immobilized assay components can be prepared 
in advance and stored. 

Other suitable carriers or solid phase supports for such assays include any 
material capable of binding the class of molecule to which the marker or probe belongs. 
Well-known supports or carriers include, but are not limited to, glass, polystyrene, 
20 nylon, polypropylene, nylon, polyethylene, dextran, amylases, natural and modified 
celluloses, polyacrylamides, gabbros, and magnetite. 

In order to conduct assays with the above mentioned approaches, the non- 
immobilized component is added to the solid phase upon which the second component 
is anchored. After the reaction is complete, uncomplexed components may be removed 
25 (e.g., by washing) under conditions such that any complexes formed will remain 

immobilized upon the solid phase. The detection of marker/probe complexes anchored 
to the solid phase can be accomplished in a number of methods outlined herein. 

In a preferred embodiment, the probe, when it is the unanchored assay 
component, can be labeled for the purpose of detection and readout of the assay, either 
30 directly or indirectly, with detectable labels discussed herein and which are well-known 
to one skilled in the art. 
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It is also possible to directly detect marker/probe complex formation without 
further manipulation or labeling of either component (marker or probe), for example by 
utilizing the technique of fluorescence energy transfer (see, for example, Lakowicz et 
al 9 U.S. Patent No. 5,631,169; Stavrianopoulos, et al. 9 U.S. Patent No. 4,868,103). A 
5 fluorophore label on the first, 'donor' molecule is selected such that, upon excitation 
with incident light of appropriate wavelength, its emitted fluorescent energy will be 
absorbed by a fluorescent label on a second 'acceptor' molecule, which in turn is able to 
fluoresce due to the absorbed energy. Alternately, the 'donor' protein molecule may 
simply utilize the natural fluorescent energy of tryptophan residues. Labels are chosen 

10 that emit different wavelengths of light, such that the 'acceptor' molecule label may be 
differentiated from that of the 'donor'. Since the efficiency of energy transfer between 
the labels is related to the distance separating the molecules, spatial relationships 
between the molecules can be assessed. In a situation in which binding occurs between 
the molecules, the fluorescent emission of the 'acceptor' molecule label in the assay 

1 5 should be maximal. An FET binding event can be conveniently measured through 
standard fluorometric detection means well known in the art (e.g., using a fluorimeter). 

In another embodiment, determination of the ability of a probe to recognize a 
marker can be accomplished without labeling either assay component (probe or marker) 
by utilizing a technology such as real-time Biomolecular Interaction Analysis (BIA) 

20 (see, e.g., Sjolander, S. and Urbaniczky, C, 1991, Anal Chem. 63:2338-2345 and 
Szabo et al, 1995, Curr. Opin. Struct. Biol. 5:699-705). As used herein, "BIA" or 
"surface plasmon resonance" is a technology for studying biospecific interactions in real 
time, without labeling any of the interactants (e.g., BIAcore). Changes in the mass at the 
binding surface (indicative of a binding event) result in alterations of the refractive index 

25 of light near the surface (the optical phenomenon of surface plasmon resonance (SPR)), 
resulting in a detectable signal which can be used as an indication of real-time reactions 
between biological molecules. 

Alternatively, in another embodiment, analogous diagnostic and prognostic 
assays can be conducted with marker and probe as solutes in a liquid phase. In such an 

30 assay, the complexed marker and probe are separated from uncomplexed components by 
any of a number of standard techniques, including but not limited to: differential, 
centrifugation, chromatography, electrophoresis and immunoprecipitation. In 
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differential centrifugation, marker/probe complexes may be separated from 
uncomplexed assay components through a series of centrifugal steps, due to the different 
sedimentation equilibria of complexes based on their different sizes and densities (see, 
for example, Rivas, G., and Minton, A.P., 1993, Trends Biochem Sci. 18(8):284-7). 
5 Standard chromatographic techniques may also be utilized to separate complexed 
molecules from uncomplexed ones. For example, gel filtration chromatography 
separates molecules based on size, and through the utilization of an appropriate gel 
filtration resin in a column format, for example, the relatively larger complex may be 
separated from the relatively smaller uncomplexed components. Similarly, the 

10 relatively different charge properties of the marker/probe complex as compared to the 
uncomplexed components may be exploited to differentiate the complex from 
uncomplexed components, for example through the utilization of ion-exchange 
chromatography resins. Such resins and chromatographic techniques are well known to 
one skilled in the art (see, e.g., Heegaard, N.H., 1998, J. Mol Recognit Winter 1 1(1- 

15 6):141-8; Hage, D.S., and Tweed, S.A. JChromatogr B Biomed Sci Appl 1997 Oct 
10;699(l-2):499-525). Gel electrophoresis may also be employed to separate 
complexed assay components from unbound components (see, e.g., Ausubel et al 9 ed., 
Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1987-1999). 
In this technique, protein or nucleic acid complexes are separated based on size or 

20 charge, for example. In order to maintain the binding interaction during the 

electrophoretic process, non-denaturing gel matrix materials and conditions in the 
absence of reducing agent are typically preferred. Appropriate conditions to the 
particular assay and components thereof will be well known to one skilled in the art. 

In a particular embodiment, the level of mRNA corresponding to the marker can 

25 be determined both by in situ and by in vitro formats in a biological sample using 

methods known in the art. The term "biological sample" is intended to include tissues, 
cells, biological fluids and isolates thereof, isolated from a subject, as well as tissues, 
cells and fluids present within a subject. Many expression detection methods use 
isolated RNA. For in vitro methods, any RNA isolation technique that does not select 

30 against the isolation of mRNA can be utilized for the purification of RNA from ovarian 
cells (see, e.g. , Ausubel et al , ed., Current Protocols in Molecular Biology, John Wiley 
& Sons, New York 1987-1999). Additionally, large numbers of tissue samples can 
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readily be processed using techniques well known to those of skill in the art, such as, for 
example, the single-step RNA isolation process of Chomczynski (1989, U.S. Patent No. 
4,843,155). 

The isolated mRNA can be used in hybridization or amplification assays that 
5 include, but are not limited to, Southern or Northern analyses, polymerase chain reaction 
analyses and probe arrays. One preferred diagnostic method for the detection of mRNA 
levels involves contacting the isolated mRNA with a nucleic acid molecule (probe) that 
can hybridize to the mRNA encoded by the gene being detected. The nucleic acid probe 
can be, for example, a full-length cDNA, or a portion thereof, such as an oligonucleotide 

10 of at least 7, 15, 30, 50, 100, 250 or 500 nucleotides in length and sufficient to 

specifically hybridize under stringent conditions to a mRNA or genomic DNA encoding 
a marker of the present invention. Other suitable probes for use in the diagnostic assays 
of the invention are described herein. Hybridization of an mRNA with the probe 
indicates that the marker in question is being expressed. 

15 In one format, the mRNA is immobilized on a solid surface and contacted with a 

probe, for example by running the isolated mRNA on an agarose gel and transferring the 
mRNA from the gel to a membrane, such as nitrocellulose. In an alternative format, the 
probe(s) are immobilized on a solid surface and the mRNA is contacted with the 
probe(s), for example, in an Asymetrix gene chip array. A skilled artisan can readily 

20 adapt known mRNA detection methods for use in detecting the level of mRNA encoded 
by the markers of the present invention. 

An alternative method for determining the level of mRNA corresponding to a 
marker of the present invention in a sample involves the process of nucleic acid 
amplification, e.g., by rtPCR (the experimental embodiment set forth in Mullis, 1987, 

25 U.S. Patent No. 4,683,202), ligase chain reaction (Barany, 1991, Proc. Natl Acad. Sci. 
USA, 88:189-193), self sustained sequence replication (Guatelli et ai 9 1990, Proc. Natl 
Acad Sci. USA 87:1874-1878), transcriptional amplification system (Kwoh et al, 1989, 
Proc. Natl Acad. Sci. USA 86:1173-1177), Q-Beta Replicase (Lizardi etal, 1988, 
Bio/Technology 6:1 197), rolling circle replication (Lizardi et aL 9 U.S. Patent No. 

30 5,854,033) or any other nucleic acid amplification method, followed by the detection of 
the amplified molecules using techniques well known to those of skill in the art. These 
detection schemes are especially useful for the detection of nucleic acid molecules if 
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such molecules are present in very low numbers. As used herein, amplification primers 
are defined as being a pair of nucleic acid molecules that can anneal to 5' or 3/ regions 
of a gene (plus and minus strands, respectively, or vice-versa) and contain a short region 
in between. In general, amplification primers are from about 10 to 30 nucleotides in 
5 length and flank a region from about 50. to 200 nucleotides in length. Under appropriate 
conditions and with appropriate reagents, such primers permit the amplification of a 
nucleic acid molecule comprising the nucleotide sequence flanked by the primers. 

For in situ methods, mRNA does not need to be isolated from the ovarian cells 
prior to detection. In such methods, a cell or tissue sample is prepared/processed using 
1 0 known histological methods. The sample is then immobilized on a support, typically a 
glass slide, and then contacted with a probe that can hybridize to mRNA that encodes 
the marker. 

As an alternative to making determinations based on the absolute expression 
level of the marker, determinations may be based on the normalized expression level of 

1 5 the marker. Expression levels are normalized by correcting the absolute expression 
level of a marker by comparing its expression to the expression of a gene that is not a 
marker, e.g., a housekeeping gene that is constitutively expressed. Suitable genes for 
normalization include housekeeping genes such as the actin gene, or epithelial cell- 
specific genes. This normalization allows the comparison of the expression level in one 

20 sample, e.g., a patient sample, to another sample, e.g., a non-ovarian cancer sample, or 
between samples from different sources. 

Alternatively, the expression level can be provided as a relative expression level. 
To determine a relative expression level of a marker, the level of expression of the 
marker is determined for 10 or more samples of normal versus cancer cell isolates, 

25 preferably 50 or more samples, prior to the determination of the expression level for the 
sample in question. The mean expression level of each of the genes assayed in the 
larger number of samples is determined and this is used as a baseline expression level 
for the marker. The expression level of the marker determined for the test sample 
(absolute level of expression) is then divided by the mean expression value obtained for 

30 that marker. This provides a relative expression level. 
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Preferably, the samples used in the baseline determination will be from ovarian 
cancer or from non-ovarian cancer cells of ovarian tissue. The choice of the cell source 
is dependent on the use of the relative expression level. Using expression found in 
normal tissues as a mean expression score aids in validating whether the marker assayed 
5 is ovarian specific (versus normal cells). In addition, as more data is accumulated, the 
mean expression value can be revised, providing improved relative expression values 
based on accumulated data. Expression data from ovarian cells provides a means for 
grading the severity of the ovarian cancer state. 

In another embodiment of the present invention, a polypeptide corresponding to 

10 a marker is detected. A preferred agent for detecting a polypeptide of the invention is an 
antibody capable of binding to a polypeptide corresponding to a marker of the invention, 
preferably an antibody with a detectable label. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g. , Fab or F(ab') 2 ) 
can be used. The term "labeled", with regard to the probe or antibody, is intended to 

15 encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) 
a detectable substance to the probe or antibody, as well as indirect labeling of the probe 
or antibody by reactivity with another reagent that is directly labeled. Examples of 
indirect labeling include detection of a primary antibody using a fluorescently labeled 
secondary antibody and end-labeling of a DNA probe with biotin such that it can be 

20 detected with fluorescently labeled streptavidin. 

Proteins from ovarian cells can be isolated using techniques that are well known 
to those of skill in the art. The protein isolation methods employed can, for example, be 
such as those described in Harlow and Lane (Harlow and Lane, 1988, Antibodies: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 

25 York). 

A variety of formats can be employed to determine whether a sample contains a 
protein that binds to a given antibody. Examples of such formats include, but are not 
limited to, enzyme immunoassay (EIA), radioimmunoassay (RIA), Western blot 
analysis and enzyme linked immunoabsorbant assay (ELISA). A skilled artisan can 
30 readily adapt known protein/antibody detection methods for use in determining whether 
ovarian cells express a marker of the present invention. 
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In one format, antibodies, or antibody fragments, can be used in methods such as 
Western blots or immunofluorescence techniques to detect the expressed proteins. In 
such uses, it is generally preferable to immobilize either the antibody or proteins on a 
solid support. Suitable solid phase supports or carriers include any support capable of 
5 binding an antigen or an antibody. Well-known supports or carriers include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, natural and 
modified celluloses, polyacrylamides, gabbros, and magnetite. 

One skilled in the art will know many other suitable carriers for binding antibody 
or antigen, and will be able to adapt such support for use with the present invention. For 

10 example, protein isolated from ovarian cells can be run on a polyacrylamide gel 

electrophoresis and immobilized onto a solid phase support such as nitrocellulose. The 
support can then be washed with suitable buffers followed by treatment with the 
detectably labeled antibody. The solid phase support can then be washed with the buffer 
a second time to remove unbound antibody. The amount of bound label on the solid 

1 5 support can then be detected by conventional means. 

The invention also encompasses kits for detecting the presence of a polypeptide 
or nucleic acid corresponding to a marker of the invention in a biological sample (e.g. an . 
ovary-associated body fluid such as a urine sample). Such kits can be used to determine 
if a subject is suffering from or is at increased risk of developing ovarian cancer. For 

20 example, the kit can comprise a labeled compound or agent capable of detecting a 
polypeptide or an mRNA encoding a polypeptide corresponding to a marker of the 
invention in a biological sample and means for determining the amount of the 
polypeptide or mRNA in the sample (e.g., an antibody which binds the polypeptide or an 
oligonucleotide probe which binds to DNA or mRNA encoding the polypeptide). Kits 

25 can also include instructions for interpreting the results obtained using the kit. 

For antibody-based kits, the kit can comprise, for example: (1) a first antibody 
(e.g., attached to a solid support) which binds to a polypeptide corresponding to a 
marker of the invention; and, optionally, (2) a second, different antibody which binds to 
either the polypeptide or the first antibody and is conjugated to a detectable label. 

30 For oligonucleotide-based kits, the kit can comprise, for example: (1) an 

oligonucleotide, e.g., a detectably labeled oligonucleotide, which hybridizes to a nucleic 
acid sequence encoding a polypeptide corresponding to a marker of the invention or (2) 
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a pair of primers useful for amplifying a nucleic acid molecule corresponding to a 
marker of the invention. The kit can also comprise, e.g., a buffering agent, a 
preservative, or a protein stabilizing agent. The kit can further comprise components 
necessary for detecting the detectable label {e.g., an enzyme or a substrate). The kit can 
5 also contain a control sample or a series of control samples which can be assayed and 
compared to the test sample. Each component of the kit can be enclosed within an 
individual container and all of the various containers can be within a single package, 
along with instructions for interpreting the results of the assays performed using the kit. 

10 B. Pharmacogenomics 

Agents or modulators which have a stimulatory or inhibitory effect on expression 
of a marker of the invention can be administered to individuals to treat (prophylactically 
or therapeutically) ovarian cancer in the patient. In conjunction with such treatment, the 
pharmacogenomics (z.e., the study of the relationship between an individual's genotype 

1 5 and that individual's response to a foreign compound or drug) of the individual may be 
considered. Differences in metabolism of therapeutics can lead to severe toxicity or 
therapeutic failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, the pharmacogenomics of the individual permits 
the selection of effective agents {e.g. , drugs) for prophylactic or therapeutic treatments . 

20 based on a consideration of the individual's genotype. Such pharmacogenomics can. 
further be used to determine appropriate dosages and therapeutic regimens. 
Accordingly, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. 

25 Pharmacogenomics deals with clinically significant variations in the response to 

drugs due to altered drug disposition and abnormal action in affected persons. See, e.g., 
Linder (1997) Clin. Chem. 43(2):254-266. In general, two types of pharmacogenetic 
conditions can be differentiated. Genetic conditions transmitted as a single factor 
altering the way drugs act on the body are referred to as "altered drug action." Genetic 

30 conditions transmitted as single factors altering the way the body acts on drugs are 
referred to as "altered drug metabolism". These pharmacogenetic conditions can occur 
either as rare defects or as polymorphisms. For example, glucose-6-phosphate 
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dehydrogenase (G6PD) deficiency is a common inherited en2ymopathy in which the 
main clinical complication is hemolysis after ingestion of oxidant drugs (anti-malarials, 
sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 

As an illustrative embodiment, the activity of drug metabolizing enzymes is a 
5 major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2 (NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
drug response and serious toxicity after taking the standard and safe dose of a drug. 

10 These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different among 
different populations. For example, the gene coding for CYP2D6 is highly polymorphic 
and several mutations have been identified in PM, which all lead to the absence of 
functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite frequently 

15 experience exaggerated drug response and side effects when they receive standard 
doses. If a metabolite is the active therapeutic moiety, a PM will show no therapeutic 
response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 
formed metabolite morphine. The other extreme are the so called ultra-rapid 
metabolizers who do not respond to standard doses. Recently, the molecular basis of 

20 ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Thus, the level of expression of a marker of the invention in an individual can be 
determined to thereby select appropriate agent(s) for therapeutic or prophylactic 
treatment of the individual. In addition, pharmacogenetic studies can be used to apply 
genotyping of polymorphic alleles encoding drug-metabolizing enzymes to the 

25 identification of an individual's drug responsiveness phenotype. This knowledge, when 
applied to dosing or drug selection, can avoid adverse reactions or therapeutic failure 
and thus enhance therapeutic or prophylactic efficiency when treating a subject with a 
modulator of expression of a marker of the invention. 



30 



C. Monitoring Clinical Trials 

Monitoring the influence of agents (e.g., drug compounds) on the level of 
expression of a marker of the invention can be applied not only in basic drug screening, 
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but also in clinical trials. For example, the effectiveness of an agent to affect marker 
expression can be monitored in clinical trials of subjects receiving treatment for ovarian 
cancer. In a preferred embodiment, the present invention provides a method for 
monitoring the effectiveness of treatment of a subject with an agent (e.g., an agonist, 
5 antagonist, peptidomimetic, protein, peptide, nucleic acid, small molecule, or other drug 
candidate) comprising the steps of (i) obtaining a pre-administration sample from a 
subject prior to administration of the agent; (ii) detecting the level of expression of one 
or more selected markers of the invention in the pre-administration sample; (iii) 
obtaining one or more post-administration samples from the subject; (iv) detecting the 

10 level of expression of the marker(s) in the post-administration samples; (v) comparing 
the level of expression of the marker(s) in the pre-administration sample with the level 
of expression of the marker(s) in the post-administration sample or samples; and (vi) 
altering the administration of the agent to the subject accordingly. For example, 
increased administration of the agent can be desirable to increase expression of the 

15 marker(s) to higher levels than detected, i.e., to increase the effectiveness of the agent. 
Alternatively, decreased administration of the agent can be desirable to decrease . 
expression of the marker(s) to lower levels than detected, i.e., to decrease the 
effectiveness of the agent. 

20 D. Electronic Apparatus Readable Media and Arrays 

Electronic apparatus readable media comprising a marker of the present 
invention is also provided. As used herein, "electronic apparatus readable media" refers 
to any suitable medium for storing, holding or containing data or information that can be 
read and accessed directly by an electronic apparatus. Such media can include, but are 

25 not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, 
and magnetic tape; optical storage media such as compact disc; electronic storage media 
such as RAM, ROM, EPROM, EEPROM and the like; general hard disks and hybrids of 
these categories such as magnetic/optical storage media. The medium is adapted or 
configured for having recorded thereon a marker of the present invention. 

30 As used herein, the term "electronic apparatus" is intended to include any 

suitable computing or processing apparatus or other device configured or adapted for 
storing data or information. Examples of electronic apparatus suitable for use with the 
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present invention include stand-alone computing apparatus; networks, including a local 
area network (LAN), a wide area network (WAN) Internet, Intranet, and Extranet; 
electronic appliances such as a personal digital assistants (PDAs), cellular phone, pager 
and the like; and local and distributed processing systems. 
5 As used herein, "recorded" refers to a process for storing or encoding 

information on the electronic apparatus readable medium. Those skilled in the art can 
readily adopt any of the presently known methods for recording information on known 
media to generate manufactures comprising the markers of the present invention. 
A variety of software programs and formats can be used to store the marker 

10 information of the present invention on the electronic apparatus readable medium. For 
example, the nucleic acid sequence corresponding to the markers can be represented in a 
word processing text file, formatted in commercially-available software such as 
WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored in 
a database application, such as DB2, Sybase, Oracle, or the like, as well as in other 

1 5 forms. Any number of dataprocessor structuring formats {e.g. , text file or database) may 
be employed in order to obtain or create a medium having recorded thereon the the ... 
markers of the present invention. 

By providing the markers of the invention in readable form, one can routinely 
access the marker sequence information for a variety of purposes. For example, one 

20 skilled in the art can use the nucleotide or amino acid sequences of the present invention 
in readable form to compare a target sequence or target structural motif with the 
sequence information stored within the data storage means. Search means are used to 
identify fragments or regions of the sequences of the invention which match a particular 
target sequence or target motif. 

25 The present invention therefore provides a medium for holding instructions for 

performing a method for determining whether a subject has ovarian cancer or a pre- 
disposition to ovarian cancer, wherein the method comprises the steps of determining 
the presence or absence of a marker and based on the presence or absence of the marker, 
determining whether the subject has ovarian cancer or a pre-disposition to ovarian 

30 cancer and/or recommending a particular treatment for ovarian cancer or pre-ovarian 
cancer condition. 
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The present invention further provides in an electronic system and/or in a 
network, a method for determining whether a subject has ovarian cancer or a pre- 
disposition to ovarian cancer associated with a marker wherein the method comprises 
the steps of determining the presence or absence of the marker, and based on the 
5 presence or absence of the marker, determining whether the subject has ovarian cancer 
or a pre-disposition to ovarian cancer, and/or recommending a particular treatment for 
the ovarian cancer or pre-ovarian cancer condition. The method may further comprise 
the step of receiving phenotypic information associated with the subject and/or acquiring 
from a network phenotypic information associated with the subject. 

1 0 The present invention also provides in a network, a method for determining 

whether a subject has ovarian cancer or a pre-disposition to ovarian cancer associated 
with a marker, said method comprising the steps of receiving information associated 
with the marker receiving phenotypic information associated with the subject, acquiring 
information from the network corresponding to the marker and/or ovarian cancer, and 

1 5 based on one or more of the phenotypic information, the marker, and the acquired 

information, determining whether the subject has a ovarian cancer or a pre-disposition to 
ovarian cancer. The method may further comprise the step of recommending a 
particular treatment for the ovarian cancer or pre-ovarian cancer condition. 

The present invention also provides a business method for determining whether a 

20 subject has ovarian cancer or a pre-disposition to ovarian cancer, said method 

comprising the steps of receiving information associated with the marker, receiving 
phenotypic infdrmation associated with the subject, acquiring information from the 
network corresponding to the marker and/or ovarian cancer, and based on one or more 
of the phenotypic information, the marker, and the acquired information, determining 

25 whether the subject has ovarian cancer or a pre-disposition to ovarian cancer. The 
method may further comprise the step of recommending a particular treatment for the 
ovarian cancer or pre-ovarian cancer condition. 

The invention also includes an array comprising a marker of the present 
invention. The array can be used to assay expression of one or more genes in the array. 

30 In one embodiment, the array can be used to assay gene expression in a tissue to 

ascertain tissue specificity of genes in the array. In this manner, up to about 7600 genes 
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can be simultaneously assayed for expression. This allows a profile to be developed 
showing a battery of genes specifically expressed in one or more tissues. 

In addition to such qualitative determination, the invention allows the 
quantitation of gene expression. Thus, not only tissue specificity, but also the level of 
5 expression of a battery of genes in the tissue is ascertainable. Thus, genes can be 
grouped on the basis of their tissue expression per se and level of expression in that 
tissue. This is useful, for example, in ascertaining the relationship of gene expression 
between or among tissues. Thus, one tissue can be perturbed and the effect on gene 
expression in a second tissue can be determined. In this context, the effect of one cell 

10 type on another cell type in response to a biological stimulus can be determined. Such a 
determination is useful, for example, to know the effect of cell-cell interaction at the 
level of gene expression. If an agent is administered therapeutically to treat one cell 
type but has an undesirable effect on another cell type, the invention provides an assay 
to determine the molecular basis of the undesirable effect and thus provides the 

1 5 opportunity to co-administer a counteracting agent or otherwise treat the undesired 
effect. Similarly, even within a single cell type, undesirable biological effects can be 
determined at the molecular level. Thus, the effects of an agent on expression of other 
than the target gene can be ascertained and counteracted. 

In another embodiment, the array can be used to monitor the time course o£ 

20 expression of one or more genes in the array. This can occur in various biological 

contexts, as disclosed herein, for example development of ovarian cancer, progression of 
ovarian cancer, and processes, such a cellular transformation associated with ovarian 
cancer. 

The array is also useful for ascertaining the effect of the expression of a gene on 
25 the expression of other genes in the same cell or in different cells. This provides, for 
example, for a selection of alternate molecular targets for therapeutic intervention if the 
ultimate or downstream target cannot be regulated. 

The array is also useful for ascertaining differential expression patterns of one or 
more genes in normal and abnormal cells. This provides a battery of genes that could 
30 serve as a molecular target for diagnosis or therapeutic intervention. 
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E. Surrogate Markers 

The markers of the invention may serve as surrogate markers for one or more 
disorders or disease states or for conditions leading up to disease states, and in 
particular, ovarian cancer. As used herein, a "surrogate marker" is an objective 
5 biochemical marker which correlates with the absence or presence of a disease or 
disorder, or with the progression of a disease or disorder (e.g., with the presence or 
absence of a tumor). The presence or quantity of such markers is independent of the 
disease. Therefore, these markers may serve to indicate whether a particular course of 
treatment is effective in lessening a disease state or disorder. Surrogate markers are of 

10 particular use when the presence or extent of a disease state or disorder is difficult to 
assess through standard methodologies (e.g., early stage tumors), or when an assessment 
of disease progression is desired before a potentially dangerous clinical endpoint is 
reached (e.g., an assessment of cardiovascular disease may be made using cholesterol 
levels as a surrogate marker, and an analysis of HIV infection may be made using HIV 

15 RNA levels as a surrogate marker, well in advance of the undesirable clinical outcomes 
of myocardial infarction or fully-developed AIDS). Examples of the use of surrogate 
markers in the art include: Koomen et al. (2000) J. Mass. Spectrom. 35: 258-264; and 
James (1 994) AIDS Treatment News Archive 209. 

The markers of the invention are also useful as pharmacodynamic markers. As 

20 used herein, a "pharmacodynamic marker" is an objective biochemical marker which 
correlates specifically with drug effects. The presence or quantity of a 
pharmacodynamic marker is not related to the disease state or disorder for which the 
drug is being administered; therefore, the presence or quantity of the marker is 
indicative of the presence or activity of the drug in a subject. For example, a 

25 pharmacodynamic marker may be indicative of the concentration of the drug in a 

biological tissue, in that the marker is either expressed or transcribed or not expressed or 
transcribed in that tissue in relationship to the level of the drug. In this fashion, the 
distribution or uptake of the drug may be monitored by the pharmacodynamic marker. 
Similarly, the presence or quantity of the pharmacodynamic marker may be related to 

30 the presence or quantity of the metabolic product of a drug, such that the presence or 
quantity of the marker is indicative of the relative breakdown rate of the drug in vivo. 
Pharmacodynamic markers are of particular use in increasing the sensitivity of detection 
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of drug effects, particularly when the drug is administered in low doses. Since even a 
small amount of a drug may be sufficient to activate multiple rounds of marker 
transcription or expression, the amplified marker may be in a quantity which is more 
readily detectable than the drug itself. Also, the marker may be more easily detected 
5 due to the nature of the marker itself; for example, using the methods described herein, 
antibodies may be employed in an immune-based detection system for a protein marker, 
or marker-specific radiolabeled probes may be used to detect a mRNA marker. 
Furthermore, the use of a pharmacodynamic marker may offer mechanism-based 
prediction of risk due to drug treatment beyond the range of possible direct 

1 0 observations. Examples of the use of pharmacodynamic markers in the art include: 
Matsuda et al US 6,033,862; Hattis et al. (1991) Env. Health Perspect. 90: 229-238; 
Schentag (1999) Am. J. Health-Syst. Pharm. 56 Suppl. 3: S21-S24; and Nicolau (1999) 
Am, J. Health-Syst. Pharm. 56 Suppl. 3: S16-S20. 

The markers of the invention are also useful as pharmacogenomic markers. As 

1 5 used herein, a "pharmacogenomic marker" is an objective biochemical marker which 
correlates with a specific clinical drug response or susceptibility in a subject (see, e;g., 
McLeod et al. (1999) Eur. J. Cancer 35(12): 1650-1652). The presence or quantity of 
the pharmacogenomic marker is related to the predicted response of the subject to a 
specific drug or class of drugs prior to administration of the drug. By assessing the^ 

20 presence or quantity of one or more pharmacogenomic markers in a subject, a drug ;. 
therapy which is most appropriate for the subject, or which is predicted to have a greater 
degree of success, may be selected. For example, based on the presence or quantity of 
RNA or protein for specific tumor markers in a subject, a drug or course of treatment 
may be selected that is optimized for the treatment of the specific tumor likely to be 

25 present in the subject. Similarly, the presence or absence of a specific sequence 
mutation in marker DNA may correlate with drug response. The use of 
pharmacogenomic markers therefore permits the application of the most appropriate 
treatment for each subject without having to administer the therapy. 
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VI. Experimental Protocol 

A. Subtracted Libraries 

Subtracted libraries are generated using a PCR based method that allows the 
isolation of clones expressed at higher levels in one population of mRNA (tester) 
5 compared to another population (driver). Both tester and driver mRNA populations are 
converted into cDNA by reverse transcription, and then PCR amplified using the 
SMART PCR kit from Clontech. Tester and driver cDNAs are then hybridized using 
the PCR-Select cDNA subtraction kit from Clontech. This technique results in both 
subtraction and normalization, which is an equalization of copy number of low- 

1 0 abundance and high-abundance sequences. After generation of the subtractive libraries, 
a group of 96 or more clones from each library is tested to confirm differential 
expression by reverse Southern hybridization. 

To create the subtracted libraries, a first group of regular cDNA libraries was 
constructed. Library johOa was constructed from a pool of 5 normal ovarian epithelial 

1 5 cell cultures. Library johOb was constructed from a pool of 5 ascites short cultured 
samples from ovarian cancer patients. Library johOc was constructed from a pool of 6 
serous late stage (III/IV) tumor samples. Three subtracted libraries were generated from 
tumor samples. Library johOd was a subtracted ascites library, where the tester was 
johOb, and the driver was johOa. The johOe and the johOf library were both subtracted 

20 stage III/IV serous tumor libraries. The tester for both of these libraries was johOc, and 
the driver was a pooled RNA from normal tissues. The tissues used for this driver pool 
were: kidney, small intestine, prostate, lung, heart, muscle, spleen, pancreas, liver, and 
lymphocyte. Library cMhOg was the same as the johOc and johOf libraries, with the 
exception that normal ovary was added to the driver. cMhOh, i, j, and k are all stage l/ll 

25 subtracted libraries made from pooled tumor RNAs of different histological types 

(h=serous, I-endometriod, j=clear cell, k=mucinous). The driver was the same for these 
4 libraries. It consisted of normal ovarian epithelial RNA and PBML RNA. Of the 
markers listed in Table 1, SEQ ID NOS: 1-129, 916-1029, 1566-1571 and 1607-1865 
were identified in library johOa. Markers identified in johOb include SEQ ID NOS: 

30 130-177, 1030-1081, 1572-1574, and 1866-1974. Markers identified in johOc include 
SEQ ID NOS: 178-269, 1082-1120, 1575-1577, and 1975-2060. Markers identified in 
johOd include SEQ ID NOS: 270-370, 1 121-1304, 1578-1592, and 2061-2244. Markers 
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identified in johOe include SEQ ID NOS: 371-611, 1305-1416, 1593-1596 and 2245- 
2487. Markers identified in johOf include SEQ ID NOS: 612-915, 1417-1565, 1597- 
1606, and 2488-2871 . Of the markers listed in Table 1 A, SEQ ID NOS: 2872-2976, 
3817-3898, 4438-4443 and 4474-4675 were identified in library cMhOg. Markers 
5 identified in cMhOh include SEQ ID NOS: 2977-3376, 3899-4072, 4444-4455, and 
4676-5303. Markers identified in cMhOi include SEQ ID NOS: 3377-3495, 4073-4158, 
4456-4460, and 5304-5637. Markers identified in cMhOj include SEQ ID NOS: 3496- 
3742, 4195-4390, 4461-4468, and 5638-6197. Markers identified in cMhOk include 
SEQ ID NOS: 3743-3816, 4391-4437, 4469-4473 and 6198-6398. 

10 

VII. Summary Of The Data Provided In The Tables 

Tables 1, 1A, 2 and 3 are being filed concurrently herewith on a compact disc in 

lieu of paper copies. The compact disc submitted is formatted from an IBM-PC and is 

compatible with MS-Windows. The disc contains the following four (4) files: 
15 Tablel.text, containing l,223kb, TablelA.text, containing l,582kb, Table2.text, . 

containing 10,600kb, and Table3.text, containing 568kb. The material on the compact 

disc, namely Tables 1, 1 A, 2 and 3, is expressly incorporated by reference. 

Tables 1 and 1 A show 6398 novel nucleotide sequences. These 6398 novel 

sequences were determined to be novel through various BLAST searches of available 
20 databases. Of these novel markers, SEQ ID NOS: 1566 - 1606 and 4438-4473 are 

preferred, SEQ ID NOS: 916-1565 and 3817-4437 are more preferred, and SEQ ID 

NOS: 1 - 915 and 2872-3816 are most preferred. 

The sequences of Tables 1 and 1 A were re-interpreted and vector sequences 

removed and those sequences are set forth in Table 2. 
25 Table 3 correlates the SEQ ID NOS. from Tables 1 and 1A with those of Table 

2. 

The contents of all references, patents, published patent applications, and 
databases cited throughout this application are hereby incorporated by reference. 

30 Other Embodiments 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 



WO 01/070979 PCT7US01/09126 

-94- 

described herein. Such equivalents are intended to be encompassed by the following 
claims. 

What is claimed is: 
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Claims 

f 

1 . An isolated nucleic acid molecule comprising a nucleotide sequence of 
■ Tables 1 -2, or a complement thereof. 

5 

2. A vector which contains the nucleic acid molecule of claim 1 . 

3 . A host cell which contains the nucleic acid molecule of claim 1 . 

10 4. An isolated polypeptide which is encoded by a nucleic acid molecule 

comprising a nucleotide sequence of Tables 1-2. 

5. An antibody which selectively binds to a polypeptide of claim 4. 

15 6. A method for producing a polypeptide comprising culturing the host cell 

of claim 3 under conditions in which the nucleic acid molecule is expressed. 

7. A method for detecting the presence of a polypeptide of claim 4 in a 
sample comprising: 

20 a) contacting the sample with a compound which selectively binds to 

the polypeptide; and 

b) determining whether the compound binds to the polypeptide in 
the sample to thereby detect the presence of a polypeptide of claim 4 in the 
sample. 

25 

8. A kit comprising a compound which selectively binds to the polypeptide 
of claim 4. 
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9. A method for detecting the presence of a nucleic acid molecule of claim 1 
in a sample comprising: 

a) contacting the sample with a nucleic acid probe or primer which 
selectively hybridizes to the nucleic acid molecule; and 
5 b) determining whether the nucleic acid probe or primer binds to a 

nucleic acid molecule in the sample to thereby detect the presence of a nucleic 
acid molecule of claim 1 in the sample. 

10. The method of claim 9, wherein the sample comprises mRNA molecules 
1 0 and is contacted with a nucleic acid probe. 

1 1 . The method of claim 9, wherein the sample is isolated from ovarian 

tissue. 

15 12. The method of claim 9, wherein the sample is a tumor sample. 

13. A kit comprising a compound which selectively hybridizes to a nucleic 
acid molecule of claim 1 . 

20 14. A method of assessing whether a patient is afflicted with ovarian cancer, 

the method comprising comparing: 

a) the level of expression of a marker in a patient sample, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

25 b) the normal level of expression of the marker in a control non- 

ovarian cancer sample, 

wherein a significant difference between the level of expression of the marker in 
the patient sample and the normal level is an indication that the patient is afflicted with 
ovarian cancer. 

30 

15. The method of claim 1 4, wherein the marker corresponds to a secreted 

protein. 
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10 



15 



20 



25 



1 6. The method of claim 1 4, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

17. The method of claim 14, wherein the sample comprises cells obtained, 
from the patient. 

1 8. The method of claim 17, wherein the sample is an ovarian tissue sample. 

19. The method of claim 14, wherein the sample is an ovary-associated body 

fluid. 

20. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a protein or protein 
fragment corresponding to the marker. 

21 . The method of claim 20, wherein the presence of the protein or protein 
fragment is detected using a reagent which specifically binds with the protein or protein 
fragment. 

22. The method of claim 21 , wherein the reagent is selected from the group 
consisting of an antibody, an antibody derivative, and an antibody fragment. 

23. The method of claim 14, wherein the level of expression of the marker in 
the sample is assessed by detecting the presence in the sample of a transcribed 
polynucleotide or portion thereof, wherein the transcribed polynucleotide comprises the 
marker. 

24. The method of claim 23, wherein the transcribed polynucleotide is an 
mRNA. 

25. The method of claim 23, wherein the transcribed polynucleotide is a 



cDNA. 
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26. The method of claim 23, wherein the step of detecting further comprises 
amplifying the transcribed polynucleotide. 

27. The method of claim 14, wherein the level of expression of the marker in 
5 the sample is assessed by detecting the presence in the sample of a transcribed 

polynucleotide which anneals with the marker or anneals with a portion of a 
polynucleotide wherein the polynucleotide comprises the marker, under stringent 
hybridization conditions. 

10 28. The method of claim 14, wherein the level of expression of the marker in 

the sample differs from the normal level of expression of the marker in a patient not 
afflicted with ovarian cancer by a factor of at least about 2. 

29. The method of claim 14, wherein the level of expression of the marker in 
1 5 the sample differs from the normal level of expression of the marker in a patient not 

afflicted with ovarian cancer by a factor of at least about 5. 

30. The method of claim 14, comprising comparing: 

a) the level of expression in the sample of each of a plurality of 
20 markers independently selected from the markers listed in Tables 1-2, and 

b) the normal level of expression of each of the plurality of markers 
in samples of the same type obtained from control humans not afflicted with 
ovarian cancer, 

wherein the level of expression of more than one of the markers is significantly 
25 altered, relative to the corresponding normal levels of expression of the markers, is an 
indication that the patient is afflicted with ovarian cancer. 

3 1 . The method of claim 30, wherein the level of expression of each of the 
markers is significantly altered, relative to the corresponding normal levels of 

30 expression of the markers, is an indication that the patient is afflicted with ovarian 
cancer. 
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32. The method of claim 30, wherein the plurality comprises at least three of 
the markers. 

33. The method of claim 30, wherein the plurality comprises at least five of 
5 the markers. 

34. A method for monitoring the progression of ovarian cancer in a patient, 
the method comprising: 

a) detecting in a patient sample at a first point in time, the expression 
10 of a marker, wherein the marker is selected from the group consisting of the 

markers listed in Tables 1-2; 

b) repeating step a) at a subsequent point in time; and 

c) comparing the level of expression detected in steps a) and b), and 
therefrom monitoring the progression of ovarian cancer. 

15 

35. The method of claim 34, wherein the marker corresponds to a secreted 

protein. 



36. The method of claim 34, wherein the marker corresponds to a transcribed 
20 polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. - 

37. The method of claim 34, wherein the sample comprises cells obtained 
from the patient. 

25 38. The method of claim 37, wherein the patient sample is an ovarian tissue 

sample. 

39. The method of claim 34, wherein between the first point in time and the 
subsequent point in time, the patient has undergone surgery to remove ovarian tissue. 

30 
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40. A method of assessing the efficacy of a test compound for inhibiting 
ovarian cancer in a patient, the method comprising comparing: 

a) expression of a marker in a first sample obtained from the patient 
and exposed to the test compound, wherein the marker is selected from the group 

5 consisting of the markers listed in Tables 1-2, and 

b) expression of the marker in a second sample obtained from the 
patient, wherein the sample is not exposed to the test compound, 

wherein a significantly lower level of expression of the marker in the first 
sample, relative to the second sample, is an indication that the test compound is 
10 efficacious for inhibiting ovarian cancer in the patient. 

41 . The method of claim 40, wherein the first and second samples are 
portions of a single sample obtained from the patient. 

15 42. The method of claim 40, wherein the first and second samples are . 

portions of pooled samples obtained from the patient. 

43 . A method of assessing the efficacy of a therapy for inhibiting ovarian 
cancer in a patient, the method comprising comparing: 
20 a) expression of a marker in the first sample obtained from the 

patient prior to providing at least a portion of the therapy to the patient, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) expression of the marker in a second sample obtained from the 
25 patient following provision of the portion of the therapy, 

wherein a significantly lower level of expression of the marker in the second 
sample, relative to the first sample, is an indication that the therapy is efficacious for 
inhibiting ovarian cancer in the patient. 
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44. A method of selecting a composition for inhibiting ovarian cancer in a 
patient, the method comprising: 

a) obtaining a sample comprising cancer cells from the patient; 

b) separately exposing aliquots of the sample in the presence of a 
5 plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2; and 

d) selecting one of the test compositions which alters the level of 
10 expression of the marker in the aliquot containing that test composition, relative 

to other test compositions. 

45. A method of inhibiting ovarian cancer in a patient, the method 
comprising: 

15 a) obtaining a sample comprising cancer cells from the patient; 

b) separately maintaining aliquots of the sample in the presence of a 
plurality of test compositions; 

c) comparing expression of a marker in each of the aliquots, wherein 
the marker is selected from the group consisting of the markers listed in Tables 

20 . 1-2; and 

d) administering to the patient at least one of the test compositions 
which alters the level of expression of the marker in the aliquot containing that 
test composition, relative to other test compositions. 



25 46. A kit for assessing whether a patient is afflicted with ovarian cancer, the 

kit comprising reagents for assessing expression of a marker selected from the group 
consisting of the markers listed in Tables 1-2. 



47. A kit for assessing the presence of ovarian cancer cells, the kit 
30 comprising a nucleic acid probe wherein the probe specifically binds with a transcribed 
polynucleotide corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-2. 
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48. A kit for assessing the suitability of each of a plurality of compounds for 
inhibiting ovarian cancer in a patient, the kit comprising: 

a) the plurality of compounds; and 

b) a reagent for assessing expression of a marker selected from the 
5 group consisting of the markers listed in Tables 1-2. 

49. A method of making an isolated hybridoma which produces an antibody 
useful for assessing whether a patient is afflicted with ovarian cancer, the method 
comprising: 

1 0 isolating a protein or protein fragment corresponding to a marker selected 

from the group consisting of the markers listed in Tables 1-2; 

immunizing a mammal using the isolated protein or protein fragment; 
isolating splenocytes from the immunized mammal; 
fusing the isolated splenocytes with an immortalized cell line to form 
15 hybridomas; and 

screening individual hybridomas for production of an antibody which 
specifically binds with the protein or protein fragment to isolate the hybridoma. 

50. An antibody produced by a hybridoma made by the method of claim 42. 

20 

51. A kit for assessing the presence of human ovarian cancer cells, the kit 
comprising an antibody, wherein the antibody specifically binds with a protein or 
protein fragment corresponding to a marker selected from the group consisting of the 
markers listed in Tables 1-2. 

25 

52. A method of assessing the ovarian cell carcinogenic potential of a test 
compound, the method comprising: 

a) maintaining separate aliquots of ovarian cells in the presence and 
absence of the test compound; and 
30 b) comparing expression of a marker in each of the aliquots, wherein 

the marker is selected from the group consisting of the markers listed in Tables 
1-2, 
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wherein a significantly altered level of expression of the marker in the aliquot 
maintained in the presence of the test compound, relative to the aliquot maintained in the 
absence of the test compound, is an indication that the test compound possesses human 
ovarian cell carcinogenic potential. 

5 

53. A kit for assessing the ovarian cell carcinogenic potential of a test 
compound, the kit comprising ovarian cells and a reagent for assessing expression of a 
marker, wherein the marker is selected from the group consisting of the markers listed in 
Tables 1-2. 

10 

54. A method of inhibiting ovarian cancer in a patient at risk for developing 
ovarian cancer, the method comprising inhibiting expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

15 55. A method of treating a patient afflicted with ovarian cancer, the method 

comprising providing to cells of the patient an antisense oligonucleotide complementary . 
to a polynucleotide corresponding to a marker selected from the markers listed in Tables 
1-2. 

20 56. A method of inhibiting ovarian cancer in a patient at risk for developing 

ovarian cancer, the method comprising decreasing expression of a gene corresponding to 
a marker selected from the markers listed in Tables 1-2. 

57. A method for determining whether ovarian cancer has metastasized in a 
25 patient, the method comprising comparing: 

a) the level of expression of a marker in a patient smaple, wherein 
the marker is selected from the group consisting of the markers listed in Tables 
1-2, and 

b) the normal level or non-metastatic level of expression of the 
30 marker in a control sample 
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wherein a significant difference betweent he level of expression in the patient 
sample and the normal level or non-metastatic level is an indication that the ovarian 
cancer has mestastasized. 

♦ 

5 58. The method of claim 57, wherein the marker corresponds to a secreted 

protein. 

59. The method of claim 57, wherein the marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

10 

60. The method of claim 57, wherein the sample comprises cells obtained 
from the patient. 

61 . The method of claim 60, wherein the patient sample is an ovarian tissue 

15 sample. 

62. A method for assessing the aggressiveness or indolence of ovarian cancer 
comprising comparing: 

a) the level of expression of a marker in a sample, wherein at least 
20 one marker is selected from the markers of Tables 1-2, and 

b) the normal level of expression of the marker in a control sample, 
wherein a significant difference between the level of expression in the sample 

and the normal level is an indication that the cancer is aggressive or indolent. 

25 63. The method of claim 62, wherein the marker corresponds to a secreted 

protein. 

64. The method of claim 62, wherein marker corresponds to a transcribed 
polynucleotide or portion thereof, wherein the polynucleotide comprises the marker. 

30 

65. The method of claim 62, wherein the sample comprises cells obtained 
from the patient. 
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66. The method of claim 65, wherein the patient sample is an ovarian tissue 

sample. 
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Sequence 1 

ACGCGTCCGGGAGACACAAAGCAGGAAGCCTCCGGGAGACCAGAGCTGGGTGCAGACATA 

CACACACACATACACACAGACACACAGAGTCACACACACTCACACACACTCTCTCTCTCT 

CTCCCTCTGTCTTTCTCTCTCTCTCTCTCTCTCTGTCCTTTCCTCCTGGACAGATCCACA 

GTTATACACAGAAACAAACACACACGCACGTAGAGAAGTGATTCACAAACACTTAAAGAC 

ATAAATCACAGGTGCAAAGCCATACCTGGGCTCAAAAACCTCCAAGAGAACGCAGCCTCA 

GACCCACCCAGGGGCCAGGGGCCAGGCTGTTTGCGAGACCAGCCCAGGCGGGACCCAGGC 

TTGCACGGGCAGGTACACGACATTC7TGGGCATACGCAGCCCGCCTGGCCGGAGCTGTGG 

GAGTCCTCAGCCCCAAGACCCAGCAGGCGTCTGAGGCCTGCCCACTAAGGAGGAGGAGTC 

ATTGCTGCCATCATTTATCATNCCCTCCCCAGCCACAGTCTGAGGAGCCCNTGTNCACCC 

TTCCCA 

Sequence 2 

CGTCCGGAAAGCTGGTGGGAATGCTAAGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTA 
CGATCCTTGCTGGTCCTTGCAGGAGAAGTACAACAGCAGAGCCGCGGCCCTCTTTAGGGA 
TAAGGTGGTCGCTCTGGCCGAAGGCAGAGAGTGGTCTTTGGAGTCATCACCTGCCCAGAA 
CTGGACCCTACCTNAGCCCANGACGCTGCCGTCCATGGTGCACCGAGTCTCTGGCCAGCC 
GCAGAGTGTGACCGCCTCCTNGGACAAGGCTTTTGAAGACTGGCTGAATGATGACCTCGG . 
CTCCTATCAAGGGGCCCAGGGGAATCGCTACGTGGGGTTTGGGAACACGCCACCGCCTCA 
NAAGAAAGAAGATGACTTTCTCAACAACGCCATGTCCTCCCTGTACTCGGGCTGGA 
Sequence 3 

NCCACGCGTCCGGGACGCGGGCGCCAGGTGCACAGCCCCAGTCCGCTGCGGGCGGGCGTC 

GACATCTGCCGGGTGAGCGCGAGCTGGAGCTACACCGCTTTCGTGACCCGTGGAGGCCGC 

TTGGAGCTGTCGGGCTCAGCCAGCGGCGCGGCGGGCCGCTGCAAGGACGCGTGGGCCTCG 

GAGGGGCTCCTCGCGGTGCTGCGCGCCGGGCCGGGGCCGGAGGCGTTACTGCAGGTCTGG 

GCGGCCGAATCGGCGCTGCGTGGGGAGCCATTGTGGGCCCAGAATGGTGGTGCCCGAGGC 

CGAAGGGGGAAGGACGATCCGGCCGGGTGAAGGCCCAAGCTTGGGGAGGCTACCCCTTGC 

TTGCCCTGCGGCCCCGTGCCCCTACGTGAGCCCCCGCGGGCCGCCCTTCTACCGGCCTTT 

TGGCTTCCGGGAGCTTGCGGGGCAACGCCAACTTGGGAGCTGGGGCCGCCAGCAACCCCG 

TTGCTTGTTGGACGCTGTGCCCAAAGGTGGTTTCTGGGGGGCGGGG 

Sequence 4 

AGTCNCCACGCGTCCGGGAATTGANGCCGCGGGGCGGGCGGCGGCGGCGGCTGGGCGGCG 

GCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGG 

GACTGGGGCCACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGC 

GGATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCC 

TGGTGGCCCTCGCCCTGGGACCGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCC 

TGTCTACCTGCCGGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACT 

Sequence 5 

TCNCCACGCGTCCGACTGTATGTATTCTGGATACAGGGGATACTGGGCTCGCTATGTGTG 

TGGAGCCATCCCTTCCTTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTC 

TGAGCTTCCTTTCCTGCTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTG 

CTACTTCGGGTCAGGTGGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACT 

TCAAGAGGAGAGGCTGCAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCCGTGTC 

ACTTTACTTTTGTACACAAGGGGGTCTCCTTAGTGCCCTCGAGAAGGGATTCTTGGGCeC 

TGAGCTTCTACTCCTGAGGCCACCTTCTGTGCAAGCCCCAAGCTCCCTCAACTCTAGGCT 

TGGAGTCCTCAGTGGGGAAAAGCCCTGNTTTGGGG 

Sequence 6 

CGTCCGCGTCCTCGCTGCGGAAAGTTGGGGCAACCTGTTGCTAGTCTGGTCGTTGGTGAC 
AGCGAGGCTTCCGCGCTCGCfGCTGGTGAGCAGCCCCGGCGTGCCCCGCGGGCTGGAAGA 
GGCGGCGGCGTGATGCGGCCCGTGGACGCGCCCCGCGCCGGCCCGCGAGAACCTGGCCTC 
CCTGGAGCGCGAGCGCGCCCGGGCGCACTGGCGGGCCCGCAGGAAGCTGCTGGAGATCCA 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

2/467 



GAGCCTGCTCGACGCCATCAAGAGTGAGGTGGAGGCAGAGGAGCGGGGCGCCCGGGCCQC 

AGCACCCCGCCCCGCGTGCGGAGGCTGAGGAGNCGGGTGGCTCGGCTGTGCGCCCGAAGC 

AGAGAGGAAGGCTGCGGAAGCGGCGCGGATGGGGCAGGCGGGATCGTGGGAGCTGCACNN 

ACCGGATCGCCGGCTTGCGAGTGCTGCTGAGCCGGCGAGGCCCNCGGGTCTGGAAGCGGA 

ANCGCGCGGG 

Sequence 7 

NTTCGGGAGTCGACCACGCGTCCGCGCGCTGGAGGAGTGGAGCAAGCACCCGGCCCGGCC 

CTGGGGGCTGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAG 

GTGGCGACCAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCG 

CGCCCCAGTGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGG 

CGGACGGAGAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTTGCC 

GCCGCCTCCCCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGC 

CCCCCTGGGGAAGGCGGCACCAGGGGAGCCTTGGGCGCCCNGGGCTTCGGCCGCGACCCC 

ATTTGGGGTAGACCACAAGAMGCTTCGGGACCCTTTCGGCACCTTTGGACAGCCAAGAA • 

TGGCTGNTGGGCACCCTTTCTTCT 

Sequence 8 

CCCCGCGTCCGGAGCACGCAAAGGGAATAAATTGTAATTAGGTGGTGGGTGGCTAAAAAT 

GACAA TGCAAAGGTGTTGGATTAAAAAAAAAATCTGGTAGTAGAGGGAAATTATGGAGGA 

TTTTTAAAAAGGTTMTGATMTATCCATCTACTTATGTMCTTTTTTTGGAGATAC 

ATAATAGTGTAGAGTGCATTGGAGAGGAAAAGTAGGAGTTGTAAAGACCATTTTGGATAA 

ACTTTGAAGCAAGGGATAATGGCCTCAACCAAGGTAGTGGTGCTGAAGATTGTTTACATA 

MTAAGCAGATACAMTAGAAGGGATTTTTCMGTGGCATTGTMCTGCACTTTTCAAAG 

GTTATTTGCCAAAMTCAAATTAACGGTATCTTCAAAAATCATGTTTGATGGATGTATCA 

TCAAGGGCTTTCTTAAAATTTTGTTGAAAGCCAAGGAA 

Sequence 9 

CGCCTTCCCGGGAAGTTTGGAGGGCCCCGNAGGGGAAGCCCCCGCGNCTTCNGGGGGCCN 

GNCGGGCTTGGAAGGCAANCCCCACCCCAAGTTTCCCCGCCNANGGANTNCAATGAANCT 

TGACCGGGGCCCCCCGGAACCCCNCGCTNGNCTTNTTNGGGGGTGGTTCCTTGGGTTCCG 

GTGGGGGGGAACCCCAAGTGCTTTTTCAAGGCCCCGGCCGGGCGCGGGGCCCCGGAAAGG 

GCCTTTCAAGTTCTTNCCTTTCCCCGGNTTGAAAGAAGGNAAAAGGCCGGGAANGGAAAC 

CNGGGNAAAACCCGCNGGGNNGGGGCGGCCTTCNNCCGCCNGGGCGCCCCTTGCCNGGGG 

GGGGGNAAAGGGGNCAAAGTTTTCCCNGGGGCCCCCGGGGCCCCGGCNGNCCCTTTNAAN 

TCAAGGGGGCCGGGNNCGGNCTTTCCCCAANNCGGCCAAGTTCCTTCAAAGGGGCCCCCC 

CGGGNTTTGGGCCCGGNCCGGGNCNGNMCCTTGGGGMGGAAMAATTCAAAMGTTTT 

GTTGGCCCGGTTTCCGGGTNGGAATTGCCCCCCNAAAATTTGNAAGCCGGGGGGGGGGNN 

NCCTTGGGGGCCCCTTCTTTGCCCCCNTTTAAAGGGMNGGGCNAAAACCTTTNCCCAAC 

CGNCCAANNGCCCCCGTNAAMAANGGGCCGGCCTTNNTTTTGNCCGGGGCCCCNAANAA 

GGGC CTTTTCGNTTTTTTCGGGTTTTTTCCCCCGGGCCGGGCGGGG 

I I I I I GGCTTTTAAAGGGGGG 

Sequence 10 

NCGCGTCCGCCGCATTGTGGCCAAGTGCCATGAGGAGCAGCTGGATCATTCTGTCCAGTC 

ATATATTAAGTTCGTGTTCAAGACCAGGGCATGCAAGGAGAGGACTGTACATGAGGAACT 

GNCTAAAAATGTGACTGGTCTTTTGAAATCAAATGACTCAACAACAGTAAAGCATGTCCT 

MAGCATTCCTGGTTCTTCTTTGCAATTATCCTAAAATCGATGGCACAGCACTTGATTGA 

CACAAATAAAATCCAGCTTCCCCGGCCTCAGAGATTTCCTGAATCTTACCAAAATGAATT 

GGACAATCTTGNCATGGTCCTATCCGACCATGTGATTTGGGAAATACAAGGATGCACTTG 

MGAMCANGAAGGGCAAACCACAGCGTTGCCAGATTTCTCAAGCGCTGCTTTAC 

Sequence 11 

CGCCNCGCGTCCGGCTTCCTAGAAGAGCACAGTCCCTTAAAGCACCCTCTATTGCTACAA 

TTAAAAGTCTAGCAGATTGTAACTTTAGTTACACAAGTTCTAGAGATGCTTTTGGCTATG 

CTACACTGAAAAGACTACAGCAACAAAGAATGCATCCATCCTTATCTCACTCTGAAGCTT 
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TGGCATCTCCAGCAAAA6ATGTGCTATTTACTGATACCATCACCATGAAGGCCAACAGTT 

TTGAGTCCAGATTAACACCAAGCAGGTTCATGAAAGCCTTAAGTTATGCATCATTAGATA 

AAGAAGATTTATTGAGTCCTATTAATCAAAATACCCTGCAACCGATCTTCCTCAGTGCGG 

GCCATGGNGTCCAGTGCCACATNGGGGGGGTCAGAATGATTACATTGGGCTTGCTCTCCC 

GGNGGATATAAATGATATATTTCANGGTMGGGTATTTCTTATTTTTAGACAAAAAACAT 

CCCCNCATGATGATCCAGGNGCCAGAGCATTTGCCCTGAATGCAGGAGGGCTTTCATNTG 

GNACTGGGNGGGCTTTGNAAAAAA I I 1 1 I I 

Sequence 12 

TTCGGGAGTCGACCACGCGTCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGA 

CGAGGGCCGGGTTCTGCGGCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAG 

CTTCTTCACCGACGGGTCCTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTC 

TAAGCGACACTCAACCTCTGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGG 

CTGGTTGTATTATAAGCAGATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACAT 

GCTGGGCTGGATCAGAGCGATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCCGGCT 

GTGCCAACCAAGCTCTTGATCAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGC 

TCTGGGCCCAAAGCTTGATTTCTTCCC 

Sequence 13 

GTATTAATGTTCTCAGGCATGAAGCAGAATTTTACGGGATCACTCCATTAGTAAGAAGGC 

TTCTCTTATGTGAAGAATTGGAGCGTTCCTCTTGTGGCAGTGTCCTTTTTCATGGTTACT 

TGCCCCCACCAGGTATTCCTAGTNCGTAAAATAAACAACACAGTCAGATCTGCTGATTCT 

AGGAATGGTCTAAATTCTACAGAAGGTGAAGCCCGGGGAAATGGTACACAGCCTGTTCTC 

TCTGGAACGGGAGAAGAAACTGTTAGGCTAGGATTTCCTGTGGATCCACNAAAGGTGCTA . 

ATAGTAGCTGGCCATCACAACTGG ATTGTAGCTGCATATGCCCATTTTGCTGTGTGTTAC 

AGAATCAAAGAATNTTNANGATGGCAGCAAGTGNTTACGAGCCCATATTTGGATTGGACT 

ATCGAACGAGTAGNTTTAAATGCAAAGGTGGATGGAGGGCCACATGGAGACAAAAGACAA 

AAATG 

Sequence 14 

GCCNCGCGTCCGAAAAAATTAAGAGAAGGCCTGGCGGCCGGTCTGAAGTCATCTATAATT 

ATGTACAACGCCCCTTCATCCAGATGTCATGGGAAAAGGAAGAAGGGAAGAGTCGCCATG 

TGGATTTCTAGTGTGTTCGAAGCAAATCCCTCACGAATCTGGTAGCTGCTGGAGATGATG 

TCTTGGAGGACCAGGAGATATTAATGCATCACCCACCCCAAGTGGATGAACTTGACCGGC 

TAAATGCCCCACTTTCTCAGATGGCTTCTAACGACTTTCAGGATTAGGGCCAGCTGTGGG 

TCTACTCCTTGTTGGAGCCCATCTCACCTGGGATGCCTGCAGCCAGCCCTCCCTCGTGAT 

TTGTCTCACCTTGAGTAGGAGACATGCTTCTCCCCTAACCTTTTCCTTTCTGCCATAATT 

AACATATGTCCTTTTCAGTAAGTCCATGCCTCTGGCAGGGGATGAAAGAAGTACTCACTG 

GGTAATTAGCTACCATCTTTGCAGCACCCTGGTAACTTGAAAAATTT 

Sequence 15 

TCGGGAGTCGACCCCGCGTCCGCCGAGCGGGGCGGCGCGGCTGGCGGGGCCGGCGGCCGG 

CTGAAGCGAGAGCGCGACGCGACGCGACCGCGGCTTCCCGAGCTGCGCCTGGCCGNCCAG 

CGCCGCGGNCCGCCCGAGGCCTGGAGGGGTCCGGGCCGCCGTCCATGGTCGCGGCGTCCT 

GAGGCGGGGGACGCGCCCGGCGCCCCCGGCCCTCCTNCGCCTCCTCCCGCGGGGCGGGCG 

GCCTCCTCCGGCGCCTNCCCGCGCCCGCCCGCCGNTCGCCGCCGCCTCCCTCCCTCCTTC 

CCTGCGGCTCCCCCGGCTTTCGGAGCCCGGGGGCGGCCTGTGGCGCGCGGAGCCCGCGCC 

GGACTGCGCCTlvrTTTGGACCTTGAGGGGAAACATGCGTTTGCCNTGGATCGTTTGAAATT 

CTGAGTTT GGGATC CCCGNCCGGCCGNCTGGC I I I H CGCCCGCGGG I I 1 1 1 ICC ) I I I I 

TCCTTTTGC I I I I 1 1 I I CCTTCTT 

Sequence 16 

NGCCCCCGCGTCCGTTTTMTTATTTTGTTNGAGCCTGCANAGTAANGTTNTTAAAAATA 
TMCGTTCATACGCATTTTAATTAACTTTGAAAGTTCA TATG CATCAGAAAATTTATGAA 
AATTTG AATG AA AAAA ATTTCATCTATTTATTTTTCTAATTTT AATGG CAA ATTTAC ACT 
ATTATGGCTGATMTTCTGTGMCTTACCTTCTTGTTGACTGAnCTTTTTCCCTTAATC 
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CCAGCTTTAAGGAGATAGGTGAAGTTATTGTACAAAGTTAAGTGATACCATAAAGTATAT 

ATTATAMGTCATACATGGCTTTTGGACAGTNTTATATTTCAGTTGCAGTGCTGCATTCC 

ATTAAATTTCATAAMTGCTAGGGAAMTGTGTTTGATAAAATTTTNTGCAGTGAGAAAT 

GACAGACTGAGTGCCTGACAATTTAAGCCACATATGAAAGTATGCAAGTAAAGANTTCAG 

GTCCTTAATGTCATCTATATCATGGTATAAAAG 

Sequence 17 

CCACGCGTCCGGACGAGACGAGCCCACTAGTGTCCCCCGAGCGGGCCCAACCCCCGGACT 

ACACCTTCCCGTCGGGCTCGGGCGCTCACTTTCCGCAGGTGCCCGGGGGCGCGGTCCGAG 

TGGCTGGCGGCGGCCGGCTCGGGCCCCTNTCCGCCGGGCTCGCCGGGCCACGACCGCTGA 

GCGGCAGCCACTGTTGGATCGGGCCCGGGGCGCGGNGGCCCAGGGCCAGACCCAAACCGT 

GGCGGCGCAGGCCCAGGCTCTGGCCGTTCANGCCGNGGCGGCAGTCCACGCCGATCAGGC 

CCACCGNGAGCGGAACGAG 

Sequence 18 

GGGAGTCGACCNCGCGTCCGGGCGGTGGGTGTCCGCTTCTCTCTGCTCTTCGACTGCACC 

GCACTCGCGCGTGACCCTGACTCCCCCTAGTCAGCTCAGCGGTGCTGCCATGGCGTGGCG 

GCGGCGCGAAGCCGGCGTCGGGGCTCGCGGCGTGTTGGCTCTGGCGTTGCTCGCCCTGGC 

CCTGTGCGTGCCCGGGGCCCGGGGCCGGGCTCTCGAGTGGTTCTCGGCCGTGGTAAACAT 

CGAGTACGTGGACCCGCAGACCAACCTGACGGTGTGGAGCGTCTCGGAGAGTGGCCGCTT 

CGGCGACAGCTCGCCCAAGGAGGGCGCGCATGGCTTGGTGGGGCGTCCCGTGGGCGCCCG : 

GCGGAGACCTCGAGGGCTGCGCGCCCGACACGCGCTTTTTTCGTGCCCGAGCCCCGCGGC 

CGAGGGGCCCGCGCCCTTGGGTCGCCCTGGTGGCTCGTGGGGGCTTGCACCTTTAAGGAC 

AAGGTGCTTGTGGCGGGCGCCGAGG 

Sequence 19 

NATGTNGNNCNAAAAAGGCCNGCNTTANAGGCCAGGAAACNCGTAAAAAGGGCNCGCGTT 

GCTGTGCGTCTTTTCCATAGGCTCGCGNCCCCCTGACCNAGTCATCATCAAAAAATCCGA 

CNGCTCAAGTCATGAGGTTGGCCGAAAACTCCGACAGGGACTTNTAANAGNATACCCANG 

GGCGNTTTCCCCCTGGGAAGGCTCCCTTCGTGGCGCNTCTCNNTGTTTCCAGACCCCTGC 

CCCGCTTTACNCGGNATTACCCTNGTCCCCGCCNTTTTCTTCCCTTTCGNGGAAAGCGGT 

NGGGCGCCTTCT CNTC AATTAGGCTTCACCGCCTGNTAANGGTATTCTCAAGTTNCGGNT 

GTANGGGTGCCGTTTTCGCTTCCAAAGNCTGGGGCCTTNTGTGCCACCGGAAACCCCCCC 

Sequence 20 

TTCGGGAGTCGACCNCGCGTCCGCCTGGAGCCGCCAGAGTTTCCGCACCCGGGAGGGAGA 

TGCGGCCGGGGCTCAGGCTCCTTGCAGTTGTMTTTAGATTCGAGAAGTGGTTTATCCTT 

TGACTGGAAAAGAAAAGTAGCTGCAGTATTCCCCCAGCACTTGCTGAGAGCATGCCGTAT 

GCCAGGCTGTGAGGCTCGAGAGACAAGCAGTGGAAGAGTTGCGGCCTGTTTCATCTCTGG . 

ATTGTAAATCTGAGCCTCCTTCTGGCCCCTGGAAGGGGACAGCATCACGATGGAATGATT 

CCTAACCAGCATAATGCTGGAGCCGGGAGCCACCAACCTGCAGTTTTCAGAATGGCCGTG . 

TTGGACACTGATTTGGATCACATTCTTCCATCTTCTGTTCTTCCTCCATTCTGGGCTAAG 

TTAGTAGTGGGATCGGTTGCCATTGTGTGTTTTGCACGCAGCTATGATGGAGACTTTGTC 

TTTGATGACTCAGAAGCTATTGTTAACAATAAGG 

Sequence 21 

CGACCACGCGTCCGGCAGCCGCGGGGCGGGCGGCGGCGGCGGCGGCGGCGGCCGGGACCC 

AGCGGGCCAGGTGGGGACGGCGCGGAGCGGGTGCGGGAGATGCCGTGCGGGACTGGGGCC 

ACCTGAGCCGCCCGCCTCGTCCCCGCCTTCTGTGGGAAGGATGTGCGCGCGGATGGCCGG 

TCGCACAACAGCGGCCCCTCGGGGGCCCTACGGCCCCTGGCTCTGCCTCCTGGTGGCCCT 

CGCCCTGGACGTCGTGAGAGTGGACTGTGGCCAGGCTCCCCTGGACCCTGTCTACCTGCC 

GGCAGCCCTGGAGCTCCTAGACGCCCCTGAACACTTCCGTGTGCAGCAGGTGGGCCACTA 

CCCACCTGCCAACTCCTCTCTGAGCTCCCGATCTGAGACCTTTCTGCTCCTACAGCCCTG 

GCCCAAGGCCCAGCCACTTCTTCGGGCCTTCTACCCAA 

Sequence 22 
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CGCGTCCGCCCGGTGCCTCCGCCCATGGAACGCGCGGAGTGGCGCCGCCGCGGCTACGCG 

CCGCTGCTCTATCTGCAGTCACACTGCGACGTGCCAGCGGACCGGGACCGCTACGTGCGC 

GAGCTCATGCGCCACATCCCGGTAGACTCCTACGGGAAATGCCTGCAGAATCGGGAGCTG 

CCTACCGCGCGGCTACAGGACACAGCCACGGCCACCACCGAGGATCCAGAGCTCTTGGCT 

TTCTTGTCCCGCTATAAGTTCCACTTGGCCCTGGAAAATGCCATCTGTAACGACTACATG 

ACAGAAAAACTGTGGCGTCCCATGCACCTGGGCCGCTGTGCCCGTGTACCGCGGTTCTCC 

CTCTGTGAGGGACTGGATGCCGAACAATCACTCCGTCATCCTGATTGATGATTTTGAGTC 

TCCTCAGAAGCTGGCAGAGTTTATTGACTTTCTGGACAAGAATGATGAGGAGTATATGAA 

ATACCTGGCATACAAGCAACCT 

Sequence 23 

CGCGTCCGGCTGGGCGAATNAGGGATTCCGGTTCACAATGGATGCTGATAAAGAGAAAGA 

TTTGCAGAMTTTCTTAAAMTGTGGATGMATCTCCMTTTAATTCAGGAGATGAATTC 

TGATGACCCAGTTGTGCMCAGAAAGCTGTCCTGGAGACAGAAAAGAGACTACTGCTTAT 

GGAGGAAGACCAGGAGGAGGATGAATGCAGGACCACCTTGAACAAGACTATGATCAGTCC 

TCCACAAACTGCTCTGAAGAGTGCAGAAGAAATAAACTCAGAGGCCTTCTTGGCATCTGT 

GGAGAAGGATGCAAAGGAACGAGCCAAGAGAAGAAGGGAAAACAAAGTCTTGGCGGATGC 

CCTAAAAGAAAAAGGGAATGAAGCATTTGCTGAAGGCAATTATGAAACAGCTATCCTGCG 

CTACAGTGAGGGGTTTGGAGAAGCTGAAGGACATGAMGTGCTGTACACCAACCGAGCCC 

AGGCTTATATGAAACTTGAGGA 

Sequence 24 

GGGAGTCGACCNCGCGTCCGCTCCCTCTGAGTTGCGCTGGGCTTGGCTGCTGCACCATGA 

CCCTGGAGGCGATCCGCTACTCGCGGGGCTCCCTGCAGATCCTAGACCAGCTGCTGCTGC 

CCAAGCAGAGCCGCTACGAGGCGGTGGGCTCGGTGCACCAGGCCTGGGAGGCCATCCGCG 

CCATGAAGGTGCGGGGCGCCCCGGCCATAGCCCTGGTGGGCTGTCTCAGCCTCGCCGTGG 

AGCTGCAGGCGGGCGCCGGGGGACCGGGACTCGCCGCGCTCGTGGCCTTCGTGCGCGACA 

AGCTGAGCTTCCTCGTCACCGCCCGGCCCACCGCTGTCAACATGGCCCGCGCCGCCCGCG 

ACCTGGCTTGATGTTGCAGCCCGGGAGGCCGAACGGGAGGGGCGCTACGGAAGAGGCCGG 

TCCGGGAGAGAGTGATCTGCTGCACCGAGGACATGCTGGAGAAAGACCTCAGAGACAACC 
GAAGCATTG 

Sequence 25 

GGAGTCGACCNCGCGTCCGGGATAACGAAGCTGCTACCATGATGATGGCTGATCTCATGT 

TCAGAAAACAAGACTATGAACAAGCAGTGTTTCATTTACAGCAGCTTTTAGAACGTAAGC 

CAGACTCCTCGAGTTCAGGGATCACACCATATTCCCAATATCAGACAAAATGCCACACAC 

ATGGATGTGGCAACATAGATGTTTATTGGTTGAATGGATCAGTGAATGACTGTAAAACAC 

CAAGTCAATTAAAAACACAGCAGGAGAATCGCTTGAACCTGGGAGGTGGAGGTTGCCGTG 

AGCCAAGATCACACCACTGCACTCCAGCCTCGGTGACAGAGTGAGACTTGGTCTCAAAAA 

CAACCACAAAATTTTAAATAATTATATGACATTATCTCGTTTGATTGATCTCCTAAGAAG 

ATGTGGAAAACTCGA 

Sequence 26 

ACCGTCCGGGGCCATCCAGGAGAGCCTCCTACCAGCACAGAAGGCCTGTGCCCCAGCGCC 

CTGAGCGAGACAAGCCGTTTTGATAATGACTTGCAGCTAGCCATGGAGCTCTCTGCCAAA 

GAGCTGGAGGAATGGGAGCTCCGGCTCCAGGAGGAAGAGGCTGAGCTCCAGCAAGTCTTA 

CAGCTGTCACTCACTGACAAATAGACCTTTCAGCCTGTGAGCCTCTGCACAAAGCAGAGG . 

CTGTGGGCTGTCACAGATGCTGTGTCAACCAGGGCCCTAGGGCTAAGGGCCTGCACCTTG 

CGTGCATGCAGCAGGCAACAACTGCCCCTTCTTTATGCAGAGGTGCAGAACCAGGGACTC 

CTGGGCCCATCCAGGCTGCTTCCTTGGGGTGG 

Sequence 27 

NCCNCGCGTCCGGCCGGCGATGCCGCGCCCCCGGGCCGGGCTGTAGCGGGGCCGCGGCTG 
GACGTGTGCGCCGGGCAGGCGGGACATGGAGGTGGTGGACGAGACGGAGGCGCTGCAGCG 
CTTCTTCGAAGGCCACGACATCAACGGTGCCCTGGAGCCCTCCAACATAGACACCAGCAT 
CCTGG AGG AGTACATCAG CA AGG AG G ATGCCTCCG ACCTCACACTGCCGG ACTCTCCCCC 
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AGACTCGGGCTCCGAGGCCTACTCCCCCCAGCAGGTGAATGAGCCCCACCTNCTGCGCAC 

GATAACCCCTGAGACACTGTGCCACGTGGGGAGTGCCCTTC 

Sequence 28 

CGCGTCCGCNAGGGAGGGCCGAGCGAGGCGCAGGCAACCGGGCAGCAGGCATGATGGCCT 

CGCCTAGTGACTCCAGCCGCTCGCTGACCAGCCGGCCCAGCACCAGGGGCCTTACCCACC 

TNCGCCTNCACCGACCCTGGCTGCAGGCCCTGCTTACGCTGGGGCTGGTCCAAGTGCTCC 

TGGGCATACTGGTGGTCACCTTCAGCATGGTGGCCTCTTCCGTCACCACCACCGAGAGCA 

TNAAGAGGTCCTGCCCGTCTTGGG 

Sequence 29 

ACGCGTCCGATTTTAGTNGCAAGGAGTCCATGTGTTCAACTCCAGCATTTCCTGTGTCTC 

CAGAGACACCGTATGTGAAAACAGCGCTGCGCCATCCTCCGTTCAGCCCACCTGAGCCCC 

CGCTGAGCAGCCCAGCCAGTCAGCACAAAGGAGGACGTGAACCACGAAGCTGCCCTGAGA 

CGCTCACTCACGCTGTGGGGATGTCAGAGAGCCCCATCGGACCCAAATCCACGATGCTCC 

GGGCTGATGCGTCCTCGACGCCCTCCTTTCAGCAGGCTTTTGCTTCTTCCTGCACCATTT 

CCAGCAACGGCCCTGGGCAGAGGAGAGAGAGCTCCTTCTTCTGCAGAACGCCAGTGGGTG 

GAGAGCAGNCCCAAGCCCATGGGTTTCCCTGCTGG 

Sequence 30 

CCGGTTNCTTGTNNGTTNATTGATTTAMATAGAATATCMTTGMTTTAGAAAATTCTC 

AAAAGCCAGTTTMTGCTGTTCATCTTTTAAGGCCAAMAMGTTTAATCCAGAGGCAGT 

CTTTCATTCTGCACTMTTTATAATTTAGATCAAAGMCTMTTATATATCTCAMTTTA 

ATAATAAAAAGGTATAGTAATGAGAATTAAATTTATGGTAAATTATATAACTCAGAATGT 

TAAAGTAACTTGGAAATTCTTAATCTAAGTTAAGTATC I I I I I ATTTCTTACTTGTCCTG 

TTTGATTTATTAAGGGAMAGAAMTTTTAAGGAGTTGC 

AAGTGGAATGTTTATTCACCCTTATTTATATACTTAAAAGACATTGTATTGGCCTGGTCT 
CGAACTCCTGACCTCAAAGGTGGATCCACCCACCTCGGCTTCCCAAAGTGCTGGGATTA 
Sequence 31 

GCCGGTGATTTTGMCMTTCTGAAATATTTCAGGTMGATTMTAACATCCMTTACAA 

ATATATGTTTCAATATTTTATACGTATGTCTACTTTGAAAGTTAAACCAATAGTATAGAA 

AGCCTAAGAATGAACACTGATTGGACATACTCACAGAAATTAAGGGAAAAACACATATTG 

TAAMTTCCTGTCMTGTTTGAGTAGAATACAGAAGTACATAGCAGTCTTCMTTTTTAA 

ACACAATTATGGGCTTATMCTGGACGTGACATGCATCATTTATTAGAACAATATTATTT 

ATTTATACTAAGTAAGGATATAAGATCACAGAAGCTTAGTGTTATAACGGAGACTTCACA 

GACATTCATAACTAATGTTTTCTAAGGCAAATAAGGGGCATAAACCAGAACTCATGGGTC 

AGTGCCAGAGGTMGTATAAAAAGGTTATGTATGAMGACATTTATTTATAGGAGAATTT 

CTGAGGGATTCTATGCCTTTTCAACTTA 

Sequence 32 

NCGTCCGGGAAACTGGTTCNGATGGTGTCTGCCCAGGAGCGCCTGACACGCACCTTCACA 

CGCAGCAGCCACACCTACACCCGCACGGAGCGCACGGAGATCAGCAAGACGCGGGGCGGG 

GAGACAAAGCGCGAGGTGCGGGTGGAGGAGTCCACCCAGGTCGGCGGGGACCCCTTCCCT 

GCTGTGTTTGGGGACTTCCTGGGCCGGGAGCGCCTGGGATCCTTCGGCAGCATCACCCGG 

CAGCAGGAGGGTGAGGCCAGCTCTCAGGACATGACTGCACAGGTGACCAGCCCATCGGGC 

AAGGTGGAAGCCGCAGAGATCGTNGAGGGCGAGGACAGCGCCTACAGCGTGCGCTTTGTG 

CCCCAGGAAATGGGGCCCCATACGGTCGCTGTCAAGTACCGTGGCCNGCACGTGCCCGGC 

AGCCCCTTTCAGTTTACTGTGGGGCCGCTNGGTNGAAAGGTGGTGCCCACAAGGTGCGGG 

CCCGGAGGCAC 

Sequence 33 

CCGCGTCCGCAGGAAATTGTTAAAAATAATTTGGGGGTGTTTATTGGGGAAGGAAACAGG 

GCCTTGACAGTGGAGGACTTGGMGACATGTAATTTAAGATATAGAGTATGATTGTTGGA 

AAATMGCATGGAGATCCAGMGGAATCTTAAGAGTTTTTCTATGCAAGTGAAGATGGAA 

GAAAATATGTATTTTACAMAGATAAATTACAAGTACCTTATTTGCTTTGCAAAATAACT 

TATCATGTTCTTCCACTATrrTTATTATATTTTAATTTTAATGAAA^ 
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ACACTAMTTTTAAATACATGGCTCAAGACAAAAAAATGGGGAAAATATTTTTTAAAAAA 
TCACCCCAAATCCTGGTACTCAGACATAACCACTATTCAAACTTGGCAGAGTAATATTTT 
TCCTGTCTGCATGTGTGCCACNATGTGTGCATGCATACCACAAAGTAGATGTTTTACTAT 
ATCTCCTGGGTTATTATCTGC I I 1 1 I ICCC 
Sequence 34 

ACGCGTCCGGGACAAGAGAAACATAAGCNGGAAAAAAGGAGGAGGAATAAACACACGCCT 

GTCCATAATAAAACTCGCTCTTGAAGACTCAGCGGCAGCCCTGCACCGGAGACTGACGAC 

TTGCGCGGCTGTGACCTCCGCCCTGCAGCGGACCCTCGACTGCCCTGCACTGCGGCTCTG 

GAGGCCCCGACTCAGTGCATGGGAAAGAAATCCTCACTATCAGAAAACAGAGGGGCAATC 

TGCTGCTCTCCCTTTCCGGCCAAACACGTCACCCATCAACCGGATACCTACCAAGAGGCT 

TTCAGAGGAGGCGCCCAAGGTCTCCCAGGCCCGCCCCTCCCCAATCACGCTCCGCTCAGC 

CCCCTCAAC7TTTGGCCTCCGGGAAGTTCGCAGCGTNTCTCACGCTTGGCAGGAAGTTCC 

CGCCAAGGCTTTCGGAAAATCCTTTAAAAAGCAACGCTTGCGCTGGGCGGGGCTTTGGTG 

Sequence 35 

CCCCGCGTCCGGTAGATTGCTTGTGGCTGGCAGTGAAGATGGTGGAGTTCAAC I I I UG A 

TATAAGTGGGAGGGCTCCCCTCAGGCAGTTTGAAGGCCATACAAAGTAAGAGACAGTTGG 

TTTCTGTGTGTTCTGGTTTTATTTTGTTGTAAGCTC I 1 1 I I I I CTCTGGACTTTGGTTAA 

AMGATAGAGATCAGTTTTATGGAGATTATTTGCCTATAGGTACTATATTTCCTGATTGT 

TCTAAGAGTGCTTAACTTGGGTTCCGTGGTGCAGTtTCATGGGGCTTATGAATTCCCTAG 

MTTGTATGTGATATTTTAGGAAATACACGTTTATCTAGGGAGCTACTCTGTAGCTTTTG 

GTTAACTTTAGTGGGGTCTGTGGCCCAGCTGAGATTATGAATTACTGACCTGAAGACAAC 

CTTACAGCTGGTAATGACAGCTCTATAGGCCTGTACTGTCTTAGAGGCTCTTATGTTGAA 

GTCAAGTANGAAGGTGGATTTTCTTCTTGAATTATAGTGTTTTGCCCCTTAATAA 

Sequence 36 

CNCGCGTCCGCGGACGCGTGGGGGCGAGGGCCGCTGGGGCCGCGAAGTGGGGCGGCCGGG 

TGQGGTACGAGCCGGGTCTGGGCTGAGGGGCGCGGCTTCGCGGTGGACCCCAGCCCGGCA 

ACGGGAAGGCGAGCTCTCCTCCACCGTCCAAAGTAAACTTTGCCGCTCCTTCCGCGGCGC 

TCCCGAGTCCTCGCCGCCGCCGGGCCGCCGCAGTCCGCGAAGAGCCGTCCTGCGTCAGGG 

CCTCCTTCCCTGCCCCGGCGCGGGGCCACTGCGCCATGGACGCCACAGCACTGGAGCGGG 

ACGCTGTGCAGTTCGCCCGTCTGGCGGTTCAGCGCGACCACGAAGGCCGCTACTCCGAGG 

CGGTGTTTTATTACAAGGMGCTGCACAAGCCTTAATTTATGCTGAGATGGCAGGATCAA 

GCCTAGAAAATATTCAAGAAAAA 

Sequence 37 

GCGTCCGCGGACGCGTGGGCCGCCGCCGCCGCCGCCGCCGCGATGTGACCTTCAGGGCCG 

CCAGGACGGGATGACCNGGAGCCTCCGCCCCGCGGNGCCCGNGGCTCGCCTCGGCCTCCC 

GGGCGCTCTGACCGCGCGTTACCCGGCCCGCCATGGCCCCCTTCTCTCTNGCCCGGGCNC 

•ACGCTCGACCCTGCGCNCGTATCGCCAGCAAGTCTGCTGCCTGCATGCTGCNTGGCTCGC 

GAGNNGTTGGCGCGCNTTGCNCTGNNGCCTGTTTGCACGAGCNGCCGCCGAGGCCCACTG 

CANGNTTANCTCGACGGTCTCCNAAGNCANACCGCNCCGCCGNCCCTTTCCAAGGNTCNT 

TTCNCGAGGGGANGGGCNCAAANGGCCAGGGGNCCCCACNCNTGGGGAAGAAGGGNGGAG 

TTNGNCNGATGNGNATGAGNAANCCTTGTAGNCAAGANCCCGTCCCNAAGNGGNAGNGNC 

AGC CCNG GGGAAGGGTTGGTTTTCNNANGNTAAACCGNAACCCCCCCGGAAGTNAACCGG 

NGATTTTMTTTATTTTTACCCCNCANANGNAAATAACCTTTNAANAACCATGGGNCATT 

NNAAANCAAAAGNTAAATTNGGGGGNAACTTAACCGNTAAACCANCCCAAAA 

Sequence 38 

CNCGCGTCCGGCGGGTCCCGCCGGCGGGTACCTGGGCACTGCGCCCCATCTGGACTGAAA 

TGGGGACACCCCTTCGGGGGTCCCAGGCTCCTGGCCGTATTGTTCTCC7TCTCCTCGTGA 

TAACTCCGCAGTGGAGGTGGATTCCGTCCAAGACGCCCAACGTGGCTCCGCGTAGCAATC 

AGCGCTGCAATCCTGGCGGTTACCTCAGCGGCGGCGTCTCTCTCTGCGCCTCACACTCGC 

AGCCCGCGGCCCTCCCCAACTTAGGGCGTTTACAAAAGAAACTACTCCAGACGCGCTGCA 
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AAGGGAGGCGCATGTGCCCGAAAGCTGGCGATCAGACGGGGGGGGGCATTCTGCATGTGT 

GATGTTTCTGGGGGCGGTGGGGAGTGTGTGTCGGGGTCGGGGGGCGGG 

Sequence 39 

NTGGTCTTTCTTCCTAAGAAGGAGATGGAGCCAATTTCTCACAGCTCGTGCCTCAGTACT 

GAGGGTATGGAGGAAAAGGCAGTCAGTCAGTGTCTAAAAATGACGCACGCAAGAGACGCT 

CGGGGAAGATGTAGCTGGACCTCTGAGATTTACTTTTCTCAAGGTGGACAAGCTGTAGCC 

ATCGGGCAATTTAAAGATCGAATTACAGGGTCCAACGATCCAGGTAATGCATCTATCACT 

ATCTCGCATATGCAGCCAGCAGACAGTGGAATTTACATCTGCGATGTTAACAACCCCCCA 

GACTTTCTCGGCCAAAACCAAGGCATCCTCAACGTCAGTGTGTTAGTGAAACCTTCTAAG 

CCCCTTTGTAGCGTTCAAGGAAGACCAGAAACTGGCCACACTATTTCCCTTTCCTGTCTC 

TCTGCGCTTGGAACACCTTCCCTGTGTACTACTGGCATAAACTTGAGGGAAGAGACATCG 

TGCCAGTGNAAGAAAACTTCAACCCAACCACCGGGAT 

Sequence 40 

CACCGNNCGCGCTCCTTCTGCCGCCAGGGCGAGGCTGGCACCCGGCCAGCGCGGGCAGGG 

CCACGGGTGCCCGGCTGTTTCCCGGTTGTGGAAGGCGCTCAAGGTGCGCGGCCCGGGGCG 

CGCTACTGGGGGCGCCCTCCGCGGTGGGCAAGCGCGCCAGGGATCGGCCTGGGCANGCCG 

CGGGGCGCGCGAANGCTGCGCTTTCCCTACGCCCCCCCTCGCTTCCTCCGGCACGGCGGC 

AACGGAGATTTCCTCTCGGGGAAACTACGCGGATCCTTTTCGGGGATCCTCGCCCCGCCC 

CAATTTCTNCGCCCCCTCCCCTTTGCTGGGGCGCCTGGGCTGGCCCGCGCAGGGGA 

Sequence 41 

CNCGTCCGGTTCCTAACACAGACGAACTCAGCTTCCTTTGCCATGCCTCTGACTCGAGCC 

AGCCTTTCTTTTTATCCTCCG I 1 1 1 I CTCAGATTCCCTCCACACAGTCTTTTCCCAGGTC 

TAGATCGCTTCCCTCGCCCCAATCTTTCCTTGAACCCCTTTTCCAGTGTCCCAAAGCTGT 

CCACTCTTACGCCTCTTCCAGAAACACAGGCTACCTCCCCCCAATTCCCCAGTGCCACTC 

TGGATTGTAATATCCCACTCAGGAGCTTCTTTCTTGTAATTTCCCTCCCCCCACCCCCCA 

GCGTCCCCGGGTCGTCTGTnTCTTCTATGAAGCAAATATTACTCATCAAATATAGGAAC 

AAAGGCCTAAGTCCTTTCTGNGCTTTATTTCTTNGGGTGACTGGATCTTAGATCCTATCA 

TTTAAGTAGATGATGGTT 

Sequence 42 

GGTGTCGACCNCGCGTCCGCGCTTCTCNCCTCGGCCCGTGGAGCCGGGGCGTCCGGGCGT 

AGCCCTCGCTCGCCTGGGTCAGGGGGTGCGCGTCGGGGGAGGCAGAAGCCATGGATCCCG 

GGCAGCAGCCGCCGCCTCAACCGGCCCGCCAGGGCCAAGGGCAGGCCGCCTTCGCAGCCC 

CCGCAGGGGCAGGGCCCGCCGTCCGGACCCGGGCAACCGGCACCCGCGGCGACCCAGGGG 

GCGCCGCAGGCACCCCCCGCCGGGCATCAGATCGTGCACGTCCGCGGGGACTCGGAGACC 

GACCTGGAGGCGCTCTTCAACCGCCGTCATGAACCCCAAGACGGCCAACGTGCCCCAGAC 

GGTGCCCATGAGGCTCCGGAAGCTGCCCGACTCCTTCTTCAAGCCGCCGGAGCCCAAATC 

CCACTCCCGACAGGCCAGTACTGATGCAGGCACTGNAGGAGCCCTGACTCACAAGCATGT 

TCGAGGCTCATTCCTCTNCAGCTTTCTTGCAGTTGGGAAGC 

Sequence 43 

GTCGACCACGCGTCCGGGAGCTGCGGCGCCCTCCCTTATCGCCTTGGCAACGACCCAGCC 

GCGCCGCGAGGAGAACCGGGAATGGAGGTCGTGGCGTGAGGGGCGCCGAGCGAGGGGAGG 

CGCGGGCCACGGGAGTTCCGGGAGTTCCGGCGTTGCCCGGCAGTCCGCAGTCCTCGGCGG 

GAAGGCTGTCCCGGCGCCTCAGGCAGCTCTGCGTGGGCCGGGGTGACTTCCTCGCGATCC 

CCTGCGCGAGGTGAAGGGCAGGGACCTTTGCCGCGCCTTCCACGCGCCGTGCCCCACCGG 

CGAAGTGGGCTCCATCTTCCTTCAAGACTTGTCGCTTCGCGGACAGGGCGCCCGTGGGTT 

CTTCCCGGCCCTTCCGTACCGTCCTCTTTCCAAGCGGGCTTCCGGAAAGCGGGTTCCTTG 

TCTTTCGGACGCATTTTCACCCCGCGCCGGGGAGGAGCTTNCCGGGNAAGGGTTCCACGG 

CGGCCGAGGGTTTTCC 

Sequence 44 

CGTCCGCGAGACTCCCGCGCCCACCACCCCCGGCGGAGCTGCTGCTGAGCCACTCAATCT 
GAGCCCTGGCTACTAATAAAGTTCGTTTAAAAATCATAATCATTCTTAAGAGAGCGAAAG 
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AGGGTGCGAACTAGCCGCTCGGCCCCGCANGGAGAGCTGGCGCGTCNGGAGGAGACAGCN 

GCGGCAGCGGTTGCGCCGCGACCAGGAGGAGCCGGTGGCGCCGGGCGGCGGGTCCGCGGC 

CGGTGGGGGACNGTGAGTAGCGGCTCGCGCTGCGGTGCAGCGGCGCTGCACTCACTCGCC 

CTCTCCAGGGGCTGGGGGTTCGTCCGGCTCCACTGGGGAAGACTCAGCTTCTCCCCGGGG 

TCCCGGGTTGTGCCTTACTCTCCGGAGTGGGCAGGGGTATCGAGGGCAGGGGCCTCCCGG 

CCCGGCTCCCCCATCCCCCGGTTCGGACCGACGAGCCGCCGGCTTCTTCCCTTCCCTGAG 

CACCGATCCCAAGTTCCA 

Sequence 45 

TCACCACGCGTCCGGGCGTCCGGGTACCCGAGGGCTCTCCCGCGTTGCTGGCACCGCTGG 

CGCCGCGGTCTCGTAGCGCATGGGCCTGCTCCGAGGCGGGCTCCCATGCGCTCGGGCCAT 

GGCGCGCCTGGGCGCTGTGCGCTCCCACTACTGCGCCCTGCTGCTGGCCGCGGCGCTGGC 

CGTCTGCGCCTTCTACTACCTCGGCTCAGGCCGGGAGACCTTCTCCAGCGCCACCAAGAG 

GCTGAAGGAGGCCCGCGCCGGGGCTCCCGCCGCGCCCTCGCCGCCCGCGCTGGAGCTAGC 

GCGGGGCTCCGTGGCGCCAGCCCCCGGCGCGAAGGCCAAGAGCTTGGAGGGCGGCGGTGC 

CGGGCCGGTGGACTACCACCTGCTGATGATGTTCACCAAGGCGGAGCACAATGCCGCGCT 

GCAGGCCAAGGCCCGCGTCGCGCTTGCGCTCA 

Sequence 46 

CCACGCGTCCGTGTCGGGCCGGCGCTCCCTTCTCTGCCNGGTGGCGAGTACACCTGCTCA 

CGTAGGCGTCATGAGGTCTCCGGTTCGAGACCTGGCCCGGAACGATGGCGAGGAGAGCAC 

GGACCGCACGCCTCTTCTACCGGGCGCCCCACGGGCCGAAGCCGCTCCAGTGTGCTGCTC 

TGCTCGTTACAACTTAGCAATTTTGGCC I 1 1 I I I GGTTTCTTCATTGTGTATGCATTACG 

TGTGAATCTGAGTGTTGCGTTAGTGGATATGGTAGATTCAAATACAACTTTAGAAGATAA 

TAGAACTTCCAAGGCGTGTCCAGAGCATTCTGCTCCCATAAAAGTTCATCATAATCAAAC 

GGGTAAGAAGTACCAATGGGATGCAGAAACTCAAGGATGGATTCTCGGTTCC I 1 1 1 1 1 1 A 

TGGCTACATCATCACACAGATTCCTGGAGGATATGTTGCCAGCAAAATAGGGGGGAAAAT 

GCTGCTAGGATTTTGGGA 

Sequence 47 

CGCGTCCGCGGACGCGTGGGCGGGGCCGCGGAGCCGGGCCGGGGCATGCGCCGTCTCCGN 

CTCGGGGCCGNCGGGGGCGCCCTGCTGAGCGCTACCCACGTGCGTCCGCGCCACCTCGCG 

GGCGACCCCGCGGCCAAGGCCCCCGGCGGAGCGGNTCCCGGGCGCCCCGAACTAGCCCCC 

AACTTTGGGCGAAGTTTGCCTGCGCCTCTCCCCGCCCCCACGCGGCGCGCCGGGGCCGCG 

GACGGNAGCGGCCCCCGGGGATGCGCCTTCCCGGGGTACCCCTGGCGCGCCCTGCGCTGC 

TGCTGNTGCTGCCGNTGCTCGCGCCGCTGATGGGAACGGGTGCGCCGGCCGAGCTGCGGG 

TCCGNGTGCGGCTGCCGGACGGCCAGGTGACCGANGAGAGCCTGCAGGCGGACAGCGACG 

CGGACAGCATCAAGNCTCGAGCTGCGCAAGCCNGACGGCACCCTCNTNTTCTTNACCGCC 

GACTTTAAGA 

Sequence 48 

GCGTCCGCTGCATTGCGCCCACCGACTCCACTATGTTGAAGAAATTCGACAAGAAGGATG 

AGGAGTCAGGTGGAGGCTCCAACCCA7TCCAGCACCTTGAGAAGAGTGCGGTACTCCAGG 

AGGCCCGTGTATTTAATGAAACTCCCATCAACCCTCGGAAATGTGCCCACATCCTCACCA 

AGATTCTTTATCTCATAAACCAGGGGGAGCACCTGGGGACCACGGAAGCGACCGAGGCCT 

TCTTTGCCATGACCAAGCTCTTTCAGTCCAATGATCCCACACTCCGTCGGATGTGCTACT 

TGACCATCAAGGAGATGTCTTGCATTGCAGAGGATGTCATCA7TGTCACCAGCAGCCTAA 

CAAAAGACATGACTGGGAAAGAAGACAACTACCGGGGCCCGG 

Sequence 49 

ACGCGTCCGCAGAANGCTCTCAGATGGGACAGTCTTTTACT7TCATTCTCACCTCTGTAA 

ATAGCAGGACAGGTTGGGGTGGGCCTGACTTCTATTCTGCTTTCAGGGGGTACTTACTGG 

AAAATCAACTTAGGAAGTGAATTTGAGGGTTGGTGTAATTTTAAGCCCAGCCTCTGATCC 

TTGGTTGCACAAAGCCTAATTTCCAAATATTTCTAACAGATTCAAGACTGTATTGGCAAA 

GAGGTAGAGAGCTATGATAATGACATAAATTACATAAAAATCCAGTTGAATGAATAAGAA 

GGAATTTGGGCGTATAACCCATGGAACACCAATGGTGCTAAGAATTTGCCAAACCCTCAG 
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CTTCGTATAAGTCCCAAAGAACTCAGGCTTATAGCACTAAGCAAATTACCAGTGGGAGGA 
GGAGGACCTGCCACGGAGCTGGATAATTACATTTAAATAI I I I I GCCANTCTGTTGGAGA 
C 

Sequence 50 

NCGCGTCCGGCAGGCGGGTCGTAGAGAGGGTTCANGCCGTCTGTATATCTCCCCAGATAC 

CTGAAACTGACCACCTGAGTACGTTTTCCCATTGCTGAGCTGTTTCCCTGATATCTGGCC 

ATGCAACGGAGATCAAGAGGGATAAATACTGGACTTATTCTACTCCTTTCTCAAATCTTC 

CATGTTGGGATCAACAATATTCCACCTGTCACCCTAGCAACTTTGGCCCTNAACATCTGG 

TTCTTNTTGAACCCTCANAAGCCACTGTATAGCTCCTGCCTTAGTGTGGAGAAGTGTTAC 

CAGCAAAAAGACTGGCAGCGTTTACTGCTCTCTCCCCTTCACCATGCTGATGATTGGCAT 

TTGTATTTCAATATGGCATCCATGCTCTGG AAAGG AATAAATCTAGAAAG AAGACTGGGG 

AAGTAGAT 

Sequence 51 

GCGTCCGGAAGGTCCTTGAGCCATCTGGATGGCGGGCAGTCTGGCACACTAATGTGTTCA 

AGGTGCTGGTTGAGATCACAGATGTGGACTTTGCAGCCTTGAAGGCAGTGGTGAGGCTTG 

CTGAACCATACCTCTGTGACTCTCAAGTGAGCACTTTTACCATGGAGTGCATGAAGGAGC 

TCCTTGATCTGAAGGAGCATCGGTTGCCCCTGCAGGAGCTGTGGGTGGTGTTTGATGATT 

CAGGA 

Sequence 52 

GTCGAGCNCGCGTCCGGAAAAAAAAAATGACCCAGAGATATTAACAACTTGACCTGGTTA 

TACAGTTAGATATTTGCACAGTCTGGACTCAAACTGGAGGCTTCTGACTCCTCATCTAGG 

CTCCTCTCACTCTGCCATTGCATGGGTTTTCTCATATACCTTCTCTCATAAGGTTTTCAC 

AAATTTGTCACCGTCAAATAATTATCAAAATTATTCACACTATTATAGATGAAAATAATG 

TGCTTATAAAGATTAAGTAACTTTCCTGAGGGCGCAGGTATCTGGTTCAGATAACAACTA 

GCCTGGCTTAGAATAAACACATATTTCCTGGTTCTGAAGTTGGTGTCCTTCCTACCACTT 

TCTGCTGTCTCCTAAAGATAAAGAATGTTATTGGCTCACTGAATTAATCCATTCTGTTCC 

TGGCTGAAATAAAAATTGGTATATTCCTTACGTGAAGTGTCAACAGGAAGGGGGCTTTTA 

CAACTTCCTTT 

Sequence 53 

GGAGTCNCCACGCGTCCGCGCGCTGGAGGAGTGGAGCAGCACCCGGCCGGCCCTGGGGGC 

TGACAGTCGGCAAAGTTTGGCCCGAAGAGGAAGTGGTCTCAAACCCCGGCAGGTGGCGAC 

CAGGCCAGACCAGGGGCGCTCGCTGCCTGCGGGCGGGCTGTAGGCGAGGGCGCGCCCCAG 

TGCCGAGACCCGGGGCTTCAGGAGCCGGCCCCGGGAGAGAAGAGTGCGGCGGCGGACGGA 

GAAAACAACTCCAAAGTTGGCGAAAGGCACCGCCCCTACTCCCGGGCTGCCGCCGCCTCC 

CCGCCCCCAGCCCTGGCATCCAGAGTACGGGTCGAGCCCGGGCCATGGAGCCCCCCTGGG 

GAGGCGGCACCAGGGAGCCTGGGCGCCCGGGGCTCCGCCGCGACCCCATCGGGTAGACCA 

CAGAAGCTCCGGGACCCTTCCGGCACCTCTGGACAGCCCAGGATGCTGTTGGCCACCCTC 

CTNCTCTTCTCTTGGAGGGCGCTCTGGCCCATCAGACCGGATTAI 1 1 1 1 I CAA 

Sequence 54 

CNCCCGTCCGGAATNCCCATAGTTAGCTGCTGTGCTTTCACAACTTCTTTCTCTGTAAAT 

TCCTCGCTTGGCNCTGAGAAGGAAAAAAGATGTTCGTAAAGGGCTCAGCGAGGAATTTAC 

AGAGAAAGAAGTTGTGAAACCACCATAGTTAGTTGCTGTGCTTTGMTTTCTTTTGCTCA 

AATGGCCTCAGCGAAATCTTATTTGCCTATAGCATATCTACAAAAAATTTTCCTAGACCG 

TCTTTTCTACAACTGGATGGTAAAGTTGATTGAAGTGTGCCTCATGTAGCTTTATGTTTG 

GGGCATTTGAAGGGCTATGGCTGGACCAGAGTGTAATATAAATGCTTAATAGAGAGGGGA 

AAAGAAGAGTGTAAGAACCATTATAGGGCTGGGCTCACGCCTGTAATCCCAGCATTTTGG 

GAGGCTGAGGCAGGCGGATCACGAGGTCAGGAGTTCNAGACCAGCCTGACCAACATGGTG 

AAACCCCATCTCTCTAAAAATACAAAAAT 

Sequence 55 

GTCGACCNCGCGTCCGCAGCCTTCGGCTGCGGAGGGGGCTCGGCGGCGGCCGGCGGAGAA 
AGTTGCTCCGAGAAGAGGCTGGGTCGAGCTGGGCCGAGCCGGGCGCGCAGGGCGGGCGTC 
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GCGGGCGTCCCGGGCGGACGCGGCGCGGAGACTGCCGGCGCGTCCCGGGGGTTCCGATTT 

GAAGACCTTGCTTCTCATCACCCACTGGATTATGCCCCAGGCTTTCCTACCCAATGATCC 

TCTTGCAACACGCCGTGCTTCCTCCACCTAAGCAGCCCTCACCCTCGCCTCCTATGTCAG 

TGGCCACCAGGTCTACAGGAACCTTGCAGCTTCCACCACAGAAGCCTTTTGGGCAGGAGG 

CTTCCTTGCCTCTTGCAGGGGAAGAAGAGTTATCGAAGGGAGGGGAGCAAGGACTGTGCC 

CTGGAGGAGCTATGTAAGCCCCTGTACTGCAAACTCTGCAATGTCACCTTGAACTCTGCA 

CAGCAAGCCCAGGCTCATTATCANGGTAAAAATCATGGTAAGAAA 

Sequence 56 

ACCNCGCGTCCGGACCTGTTGGCGACATGGTGGCACCCGTGCTGGAGACTTCTCACGTGT 

TTTGCTGCCCAAACCGGGTGCGGGGAGTCCTGAACTGGAGCTCTGGGCCCAGAGGACTTC 

TGGCCTTTGGCACGTCCTGCTCCGTGGTGCTCTATGACCCCCTGAAAAGGGTTGTTGTTA 

CCAACTTGAATGGTCACACCGCCCGAGTCAATTGCATACAGTGGATTTGTAAACAGGATG 

GCTCCCCTTCTACTGAATTAGTTTCTGGAGGATCTGATAATCAAGTGATTCACTGGGAAA 

TAGAGGATAATCAGCTTTTAAAAGCAGTGCATCTTCAAGGCCATGAAGGACCTGTTTATG 

GGGTGCATGCTGTTTACCAGAGGAGGACATCAAGATCCTGCATTATGTACACTGATCGTT 

TCTGCAGCTGCAGATTCTGCTGTTCGACTCTGGTCTAAAAAGGGTCCAGAAAGTAATGTG 

CCTTCAAACTTTAAACTTTGGAA 

Sequence 57 

GTCGACCACGCGTCCGTCTCATTTGTGAAGAGGTCTGTCTTCCTGAGGAAGCAGGGGACC 

CTCACCTGTGAACCAAGTGTGCCATGGGAGCTGCTCCATGTCCAGGTCCAGGTCTCCTGG 

TCTGCAGGGAACGGCACAAGAGGGCTGGCCTAGGCCAGGAGGATGTGATCTGTCCTAGAA 

GGGGGCTGACCTGCTTGCTGACCCCGCTTGCTGCTGCCTGGCTGACCTGACTCAGCCACG 

GCTGTTCCGAGGGCCCTTCTGAGTACGAACTTCCAGTTGGAGGATCTGGGTGAAGACCCA 

GCTGCTTGAGATAGCAGCCTCTGGCTAGGCCCTTGGCGTGGCCAAGCCAATCAGGCAGGT 

TTAGAGCCTGGTGCCCCTAGACAGGTCCTGCAACCAAGAACAGGGGGTAGCCTTCAAAGG 

CCAGCCCTGCCTTCCAACCACCGCTCCACAGCGAGGGAAACCAAGGCTCTTAGGGCAGGA 

GGGTTGT 

Sequence 58 

CCNCGCGTCCGGGGAGCAGGGATCAACGGTGGTCCCCGTAAACCTGACAGTAAACCTGAC 

AGAGGCTGCAGGAGTGCATTTCCACCCAGGGTGCACTCAGCGAGTGGAACTCCACACCCG 

TTTCTTTGGAGTCAAGGCGCGACCTCTCAGGGAGGAGACTGCTCCTGGTTGCCCACTGCC 

GGGTCATCCCAGCTTGCAGTGGAACCCTCCGCAGCCTGGCCTCTTCCAGGGTAGCCGCTC 

ACTCCCCTCTCTCTTGTCCTAGGATAAGGCCGAGGAAGGCTGACGAGTTCCAGCTCTGGG 

GATGCCCTATCAGCTGTGTCACCTTGAACAAATCATTTCTCCTCTTGGGTCTCTGTTTCC 

TCCAGTGTGAAACGTGGTGAAGGCATGAGGGGCTATGGGAGCCCCAAGGCCTCTTTCAGA 

GATCTCCTCTGGGTCCCATGTGACCCCGTGGCTATCCCCAAAGGCAAGAGGGTCCCCAGC 

CCTGCACCAAGGCCCTGGG 

Sequence 59 

CCGAGGAAAGGAGTTGGTTCGCGCAGGTGCGGCGCCTGGGTCCCCATGGCGCTGTGGCGC 

GGCTCCGCGTACGCGGGCTTCCTGGCGCTGGCCGTGGGCTGCGTCTTCCTGCTGGAGCCA 

GAGCTGCCAGGCTCGGCGCTGCGCTCTCTCTGGAGCTCGCTGTGTCTGGGGCCCGCGCCT 

GCGCCCCCGGGACCCGTCTCCCCCGAGGGCCGGTTGGCGGCAGCCTGGGACGCGCTTATQ 

GTGCGGCCAGTCCGGCGCTGGCGCCGCGTGGCAGTGGGGTGAGTGCCAACGGGGCCTGGG 

TCTNTGAGCCTCCGAGGTCGGCCTTGGAGGTCGGGCGGAGCCGCGCAGAAACAGGGCTTC 

TCAGAGGNCCCCGGGAGGCGCTTGCTGTCCGCGCTGGCCCC 

Sequence 60 

CGCGTCCGGTGGGAAGCCAGAAGATAAAACCAAATGGCTGGGCACGTCTTTAGGTTATTC 

CTAGCTAAGAGTTAAGAGTTGTAAGCTCTCTCATTCTTTGTTCTTCAGCCTTAAACTATC 

TTTCCTTCTATTMCTTTATTTGTCTCAGTTACAATGATAGAGGTAACTTCACATACTAA 

AAGAAATTAGGTTACCATGTGAAACATTC7TCTTGGCTTGTGCTAATGTTATCAGATCCA 

MCAGCATCTGMAGAAAATTTTCCAAGTACGATGTTGTTCTCTTGTTTTCTGAAATACA 
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TATCATATGTTAAAGTGAGAGTTTTTATACATGTTGAMGAAGTTGAATGACATAACAAA 

TAGTTACTGAGGCCTCCATTTTCTTACTTCACAGTTAAMTTCCTGTTTCTCTTTGGGTA 

TAGGAGGGTAGAAAGAAGTGGGAGAGTAATAGCATTTTAAAACACAGAATCAAAAATCAT 

ATTAAAAGTAG 

Sequence 61 

CGTCCGGAGCATCGCGCACTGCGGCCGGGTCACCGACGTGGGCATCCGCTACGTGGCCAA 

GTACTGCAGCAATGCTGCGCTACCTCAACGCGAGGGGCTGCGAGGGCATCACGGACCACG 

GTGTGGAGTACCTCGCCAAGAACTGCACCAAACTCAAATCCCTGGATATCGGCAAATGCC 

CTTTGGTATCCGACACGGGCCTGGAGTGCCTGGCCCTGAACTGCTTCAACCTCAAGCGGC 

TCAGCCTCAAGTCCTGCGAGAGCATCACCGGCCAGGGCTTGCAGATCGTGGCCGCCAACT 

GCTTTGACCTCCAGACGCTGAATGTCCAGGACTGCGAGGTCTCCGTGGAGGCCCTGCGCT 

TTGTCAMCGCCACTGCAAGCGCTGCGTCATCGAGCACACCAACCCGGCTTTCTTCTGAA 

GGGACAGAGTTCATCCGGCGTTGTATTCACACAAACCTGMCAAAGCAAATTTTTTTAAA 

AGCAGCGTATGTAMGCACCGACACCCACTCAAAACAAGCTCTTTCTTTCNGGAAGGGTA 

TTAAGGAAT 

Sequence 62 

NCCACGCGTCCGCCAGNCTGTGAAGGATCCCAGACTGGCATATGCAGGAGGAAATGGGGC 

GGGCGAGGAGTAAGGACCCCAAAAAGCAGGGGTAGGGAAGGGCCCTCCCAGCGCCCCACT 

GTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGAAAGGTG 

AAAAGAGGAACAAAGAACATTCTCCTGGACGCCACCCACAGAAAGCCACGTGCAGGCTTG 

GCCCTCACC7TGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTCAGAGCT 

GGGGCAGCGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTAGTTACCGGCAC 

ACGGGCACCAGCGCCCCCTCTCAGCAAACTTNCACGTCTTATGAAATTAGCACTGGATTT 

CCACTTCAATTGGA 

Sequence 63 

CCCACTGTAATAGGGGCCTCATCAATGCCCCATGCTCACTGAATAAAGCACTGCCAGCGA 

AAGGTGAAAAGAGGAACAAAGAACATTNTCCTGGACGCCACCCACAGAAAGCCACNTGCA 

GGCTTGGCCCTCACCTTGGGGACCTTGGACACGGAGCTGGTTATGTCACATCTGGCTCTC 

AGAGCTGGGGCAGCTGTCTAGGAGGCCTGATGTAGAAAGCACTCAGCTAAGCCCTATTTA 

CCGGCACACGGGCACCAGCGCCCCCTNTCAGCAAACTCCACGTNTTATGAAATTAGCACT 

GGATTTCCACTTCAATTGGA 

Sequence 64 

NCGCGTCCGCTTCATCTTAGGATAAAGTCTAAATCTTTG l UN I GCTATTGTACTAAAC 

TCATAAATCCTAGGTTATAAAGATAAAGCCTTAAACTTTATCTCATCATCCAGCCCAATT 

TCCAGCCACAATGAAGTACTTAAAACTCTGTGTCTTTGTACTTGCTGTTCTCTTGGCCTC 

CAATTCCTTTTCATCTTTTCCATTCGGTAAAGTTTGTTTATCCCACAGGCCCTATCTTGG 

AAGCCTCCAGCAACTTCTCCAGACAGAGGTGTTAGCAGTGTAGGATCAGATTTCTCAACC 

ACGTCACTCCCATGTCTGGGTAGATATCTCTGCCCAAGTGTTCTCATAGCACTTGAGCAG 

TACTCTCTAAGCGCCCAGGATCTACCATGTTGCI I I I I I AAAATTTGATTAATTTATTTT 

7TTATACTGCTCCTTGTGGAGCANGGAGTGTTCCCAGAGTAGCCCACCATGTTATATTGA 

ATGGATCTGTGTGCATAATGCAGCTGTCCATCTACATCGTAT AI I I 1 1 GTCTCCTCAAGG 

GTAGGGA 

Sequence 65 

GTTTGTATTCTGGATACAGGGGATACTGGGGCTCGCTATGTGTGTGGAGCCATCCCTTCC 

TTGCCCCAGCCCCACCTCCCTCTCAAACCCTCTCTGGCTCTTTCTGAGCTTCCTTTCCTG 

CTCCCCAGCTTGCCCAGTGCTCAGTGCCCCACTTGGCTCTTTTGCTACTTCGGGTCAGGT 

GGAGCCTCTTGGGAATGTGAAGTGCCTTACAGAAAGATTGCACTTCAAGAGGAGAGGCTG 

CAGGGAGCCATCCTAAACCCAGAGGCCTGGAGCTTACCGTGTCACTTTACTTTTGTACAC 

AGGGGTCTCCTTAGTGCCCTCGAGAAGGATTCTTGGCCCTGAGCTTCTACTCCTGAGGCC 

ACCTCTGTGCAGCCCCAGCTCCCTCAACTCTAGGCTGTAGTCTCAGTGGGAAAGCCTGGC 

TTGGGGGTCTCCTAGGAATGTCCACCTGAAGGCACACTTGATAGGGGCTTGCACAACTTA 
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NCT 

Sequence 66 

CGTTCGGCAGCAGCAGCTGGCGGCCAACCGGCACAGCCTGGTGGAGAAGCTGGGGGAGCT 

GGTGGCGGGTGCCCACGCACTGGGTGAGGGCCAGTTCCCCTTCCCCACTGCTCTGTCGGC 

CACCCCCACGGAGGAGACGCCCACCCCACCCCCAGCCGMCACCAGCGACCCCCCGGCCGA 

AGACATGCTGGTGGCCATCCGGCGTGGGGTCCGGCTCCGCAGGACCGTCACCAACGACAG 

GTCGGCGCCCCGCATCTTATGATGGCGCCACCCTCCCCATCCTCTCAGGCCCCAGTGCGA 

GCAGGTGGCCTGGTCTGTGAGCCGCAGGCACTCAGAGCAAAGGCCCAGCCAGGAGAGAGG 

ACAGAGCCAGGGCAGAGGCCATGCCACTTTATGGAAAGACACCTCACTTGGATTCCAGCA 

TTTAAACAGGAAGTGACTTCTTAGCAAGCCTGGCCAGGACGGAGCCTGCAGGCCTGGGCC 

TGGTTCGGGGTCTGTTTTATGCTCTTCGGTCCCTTCTCTTCCTTCTCTGGGGCCGTGCCT 

CTTCCTACCCATAAAGCACCAAAACCAGGGCCGCTGCCATGACAGAGGGGCCAGGCTGGC 

CTTCCTTTCACATCCCGGCCTNTTCCAAGGCTGGTCCTGCCTNACTTCTTCTGGAATGTG 

GGCCCCCTCTCCCTTGCTGACCCCTCTTCCTNTTGCTT^r^^^TTGCTTTTCCANGCCCT 

Sequence 67 

GGCGTCCGG CAACCTAAAAATAGGATGCACCAATAGCATGTGGTTCCAAGTAAGTTGTGA 

TTTTTATTTTGAGACAGTGTTCCACTGGAAGGGAGGGAAGGGCTTACATTCACAGACAGT 

AAAGCAGGGCTGTGTAAGGAGCTACATTTACTTAACAGTCTCCTTTCCAGCTAGGTTTGT 

TTTATTTATTTGCAAGTCAGCTAAGAATTAACCTTTTAAACCATCTAAAACAGGCAAGCA 

ATATAAAGATTTCTACTAGTGCAAGGTAAGTGGTTTGAATATACAAGTGCCCTTTCCTGC 

CACCCAGTCTCACTACCGTTTTAGTCCTGCAGCTGGGTAAAGCCACTATTGTGTGGAAAC 

TCTCCCATGTGCCCTGTCTCTACCTCTGGACACACCAGCTCCTTCTTCCACCTATTCTAT 

TCCTCAGTTAAGCCAAGTGATCAGAAGTAGTATTAAATGGGTAGATAATTTTA 

Sequence 68 

GCCACGCGTCCGGCGCCGGGTGCGCCAACTACGCAAAGACCAAGCGGGCTCCGCGCGGAC 

CGGCCGCGGGGCTAGGGACCCGGCTTTGGCCTTCAGGCTCCCTAGCAGCGGGGAAAAGGA 

ATTGCTGCCCGGAGTTTCTGCGGAGGTGGAGGGAGATCAGGAAACGGCTTCTTCCTCACT 

TCGCCGCCTGGTGAGTGTCGGGGAGATTGGCAAACGCCTAGGAAAGGACTGGGGAAAATA 

GCCCTGGGAAAGTGGAGAAGGTGATCAGGAGGCCGGTCCACTACGGCAGTTTATCTGTCT 

GATCAGAGCCAGACGCGACGCGTCCACTTCGCAGTTCTTTCCAGGTGTGGGGACCGCAGG 

ACAGACGGCCGATCCCGCCGCCTCCGTACCAGCACTCCCAGGGAGAGTCAGCCTCGCTCC 

CCAACGTCGAGGGCGCTCTGGCCACGAAAAGTTCCTGTCACTGTGATTCTCAATTCCTGC 

NTGGG I I I I I I I 

Sequence 69 

ACCCANACCTGGGAGGAATTAATGGAATGCTTGNCCCTGGGCAGCCTTAGAAACAGACCC 

NAGCTTATCTAANGCTGCTCCGAGGCAGTGACCCAACTANGGCTCAGGAAGTCAAGAANA 

TTGACCAAGCTTATAGTGATCACCTCTTGACCTTTGTGTCACAGTCNTTTTGCTTTTTAA 

AACCCTTTTGTGAACCGNTTATGGCCTTTGATTCTGACAGGCATCNTAGTTGTGAAGGGG 

AACANGGGCAGGATATAATGTTTCGTTTACCAAATACAANAAAATCNGANGTACCCAGNT 

AGATCACAANATTTTTTGGGAGAAGGNCTNTTGGGTCTCTTCCAGGAGNTCACTTCANNN 

TTGGNAACTTGACAGGGGCTTGGGGAATTTANTATTCCCCTTGGGCGCAGGGNCNCAAAN 

GGGTGGCANTTTCCCTCCTTGGAGN I I 1 1 I I I I I CAAGAANTCCTTGCNTNGGGGAAAGA 

TGGTTACNANNATCCCGGAATTTCCAACCCCCTTCCCT AI I 1 1 1 1 1 GGTTTAAGG 

Sequence 70 

CGCGTCCGGCACATTAAAAAAAAATACTTATTTTTTATTATGGAAAGGTCTTGGAACATT 

CTGATAGTGAGCTTCCGGCATTCATTTGCTGTATCTGGCTTAGGAGATGCTAGGGTGGCA 

AGAAGAGGCACAGGCTTAGATGCGCTGGGTGGAGAGTTGGCTTTAGTAATGATGGTTGAC 

TCTMCGACTTGATTATCAGCTGTGCCTTTTTTCTTCCTGCCTTCTGAGGTGTGTTGCCT . 

GCATCCTAATTCACGTACTGAGTAGCAAGCTAAGCAGGTTGTAGCTGGAGATTGTAAGAA 

ATCCTGAATGGAAACCAAAGAAAGACTGTCACATCACATGATGTGCCCTTTTCAATCCCA 
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TGTCTCTTCCCAGTGGCATCCCAGTGCTGTCTCTGCCCCCTGCTGCTTCTGTAAAGATTT 
TCTGACACMGTMCTGCCTCATAGACCTTCCTTTTTATGAAATCCTGAGTTTTGGTTTG 
GGTACGTCCTTTTTAGAT 
Sequence 71 

GCGCGGCTGTCGCGAGGGCGGGGGTCGGGGCTGCAGGCGGGGCAGGGCTGGGTGGGGGCG 
CGCGACGCACCTGCCTGCTTCCTGCACGGGTGGNCCCCAAGGACTGCGGGGCCCCAGCCC 
AAAGCGGACCTTGA 
Sequence 72 

CGCCCCGCGTCCGGGCGGCTGGTGGGCGACCGGGCGCATCCTCATTGCATGTGCGGCGGC 
:• CCTACCTCGGCCCTGGCCTGACCCCGGCGGCCCTGCCCGCCCCTCCCTCCAGCATCATGG 
CCAGCCCAAGAACCAGGAAGGTTCTTAAAGAAGTCAGGGTGCAGGATGAGAACAACGTTT 
GTTTTGAGTGTGGCGCGTTCAATCCTCAGTGGGTCAGTGTGACCTACGGCATCTGGATCT 
GCCTGGAGTGCTCGGGGAGACACCGCGGGCTTGGGGTTCACCTCAGCTTTGTGCGCTCTG 
-TTACTATGGACAAGTGGAAGGACATTGAGCTTGAGAAGATGAAAGCTGGTGGGAATGCTA 
AGTTCCGAGAGTTCCTGGAGTCTCAGGAGGATTACCGATCCTTGCTGGTCCTTTGCAGGG 
AGAAGTA 
Sequence 73 

GCCCCGCNTCCGGAAATGTCCG A I 1 1 I I I I I I AATTTAATGAAATTGTTAATGAGGAAAA 

ATTTTTAATATAGGTCTTATCTACCACACATCCCCATAGATTTAAGGATTTTAATAGAAA 

GTCATGATGTATGTATTTAAGCCACGTTAAAAGAAAAAATATAACTATGGACCGGTATTC 

AGTGAATACAGTTTCATGGTTTTTAATTCTTTCAAAGCACATTAAAAATGGTGTGCTGAT 

AAACCCCAAGTMATTAACCCTTTTTCCGTATMATCCATTrrTTGTTTTGAAGAGGGGA 

MTTATATTTATTGNTGTTTACTGAATCCTGGTGTGAAAGCATATCAGATATGTATGAAC 

TGCTACTGCTGTACTTCCGATTTACGGACATCATTTTATTGCTATTTGTAGACCGTGATA' 

ACATGAACATGAGTCCTATTTATGTGGGCCTTCAGTGGATGGGCAGTGCCACTCANGTCT 

CTGGGGGTTTCCTCTCTTAATTTTAAAGTAA 

Sequence 74 

AGTCGCCNCGCGTCCGTGTGTTTCTCCTCGGTCCCCAACTCTACCTTCCCCAACCACAGT 
TCCTGTCCCCAGATGTCCTGATGCCCACCATGGCAGGGGAGCCCAATAGACTCCCAGGAA 
CTTCAAGGAGTGTCCAGCAGTTTCTGGCTATGTGTGACAGGGGTGAAACTTCCCAAGGGG 
CCAAGTACACAGGAAGGACTTTGAACTACCAGAGCCTCCCCCATCGCTCCAGAACAGACA 
"ACTCCTGGGCACCCTGGTCAGAGACCAACCAGCATATTGGGACCAGATTCCTGACTACTC 
CAGGGTGCAATCCTCAACTAACCTACACTGCCACACTACCAGAAAGAAGCAAGGGCCTTC 
AGGTTCCTCACACTCAGTCCTGGAGTGGATC I 1 1 I I CATTCACCCTNCCACCCTNCCATT 
GNTCATCCTGTGTACCCACCATCTAAGCAGTCTTCATGTACCCCTGAGGTCAAGCTTGGA 
A 

Sequence 75 

CCCGCGTCCGGGCTGGCATGGCTCTATATAAGATTGTTGCANAAANTCCCTACTACTTTT 
GGTCTGTGATGAGCTTAATTATGCAATCTATATNGGCACAGGATGAAAACCTCTCAAAAA 
CAATGTTTCTGCCCCTTGCTGAGAGAATGGTCGAAAAAATGGTGAAAGAGGACAAGATAG 
• MGCTGAGGCTGMGTTGAACTTTATTATATGATCCTGGAACGTTTGGGAAAGTACCAGG 
AGGCCTTGGATGTCATCAGAGGGAAATTAGGAGAGAAGTTGACAAGTGAGATTCACAGTC 
GGGAAAATAAATGCATGGCTATNTACAANAAGCTGAGCAGGTGGCCAGAGTGCAATGCCC 
TTTNCCGGCGCCTCTTACT 
Sequence 76 

GNTGGAGGGAGCTTTGCTACTCTGCTCTTGGCATGACTCCAGGA I 1 1 1 1 1 1 CTGGAATCC 
AACCTCTGTCCTCTTAGGAGAAGGAACCTGTCCTTGGTTCAGATGGGTGGGCATGAGGAG 
GAAAATTTCCATTAGTGTAGAAAAGTGCTGGACAGAATCCGGTTTGGAAAATTACAAATC 
' CAGTTGGTCAAAATAGGCCATTTCCTATGTGTGACCTATTCGTGGTATGCCAACTGGACT 
GCTTCCTAAACAGGACGAGGAAAGTGAGGAATATTTTTATATGAAAGCCTTAGCCTGTCT 
GGCACCCATGAMAAMCTATTTATGCACTCCTACmCACCCGTCTTTTTGCATTCTCT 
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ATTTGTAGCACAACAGAGTTGAATGCCACAAAACACCCCGTTTATAGTGAGCTGTTTTCA 
GTGACCAATATCAGAAGGAGGCTTGCTTCTGGACTAGCCTACTAATTGCCAGCAGCCACC 
ATTTTTCATG 
Sequence 77 

GGAGAGTGCCTTGCCGGACCCTCAGGACGGAGCTGCTGGGCTGCTACAGTGACCAGGACT 

TTCTGGCCAAGCTGCACTGTGTGCGGCAGGCCTTCGAGGGGCTTCTGGAAGACAAGAGTA 

ACCAGCTTTTCTTCGGGAAAGTGGGCCGACAGATGGTGACAGGCCTGATGACCAAGGCTG 

AGAAGAGCCCCAAAGGCTTCCTGGAGAGCTACGAGGAGATGCTGAGCTATGCCCTGCGGC 

CCGAGACCTGGGCCACAACACGGCTGGAGCTGGAGGGCCGAGGGGTGGTATGCATGAGCT 

TCTTCGACATCGTGCTGGACTTCATCCTCATGGACGCCTTCGAGGACCTGGAGAACCCTC 

CGGCCTCGGTGCTTGCCGTCCTGCGGAACCGCTGGCTGTCANACAGCTTCAAGGAGACGG 

CCTTGGCCACTGCTTGCTGGTCGGTCCTGAAAG 

Sequence 78 

CACGCGTCCGGAGAACGTGATTTCTCAGCCGAATGAGTTTGAACATACCCCACAGGAAGA 
• TGACTTGGGGTTCAAGGAAGAAGATTTGGCTCCAGATCATGAAGTAGGAAATGCCTCTCT 
CAAACCTGAAGGCATCCAGAACTGGGATGACTTATGGGTCCAGAGAGAGGGTCTAGGAAA 
GCCTCAGCCTCGGGACAGAGGCCCCCGGCTCCTGGGTGAACCACGCTGGGGCCAGGCTAG 
TAGTGATCGGGCCGCTGTGTGTGGTGAGTGTGGCAAAAGCTTCAGGCAGATGTCAGATCT 
GGTGAAACACCAGCGGACCCACACAGGGGAGAAACCCTACAAGTGTGGGGTCTGTGGCAA 
GGGCTTTGGGGATAGCTCTGC 
Sequence 79 

CGCGTCCGCAAGAAGATAACCCCAAACTCTTTTCTCAGAGAGTTTGTAGCCTAGTTTGGG 

ATAGATAAGATCCACATATTTAGTCATATAAGACTACAGGAGAGTAGAATAGATGCACCA 

GATGGTGTCGAATGAAAGTGGTACTTTGTAGACTATAAGTGCTGTAAATTCTAAAGGACA 

GGTTACTTTTGCCTGGAGTGGTCAAGAAAGATTTTATTTAAAATAAGGATTTGACGGGGA 

GACTTAGCAGTCAAAAGGGAGAAAGCGGGTAAAACAAATGTAAGCCATCATAAGAGTGCA 

TGTGGTTTGGAAGCATCAGGGAAAAGACTAGCCAACCTGAAGTAAAAGGTTCGTGCAAAT 

TGGGCAATCAATAGCATTAAGTTGGAAACAGCTTGGGGACAGACACATAAGAGGGCCAGA 

GTGTGMTAATTTCATCTAATACTTTATAGCACTTGGACATTTACAGAGCACTTTTCTC 

Sequence 80 

TNCTCTCTGCCCCCCCACATTCGTCCTCTTGATTCTCTGCTTCTCTAGCTCAGCCGCTGA 

GCTTCGTGCCTAGCCGCCACTAGTCCTTGACCAGCGTTCTGGCAACTCTTGCCTCCAAGT 

TCTTCCAGCTCCAGGCTGAGCCGATGGGGATTCAGTTTTCTGACATCACAGCTNAGTTCT 

TGATTTCTGCAGCAAAACCTTCAAGGCTTCCATCACCTCGGCTCTAGAGTCCATCATGCT 

CCTCTCCTATGACCTGCTAGGGCTGATCAAACGTTCTGTCTCCTGCCGTGCCCTGCCCTG 

CCCTGCCCTGAGCTTCGCTTAGCCTGTTGCAGGCTTTGTGTTTTCCTTCTTGTGCTGTTG 

GACCACGCAGCTCCTTCTACCCATAAAACCCCCTTCTCTAGGTCGGTGGAAATCTTGGTC 

ATCCTTCCGTGTCTAG NTAAACTGG NCACCTCCTCCACG AAAGCCTTCTAAAAACTCCTT 

CTCAGGGGAACTGGT7TTCTTTCT 

Sequence 81 

CACGCGTCCGCAAAATAGCCCCACATCCNGGCAAAAGGGGCCTTTCCCTTGGCCCAGAAG 

AAAAAGGAACAAGTGGAGTGCAGAAGAAAATCTGTACTGAGAGACTTGGGCCTAGCTTGT 

CTTCCAGTGAGCCAACCAAGGCTGGTGCTGTCCCATCCAGTCCCTCGACGCCAGCACCAC 

CCAGCGCCAAACTTGCCGAGGACTCAGCTCTGCAGGGTGTGCCCTCTCTGGTGGCAGGTG 

GAAGTCCACAGACTCTTCAGCCGGTATCCAGCAGTCACGTGGCTAAAGCTCCCAGTCTGA 

CCTTCGCTTCCCCCGCCAGTCCTGTCTGCGCATCAGACAGCACTCTCCATGGGTTAGAGA 

GCAACTCTCCCCTTTCACCACTGTCCGCTAATTATAGCTCACCTTTATGGGCTGCAGAGC 

ACCTCTGCCGCAGCCCAGATATCTTTTCAGAGCAGCGGCAGAGCAAACATAGGCGCTTTC 

AGAATACCCTAGTAGTCCTACATAAAATCTGGGTTGCTGGAGATCACTTTTGAAAACCAA 

G 

Sequence 82 
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ACGCGTCCGCACCCCTGTGTCCAATGACATGTGCACCCAGGTCGCCAAGCGGCCTGTGGA 

CACCCAGGCCTGTAACCAGCAGCTGTGTGTGGAGTGGGCCTTCTCCAGCTGGGGCCAGTG 

CAATGGGCCTTGCATCGGGCCTCACCTAGCTGTGCAACACAGACAAGTCTTCTGCCAGAC 

ACGGGATGGCATCACCTTACCATCAGAGCAGTGCAGTGCTCTTCCGAGGCCTGTGAGCAC 

CCAGAACTGCTGGTCAGAGGCCTGCAGTGTACACTGGAGAGTCAGCCTGTGGACCCTGTG 

CACAGCTACCTGTGGCAACTACGGCTTCCAGTCCCGGCGTGTGGAGTGTGTGCATGCCCG 

CACCAACAAGGCAGTGCCTGAGCACCTGTGCTCCTGGGGGCCCCGGCCT 

Sequence 83 

CCCCGCGTCCGCTCTTACGCATTACTCTATGTCTACTGTTATGGGTGTGTAATTTTATAC 

CATAGATGTTTACTCTTTAMCAGACACTTCTAGTCTGTTTTATTTCATGTGTCTGGGAG 

CGGATAAAGTGTGAGGTTCAGGGAGAAAGAGAGGTCTGTCTCAATGCCTTGGCACGGCAT 

GAAGACAATCTCCCCTCCTTGTCCCCTTTCCCTGCTAGCTCCTGATGACTGACAGATTCA 

CAGCAGAACAGAAAGGACTGGGAAGGGATGGAGGTGGGACATCTGGCACTGACCTTCAGG 

GGCTGACCCTGTGGGGGAACATCTGCCCTGAAGAGTTGGAGCCTTCATGTGATGACACAG 

AGCTGAAGTGTGATATTCGGGAGGGGATAGAGAGTGCTTGGAGGTTTTCTGATTTTGAAG 

AATCCCAAGTCAGTC 

Sequence 84 

GTCCGGCCGCTTCCGGTCTCCCTCCCGGGCCGGCGCTGGCCTGACTGCGGCCCCGGTCCG 

TAGCACTCCGCCCTCCGCTTCTCCCGCCCTGTAGCCGCGAAGACTGCTTCAGCCTTTCCC 

TGTGCTGCCCCTGCCGCGCGATGGAGACGAGCTCGAGCTGCGAGAGTCTTGGCTCCCAGC 

CGGCGGCGGCTCGGCCGCCCAGCGTGGACTCCTTGTCCAGTTAATGTGTTAAGAGCCATT 

GACATTTGAAGATCATCAGAAGTGAAGATAAAACATCTCAAAAATTATAATTGCCTCCAC 

TTCTCATTCAGAGAATTCAGTGCATACAAAATCAGCTTCTGTTGTATCATCAGATTCCAT 

TTCAACTTCTGCCGACAACTTTTCTCCTGATTTGAGGCCCATGCAGTCCAGTTCGGGAGC 

TAAGT 

Sequence 85 

CCGCGTCCGCGTGAGGTGTGGGTGTTCGTTTCTCAGGTAAAACATGGCTAAAAGCTTACG 

GAGTAAGGTGGAAAAGAAAGATGCGTGCTGAAAAGAGAAAAAAGAATGCCCCAAAGGAGG 

CCAGCAGGCTTAAAAGTATTCTCAAACTAGACGGTGATGTTTTAATGAAAGATGTTCAAG 

AGATAGCAACTGTGGTGGTACCCAAACCCAAACATTGCCAAGAGAAAATGCAATGTGAGG 

TAAAAGATGAAAAAGATGACATGAAAATGGAGACTGATATTAAGAGAACAAAAAGACTCT 

TNTAGACCAGCATGGACNGTCCCAATTTGGNTGAACCCAAAGGCAAANAAAAANGNTTTG 

ANGGCAMCCGANNGAAAAAAAAANGGGGAAAACCAACCNAAANCCCTTAAAANGGGCCA 

ANGGGGTTTGGCCNCTGNNAATNNTTTNIAACCCNTTTGAAAACCCCCCCTGGNNGANACC 

NCCCGTAAAAAATNTTCCCCCCN N I I I I I I I I I I 

Sequence 86 

CCACCGCCGTCCGAGGAGGGATCACCAAGCCGTGGGCCATGAAAGTCGGGGGGGGGCACC 
GCAAGCTTGAAAGCTTCATCATTGACCTTTCNCAAGAAATTACCGGGGCCAAGGCCGCTT 
GTT TCNAAAAC CCCCATGGAAACNAAAACGANGGGGAGGCANGGGAGAACCACCCCCTTG 
GACI I I I I I I INATTTCTTCTTGGACCTACCGAAGGAGGGCACAAATTGCCGGGAAGATT 
GCTTGCCTTTCCACCTTGGGACAAGGGGATCCTGGGACTTTTCCGGCCCGGGGTTCCCNT 
TCCCGTTGGCCCGGCAAGGGATTGGGTCNAAACAATTGACCCAAGGGAAGAATCCCCGGG 
GACGGTCACAACCGGGGNACCAAAGGNAAAGCTTTTTTGGGANGGGAACCC I I I I I I I I A 
ATT 

Sequence 87 

CCGGGTCCCTCCCTGCGGAGCCGCTGGTCCGGCTGGCGGAGATGTGACCGCGGGCCCGGC 

CGGCCTGCCTCAGGCGTCGCGTCAGCTCCCGTGTCCGTGCCCTTAACCCACACCGATGGC 

GGGATCCGGCTGCGCCTGGGGCGCGGAGCCGCCGCGTTTTCTGGAGGCCTTCGGGCGGCT 

GTGGCAGGTACAGAACCCGCTGGGTAGCGGCTCCTTCCNCTTCGGGGTATTCGGGTTTCG 

CTTGTTGGGGAACCCTTGGTTGGCCCCCGGGGCCCTTAAANCAGTTNTTGCCGCCAGAAA 

CCACCGGGGTTGCGGCCTTTGCCGCCNAGTATGNTTTCCGAAAAAAAAGGGCCGGGCTTG 
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NACAAGTTGCANGGGTCACAAAAACATCGTGAATTTTGATNGGAGTGGTTACAATCCAC 
Sequence 88 

CGTCCGTTTAATTATAACCTAGATTGTCTGGGCAACGGCAGGAACGGAGTGCCACTGTGG 

AGCAGATAACTGCAGTGGTTTTCTAGGAGTGCGGCCAAAGTCGGCATGTGCGTCAACAAA 

TGAAGAGAAGGCAAAAAATGCTAAGTTAAAACAGAAGAGACGAAAGATCAAAACAGAACC 

AAAGCAGATGCATGAAGATTACTGTTTTCAATGTGGAGATGGTGGAGAGCTGGTCATGTG 

TGACAAAAAAGACTGTCCCAAAGCATACCACCTCCTATGCCTTAACCTGACTCAGCCACC 

ATATGGAAAGTGGGAGTGTCCGTGGCATCAGTGCGATGAGTGCAGCAGTGCAGCTGTTTC 

CTTCTGTGMTTCTGTCCACATTCATTTTGTAAAGATCATGAAAAGGGGGCCCTGGTTCC 

CTCTGCACTGGAAGGCCCGCCTCTGCTGCTCGGAACATGACCCCATGGCTCCTGTGTCAC 

CAGAATACTGGAGCAAGATAAAATGTAA 

Sequence 89 

NGTCGCCCCGCGTCCGTAAAATGTTAAGTCCCCTAAAAGTGAATAAATTTTAAATACCTA 

CTTTTAAAA^TACTGTCTTCTAMTTGACATAATTGCTTTTCTTACCAAAAGAAGGAGAG 

GTTCCCCTMTTCCTTTTGGGCCATAGATCCGCTTTTAGGATCTGATTAAAGATGTGGAC 

TTTCACCGCAGGAAGATACCCATGGGCACACCATTCAAAATGTATCAGATTTCAACGGTT 

TTACAAACTCCCCCGGGATTAAGTCAAATGGGTGGATATTACAGCTGTTTTCATCAGATA 

TGGTTATTTGTTGACAACCATCAGGACAACAATGTTTATAGATGGAAGGATAACTTCCTG 

GTTTTTTTCCACATTGAATGTGGCTAGTTACATATCTCAATTTAAAATAAATTGTGGAAA 

GCCAAAAAAGTATGGTCAAGCTAACCTTGGGGTGGCTTTTACCTGATGCCTACAAGCACA 

GAAAAATAGTTTTTAA 

Sequence 90 

CCGCGTCCGATTATGCCAAGAGAAGGTATTACTTTAAGATGTGAAAAATGTAAAATGGAA 

AATTACATTACTAAGAAAAACAAAAAAACACAAACTGNNAAATTAGAANTGAAAAAACAT 

TGCCACAAATGCAACGCACATACAAGTCATAAAGAAAAAAAATAATTTAAAATGAATAGA 

MATATTTAAGACTTAAGCTTTGAAGAACACAATTTAGATGCAATMTTTCATTTTCTCA 

ACAAACAAGATTATGAGTTTCTAATCTTCAAACAAGTGATGGAATAGTGTTTCTTGAAAA 

AGAAGAAATCCATTTATTTGTTGATTCAAGATATATAGAAGCTGCTCAAAAAGATGCAAA 

MATGTACAAGTTCATTTATTGACAGCAGCTAATTTAAAAGATTTTGTAAGTAGTAAAAA 

TTACTTAAAMTTGGTGTTGAMAAGMTACTTAACTTTAGCTGATTTTAAAAAACTTCA 

AGCTTGATTTCCAAGTGCAGAATTTGTCAAATCAATGCGCAAAAATTAAGACTTATTA 

Sequence 91 

CCGTTGTCCCATATATCTTGTTCCAGCAGCCATATATCTTGNGGTCTACACGCCTAAAGC 

ATGATTTCCCTTGAAGTCTTGGGGTTGNTTAAAGGAGAGTCCCTTCAATATAAAACCTCT 

GAAATATTAGTGAGAATGGCTCACTAATGTGAACAATGTTTAAATTATTTATTTATATAT 

AGAATTACTGAATATTAGTACTGGGAAAATTTATAGAAATCATCTAGTCTTACCCTTCAT 

CTTACATATAAGAAAAATGGTCTTTTCTTCTAATCACATTTACAAAATATGATATAAACC 

TTGAGCATGAATGTATGAGCCTAATTAGAGAAACAGAAAATCAGCATGTCAGTTTTCCTT 

CATTCAAAATAACATAGTCTTTCTAAGCAGTCATTCTGGGAG 

Sequence 92 

ACCACGCGTCCGCAAGGCCCGCCCCTTACGTACTCGGAGCTCGGATCCCAGTGTGGACCT 

GGACTCGAATCCCGTTGCCGACTCGCGCTCTCGGCTTCTGCTCCGGGGCTTCTTCCCTGC 

CCGCCCGGGGCCCTGACCGTGGCTTCTTCCCCGGCCTGATCTGCGCAGCCCGGCGGGCGC 

CCAGAAGGAGCAGGCGGCGCGGGGGCGCGCTGGGCGGGGGAGGCGTGGCCGGAGCTGCGG 

CGGCAAGCGGGCTGGGACTGCTCGGCCGCCTCCTGCCCGGCGAGCAGCTCAGACCATGTC 

GCCTGAAGAATGGACGTATCTAGTGGTTCTTCTTATCTCCATCCCCATCGGCTTCCTCTT 

TAAGAAAGCCGGTCCTGGGCTGAAGAGATGGGGAGCAGCCCGCTGTGGGCCTGGGGCTCA 

CCCTGTTCACCTGTGGCCCCGACACTTTGCATTCTCTGGTCACCATCCTCGGGACCTGGG 

Sequence 93 

NCGCGTCCGCCAAGATGGCGTCCNTCATGGAAGGGCCGCTGAGCAAATGGACTAACGTGA 
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TGAAGGGCTGGCAGTACCGTTGGTTCGTGCTGGACTACAATGCAGGACTGCTCTCCTACT 

ACACGTCCAAGGACAAAATGATGAGAGGCTCTCGCAGAGGATGTGTTAGACTCAGAGGAG 

CTGTGATTGGTATAGACGATGAGGACGACAGCACCTTCACAATAACTGTTGATCAGAAAA 

CCTTCCATTTCCAGGCCCGTGATGCTGNTGAGCGAGAGAAGNGGA 

Sequence 94 

ACGCGTCCGCGGACGCGTGGGTGCGGGCCGGCCNCCCTGGACGAAAGAAGAGGGCCCCTC 

CAGGCCAGTCTGGGCACCCTGGGATAGCGGCTGCAGCCAGGCATGGCCGACTCTGCACAG 

GCCCAGAAGCTGGTGTACCTGGTCACAGGGGGCTGTGGCTTCCTGGGAGAGCACGTGGTG 

CGAATGCTGCTGCAGCGGGAGCCCCGGCTCGGGGAGCTGCGGGTCTTTGACCAACACCTG 

GGTCCCTGGCTGGAGGAGCTGAAGACAGGGCCTGTGAGGGTGACTGCCATCCAGGGGGAC 

GTGACCCAGGCCCATGAGGTGGCAGCAGCTGTGGCCGGAGCC 

Sequence 95 

CCCCGCGTCCGAGGTGACCTCCTTGGCCCAGATCATCTTAGAGCCAAGAAGCAGGACCAT 

TCGTGGTTTTGAGGCCCTGATTGAAAGAGAGTGGCTGCAGGCTGGTCACCCATTCCAGCA 

GCGCTGTGCACAGTCAGCCTACTGTAACACCAAGCAGAAGTGGGAGGCTCCTGTATTTCT 

TCTCTTCTTGGACTGCGTGTGGCAGATCCTTCGTCAGTTTCCCTGTTCTTTTGAGTTTAA 

TGAGMTTTCCTCATCATGCTCTTTGAGCATGCTTATGCCTCACAGTTTGGAACATTTCT 

GGGCAACAATGAAAGTGAAAGATGTAAGTTGAAGCTACAGCAGAAGACGATGTCTTTGTG 

GTCCTGGGTTAATCAGCCCAGTGAGCTGAGTAAATTCACCAATCCCCTCTTTGAAGCCAA 

CMCCTTGTCATCTGGCCTTCAGTTGCTCCGCAGAGTCTTCCACTGTGGGAAGGTATTTT 

CCTACGTTGGAATAGATCCTCTAAGTATTTGGATGAAGCATATGAAGAAATGGTTAACAT 

CATTGAATATAATAAAGAATT 

Sequence 96 

CCGCGTCCGGTTTNCCTGTTGGTTAGGCTGGTCTTGAACTCCTGACCTCACGATCTACCC 

ACCTTGGCCTCCCAAAGTGCTGGGATTACAGGCCTGAGCCACTGCACCAGGCCCACCCTG 

TCTCTATTTTCTAAAATAATAAATCTGATTTTAATGTGGCTGGATATAAATCATATCACA 

GTTGG ATTTG GAAGTTTGGGTTTTATTCCTAACTTTGATGGGMGCCATTTTAAGCAGAA 

AGATGATTTTAAAAGACCACTATATTTTCTGTGTGAAGAATGAACTGGGAGATTTCATAG 

TATTATTAACAAAAATAGAATAGTTGGGGATCTGGTTTGGCTTGGGAAATGGAGGAAGTT 

CAACTTTGGGCATGCTCCATTTGCATTGCCAAGACATTGCAGCAATTGGAAGTGCAGTCA 

GAGAGCTTAGGAGAAACACTTGGCAGATGGACATAGAGAAGTAGTACTCAAAGCTTGTGG 

ACATTGATTAAATAATCATACAGGAGTATGGGCTGACAAAAGATTNCAAAGAGAAAACCT 

Sequence 97 

GTCNCCACGCGTCCGGGACTCTCGGCCCTGGAGAAGGAGGTGGACTTTGACTCCGACCCC 

ATGGAGGAGTGCCTGCGGATCTTCAACGAGTCCACCAGCGTCAAGACGGAGGACAGAGGC 

CGGCTGGCCCGGCAGCCCCCCAAGGAAAAGAGTGAGGAGAAGGGGCTTTCGGGTCTGACC 

ACTCTGTTCCCCGGGCAGAAGAGGAGGATCTCCCACCTTTCCAAGCAAGGCCAGGAGGTG 

GAGCCCCCGAGGAGGGGTCCCGCGGTGCCCCCGGCCCGGCCCCCGACGGCGCAGGAGGTG 

TGCTACCTGCGGGCCCAGCAGGCGCAGAGGGCATCGGCGAGCTTGCTGCAGGCCCCCGCC 

AGGCTGGCAGAGAAGTCGCCCTNCGTCCACATTTCCCGCCCCTGGCGAGAA 

Sequence 98 

CGCCNCGCGTCCGGCAAAGCAAAAGGGAAATTATTTGGTGGATGGTAGCTCAAAATTGGA 

ACTCTTGTTCTMTTCAGTTACATTGGCTTTACCCTCCTTAGATTTTTCATCAAAGGGCT 

GTCCCATTG CMTCTTACTAAMCATTTTGTTAAAATAAACTCTTTTCC I I I I IATATTA 

ATMTTAGGCTTTTAAATAAAGATGTTATTCCTTTAAAATGGTGGGCTTACCATCATTGA 

AGATGTCACTCAGGTGGCCTTGCTTGATCAAAACGCCTTTTTTAAAAACCAAGCTTTAAA 

AACATGTTTATMTTTCATGAAGTACATATATATTGTTCCCATAGTCTTCAGCTTTAAAA 

CTATA AATATGCCCAAATTTTGTTATTTGCCCTACTTTAAGTAGGTTTATTGNGTTTGTT 

TTTTTCAAGTACTTGTTTTTCTCTGATAAGACTCAGGAATTCTGAAATGTGAAAATGNCT 

CAATT 

Sequence 99 
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CNCGCGTCCGAAATCGTTGCTACCAANTATTCAAAACCCTTTGAGTTTACATACTAGTTA 

CCTTAAAAATTANTNCCTGACNCTCNTGANTTTGGGNGGAAAGCCCTTGTI^JTCNNCTCTC 

TNATGNACTCTCATGGGTTTTTTTGTATGATTTGAATATNMTGTGCCTAMGMTTTTT 

GCTCTCTTAATCTATGNATACATACTTGAACAAATCATTCTTGCTTAACTGCTGATCTTT 

TGTAAAACTATTG 

Sequence 100 

GCCCCGCGTCCGGCTGGAAGCAACAGTTTGGCAGCCTGGGGTACACTCAGGTTATTCGTT 

ACAACTATTATTATTTGATGTCTTTTTTTAAACTCAGGTCATCCACTTTTGACTGTCATC 

CATGG AAGAGCTCTTATTAAAAGCCTCAGACTTTCGGGACCTATGATTCTTTGGCACAAC 

CTTTTGGAAAATTCTTAAGCAGGGATGAAGCAAACTTGATTGGAGTTGGGGAAAAAGAAG 

ACAGATTAGTATTTTTCATGCTGACAAAAAATAGCTGCTATGACTTTTCCCGCAACGTGG 

ACAG GGGC CAAGTGAAGCTGAACTGGTCATGGTCTGTCGCCCAGTGTTCCCTTGTCGTCG 

GCGATTTTGCCCCCGACCCTTCTTGGTGGGCTTAGTGGTGGCAATCTGTCTCTTCTACCA 

GACTCTGACCCTCCGAGGGTCGAGGAAGCTCACAGCCGCTGCCCCTGGGGCTGTCCCACA 

CACATCCACTGAAA 

Sequence 101 

CCACGCGTCCGGGCGTCTCGGCTCTTCTGTATCTCCCTGGCCTGGTCGCTCTCGGCTTCT 

GGGCCTCGCCCTTCCTGTCTGTGAAATGGACTCTGGGTGAATCCAAATGGGATCGTCTCG 

GGCTACGTCCTGTCCCTCCGGGACTACAAGTCCCAAGGTGCTCGAGGCGACCTTGGCTCC 

CCCTCCCCACCGGGACCCGCCTCCCTCCAGCCCAAGTCACGTCGTCTAACCTGTTCCCAG 

CTCCTGCCCCGCCCCGTTCTCCGCTCCCCAAGCCGGAGCCCGAGCTGGAGGAAGCCCCCA 

GGTGCCAGGATCTGCTCGGATCCGNGCCCGCTCCGGCCGGCACCATGGACAGTGAGGCAT 

TCCAGAGCGCGCGGGACTTTNTGGACATGAACTTTCAGTCGCTGGCCATGAAACACATGG 

ATCTGAAGCAGATG 

Sequence 102 

■ CCACGCGTCCGGTCCGGGGTGGAATCACGTCGCTGCGGCTGCCGACGACCCACACCCGGC 
CGGCCGCCTCCGCAGACCCACCTTGGCCGCGCGGCAGGGGGCGCGCAGAGCCCCGAGGGA 
GCGAGTCCCCGCGCGTGGCAGCTCGGCGGCTTCTCCCTTCGGGAGGTCCGGCTCCCGGCT 
CTCCGGACCCGCCTGGCGTCCTCGCCTGCGGCGGGGCGGACGACAGCGGCGCCCAGGAAT 
GGCTTCGGCGGGCAGCGGCATGGAGGAGGTGCGCGTGTCGGTGCTGACCCCCTTGAAGCT 
GGTCGGGCTGGTGTGCATCTTCCTGGCGCTGTGTCTGGACCTGGGGGCGGTGCTGAGCCC 
GGCCTGGGTCACAGCTGACCACCAGTACTACCTTGTCGTTGTGGGGAGTCCTGCCGGAAA 
CCCGCCAGCTTGGACATCTGGCACTGTCGAGTCCACGCTCANCAANCGATTGGCAGATTG 
C 

Sequence 103 

NCGCGTCCGAGAMTTGCMTTTTTTMTTTTTMTTTTAAGAGGAATTCGTGCCAGAGA 

GAACTATTAAGAAAGGGGTATATCCAGTCTAAGGATTATTAGGCTCAAGTCCATGAATAG 

GCTCTGGGAAGTTTGTAAACACTTGGAAATTATTTGCAAAATGTGTGTGTGAATGTGCTT 

TACCTTANAGAGTTCATGMTTTCATTAGATTGTTGMAGAGTTTTAGTATTAACAAAGG 

AAAAACAAACCACCACCATCACATAACAAAACCACAACAGTGATTTAATCTTTTCACCTA 

ACAATAAGTAAATTGAGGCTCTGATGGCTAAATTAATAGCCTGAGGCTACACAGTCAGTG 

GCAGAGCCCAGGGTANAGAGAGAACCAGCACAAGCCATTGTGGGAGCCGAGGGTAAAGAG 

AGAGCtAGGTGTTGTACCTTAGTAAATAAATCAGAA 

Sequence 104 

GNGTCGCCCCGCGTCCGGAAGGTGGAGACCGCTTACCCTGATCNGGGATGTATCGGCTGC 

GGGTGCGCAAGGCAGTCCAGGAGTGACCTGGGGCTGTGGAGAGCGACCCGTGGCCTTGTG 

TTTCAGAGTTTACCACCTAGGATGACTTCAGTGACTAGATCAGAGATCATAGATGAAAAA 

GGACCAGTGATGTCTAAGACTCATGATCATCAATTGGAATCAAGTCTCAGTCCTGTGGAA 

GTGTTTGCTAAAACATCTGCCTCCCTGGAGATGAATCAAGGCGTTTCAGAGGAAAGAATT 

CACCTTGGCTCTAGCCCTAAAAAAGGGGGAAATTGTGATCTCAGCCACCAGGAAAGACTT 

CAGTCCGAAGTCCCTTCATTTGTCTCCTCAAGAACAATCTGCCAGTTATCAAGACAGGAG 
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GCAATCCTGGCGGCGAGCAAGTATGAAAGAAACGAACCGGCGGAAGTCGCTGCATCCCAT 
TCA 

Sequence 105 

CGTCCGCGCAGCGCTTGAATCCCGTGGCCTAACCGTCCCTCGGAAGACCGGTCCCGCTCG 

GGAGGCTCTGCAGTCGCGCCTGGGGTCAGGGCCGGGGGCGAATGTGGCTCGCGTTCTAGG 

CCTCCCTGGGTTGGAMAAGACTATGTTAGCAANGTGTCACGCCATGCTTTTGCCAACTT 

TGCMTTAAAGGTTGACATTCCTGCATMGCATTTCTCTGTGAAAATGTCCTTGCCTCTT 

ACAGAGGAGCAGAGGAAAAAGATTGAAGAGAATCGACAAAAGGCTCTGGCCCGCAGAGCT 

GAGAAGTTATTGGCAGAACAGCATCAGAGGACTAGCTCGGGCACCTCCATTGCTGGCAAC 

CCATTCCAGGCCAAGCAAGGCCCATCCCAAAATTTCCCAAGGGGAGTCTTGTAAGGCCAA 

GTGAGCCATTGGTGTCATTTTCAAGCAACAGAATCTCAGTAGCTCATCTAATGCTGACCA 

AAGACCTCATGATTCCACAGTTTTCANGCAANGGGAATATGGAAA 

Sequence 106 

CCGGCCCTATCCCTATATTGTTTGCTTGTTTGGG ATAACCTAAAA I I I I I I ATCCAGTTT 

ACTACTAATTTGTTTTACCTGATGTATCTTCTCTTTCAATAATTTTATGTTACCTTCTGT 

TTAGAATAATATTTGCCACAGATATTTAGGTTTAATTCTGTGTTTGAATGATTCCAATGC 

CTT TCTCTACCCA CTTTGAACACTTCATCCTGGAATGGTTGGCTGATGTATGTCTCTAAA 

CAAI I I I I I I I I IAGGAGAAGGTATGTGGGTMTGTMTTCCTAMCCTTTGCTTTTCTG 

AAAAATCTTTCATTFGCCTTTATACATGACCAGATTTACTGGGTATATAGATTTGTTGAT 

GAAAAMGGTAAAAAGAGCAACTTTTGACATCCAGAGGTTTGTCTGGCACTCACAGCTAG 

CCCCGTGTTATTCTCCCTATT 

Sequence 107 

GCGTCCGTCTNAACCCTAAAGCTAAAAAGTCATTGTGAACCTTTNGGTCTGATGCTAAAG 

AAGGGAAAACAGGTACAGGAAATCCCATGTGGATGCTTGCTTNCAGGATTTCCTGCCATG 

ATTCCAGAATCCCACAGCTNCAACATGATTGCAAAAAGACTCCCTGCTCATTTTNCCTCA 

GCATGCACAGCGCTGTCCTGTCTCAGTTGCAACTCGACAGAGCCGCATTTACTCCAGAAC 

CCAATCCACACACCTGCTCATCCTGCCCCGAGAGGAGTGCCTGAAGCCAATAGCAGGGAA 

CTAGAGCAGACTTGGGTGGATCTTCATTGGATATTAGGTATCTTGCCCTAGATAGGCAAG 

CAGTGGCCTTACAGATGCTGACAGATGATCTGATTAGATGCACAGNTGCTGGGTGGCGTC 

TGGGGCCAGTCTATTGGNCAGTTCTGGGAGNGGGAACTATTTGGGCTCTGCAAAGATG 

Sequence 108 

CGTCCGCTCCCTGGCCCTGCTCCGGGAGCTGTCGTTGTCTCCGCCAGCAGCCCTGTGGCT 

GCAGGAGCGCCAGGCCCAGCTTCGCCACTCGCTGCCCCTGCAGAGCTTCCTGCTGAAACC 

TGTCCAGCGCATTCTCAAGTACCATCTGCTGCTGCAGGAACTAGGGAAGCACTGGGCGGA 

GGGCCCAGGCACTGGGGGTCGCGAGATGGTGGAGGAAGCTATTGTGTCCATGACAGCGGT 

TGCCTGGTACATCAACGACATGAAGCGCAAGCAGGAGCATGCAGCGCGCCTCCAGGAAGT 

GCAGCGGCGGCTGGGTGGCTGGACCGGACCAGAGCTCAGTGCTTTTGGGGAACTGGTGTT 

GGAGGGCCGCGTTCCGAGGAGGCGGANGGGNGGGTTGGCCCCCGGCTACAAGGGGGT 

Sequence 109 

AGAATTGTGTATGCCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCA 

AAGAAGATGGAACCAGACTGGAATTCTGTCTCCAGAGAGAAACCCAGCTGTTTGGGTCAA 

AGA CAGA TGCTTCAGACTTGGGTGGGAAGGTGAAAGATGGCTATTTAGAAAGCTGGTGGC 

ACGTTTTACATAAGGGAATGTCAGATGGGAGATGCTAGTTGCCATTTTAACAAAGCAGGT 

MATCGGTAMTTTTAAACTCTGTCCATGTTCTGTTAGAACTCAGGGACAAGGGATCCAT 

GAAAAAG 

Sequence 110 

ACGCGTCCGCACGGAGAGAACTGGNCCTGGAGCGGGGGCGCGGGGAGGGGGGCGTCGTCN 

TGGGTACAATTGCGCANGGGCAMGGTCGAGAGGTCGCGCTGNCGCCGTTTTATTTGAAG 

ACATCGTCCAGTTCTGACCATGGACTCNCAGCCATCGGCCCTTAGTTTCCATTCCCTCTA 

G NGGG CCTTCNGAGGGNTCTACTGACGTACCTCCTTCCCTTGGTACCGGACCGGGGAAGT 

GTTTTCGGGCGCGGGAGGTTCCGCATGCCCAGGCCTGGCCAGGGGA 
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Sequence 1 1 1 

CGGGCCCTTAGTCCAAGCCTTGATCGGCGACTAAGTGACGGCAGTGACTGCCGCCATGCC 

GAGCTGGACGGAAGNCACTTCTGAGAAGGGCGGAAGTGTCTCGGGCTCCTTAGAGGGAGG 

ACACCATATTAGTGCCAGTGGGGAAGTCACCGGGTGGAATTACTTCTTTGTGGAGTTTGT 

GCTGTAGCGACMTGAAAAACGAAGAGTCAACTTTTATAAAACAAAATAAAAATTAAGTC 

AAATCATGCCAACCTTTATTAGATCGGCTAGCAGGGTTAAACTTAATTCAAAGCCCCTGA 

TGAATCGGGCCTTCATTGCACCCCCAAAGGCTCCGCCACCCTGATT 

Sequence 112 

CGCGTCCGGGCGCCGGTACGCCTGGTCCCCGCGTGGAGTCTTTACTCAAAACAGCTCCCG 

CCTCAGGCCGAGATGAGGAGCCCTTCANAATAGCTGCTGTCTCTGGGNGGACCCGGGCGT 

CCTTGGCAGCCCAGCTGNTCTGGACAAAGCCCTGCCAGTCAGGCCTCGGCTGGCAGGAAC 

CATGGCAGAGGCTGGGGATGCTGCGCTATCGGTGGCCGAGTGGCTGCGGGCATTGCACCT 

GGAGCAGTACACGGGGCTCTTTGAGCAGCATGGCCTGGTGTGGGCCACTGAGTGCCAAGG 

CCTCAGCGACACCCGCCTGATGGACATGGGCATGCTACTCCCT 

Sequence 113 

TGTCGACCCCGCGTCCGCGGGANGTTCATGGAAACGCAGGACACGACAGAATTGTGTNTG 

CCTTGCCTATCACGGTACAGCACGAAGCCAGGCTCCTTTCTCCACCAAAGAAGATGGAAC 

CAGACTGGAATTCTGNCTCCAGAGAGAAACCCAGCTGTTTGGGTCAAAGACAGATGCTTC 

AGACTTGGGTGGGAAGGTGAAAGATGGNTATTTAGAAAGCTGGTGGCACGTTTTACATAN 

GGGAATGTCAGATGGGAGATGCTNGTTGCCATTTtAACAAAGCAGGTNAATCGGTNAATT 

TTAAACTCTGTCCATGTTCTGTTAGAACTCATGGACAAGGATCCATGAAAAAGACCTGTG 

ATGTTTCNTCTGGCGCTTTACTGGCCTGGGCACACCTACCAATCTTTTAGGATTTGACTG 

GTTCCATTACATTTCCT 

Sequence 114 

GTCGACCCCGCGTCCGTATCACTGTAATTTAAGGAAAGAAAACTTCAGTTCTGCCTCTGG 

ATACCAAGATGCCCATTGCTCAGTTCAGACAACTGATATTAAAATAAAGCTATGCTCCTT 

ACTTACTTCTTTTATTATAAACAAATTCCTTTGCTTTGGCTGATACTAGCTGAGTCATTG 

ATCATCATTGGTACCATGATATTGTAATCTATGCTGCTATTTGGCACAAGACTGAAGTTC 

ACACTACAGTAGAGAATACTATAAGATAATTTGCAATAAATACTGATAATAATAATACCA 

GATATTTTMCTMCTTTTTCTACCTTTATTAATAGCAATCAGCACACTTGAATGTGTAA 

ATTTCACAGTAACTTTAGGCAGAACTTAAGCTCCAGGCCACATTTGTATAAGAACACCAA 

GTATTCAAGGCATAAAGTCTGTTGTAAGCCAAAAAAA 

Sequence 115 

AGTTCAGTCTGCAGCAGTCCCTGCACCCACTTCCCAGTTGCTTTCATCTNTGGAAAAAGA 

TGAGCCCCGTAAAAGTTTTGGCATCAAGGTCCAGAATCTTCCAGTACGCTCTACAGATAC 

MGCCTTAAAGATGGCCTTTTCCATGAATTTAAGAAATTTGGAAAAGTAACTTCAGTGCA 

GATACATGGAACTTCAGAAGAGAGGTATGGTCTGGTATTCTTTCGGCAGCMGAGGACCA 

AGAMMGCCTTGACTGCATCMMGGAAAACTTTTCTTTGGCATGCAGATTGAAGTAAC 

AGCATGGATAGGTCCAGAAACAGGAAAGTGAAAATGAATTTCGCCCCTTGGATGAAAGGA 

TAGATGAATTTCACCCCAAAGCAACAAGAACTCTNTTTATTGGCAACCTTGAAAAAACC 

Sequence 116 

CCCGCGTCCGCACCAGGCCCGAGTCTTCCCTTCATGGAGGGTGACGTGAGCAGCAAGGAT 

AAGATGGGCAAAGGAGTGGATGGGACATATAAAAAAGCTCTTCAGGAAGCTGCAGCAAGG 

TTTGAGGAATTAAAGGCCCAAAAAGAGCTAAGACAGCTGCAGGAAGACCGAAAGAATGAC 

AAGAAGCCACCACCTTATAAACATATAAAGGGTCTCCCTCTGTGACCCAGGCTAGAGTGC 

ATTGCTGCAATTTTGGCTCACTGCAACCTCCGCTTCGTGGGCGCAAGTGATTCTCCTGCC 

TCCTGCCTCAGTCTCCTAAGTAGCTGGGATTACAGACATGAGCCACCAACGCCTGGCTAA 

TTTTGTGATTGGCAAAAAAGAGATTTTGTGACACATAAAGATGATATGAAATTCAACTTT 

CAATCAAGTATCCAGAAAATTTTA 

Sequence 117 

CCACGCGTCCGGCCCTTGCCCCTGTCNCACANGAATGGACCCACGGCCCCACCCAGCGCC 
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GTCAGCGCCCGGCACTGCCACCCGGGTCCGGGCCGCTGCCTGCACGTGGGATCCGTCGGG 

CAGCCGGGGACAGAAGAGACCCCGCCGTTGGGACGCAGGGCAGAGCCGGCCACCTAGTCC 

CTTCCAGCCAGCAGAGGCGAGGGAAGGCGTCACTGCCCCGGCGGGGAGACGGGCAGGACG 

CCCTGCCCCGCACCAGCAGCCTCCGCCGGGGCGCCCTCAGCTCCCTGCTTGGCTCTGTCT 

CTCCACACCCGGCAGGGCCCGCGGGCTGCCCCAGCCTGGGGGGGTCGTGGGCAGCTGCTA 

CTCAGTGCCAACCCCGTGGGGCACAGA 

Sequence 118 

NAGGGAGTCGACCACGCGTCCGGTGCGGAGCAAGCATCACACCATGGCGTATGAGTGTTC 

CTCTGTGTAGACTCAACCTGCGCCTCGCCGTCCCCCATTCGCACACCCGATGCCCGGGGG 

TCGCTACGGACTTAAAATCTCCGCACCGCACCCTCCACCTCAGAAACGTTCCTGGATCCG 

AACACTGCCCCCTGACGACCTAGAGAGATCCCGGCTCCCAGCCCACTGAGTGGCTTCAGC 

CTCGCTGGTAGGTCTCTCCTCCAAAGCTCTGGAAACAGACTCCTGGGAGTGANGGTAGNG 

GGGGAGCNGCAGGCACCGCCCCCTTTCCCCAAGTCCNCCGCCCACTTCATCCCTCAGGCA 

CCTNCCAACTCCTGGCCTTNTCTGCACGAGGCGCCTGCCCCGGGCCCCGCCTACAGGGGA 

CCCAGCTCTTCTTGCAGCCATTGGAAGNTGATCACCTGGGAGGTGA 

Sequence 119 

CACGCGTCCGGTTTTACTGCTCTTTGCCATGTGGTAAAAAGAGGCTGAGACATATTTAAG 

AATTCCAAGAGGATATTATGTGTCAGAATTTCAGACACTGATGAGAAGTTTTTAATTGTT 

CI I I I 1 1 ATTTGATTTTGGAATTCAGGTGCACTCTATTCAAGTGCAAGGATATCAGAAGT 

I I I I I I I I ATTTAAAAAA I I I I 1 1 I I I CGAGATGGAGTTTCACTCTGTTGCCCAGGCTGG 

AGTGCAATGGCAGCTTACTGCAACCTCCACCTCCTGGTTCAAGCGATTCTCCTGCCTCAG 

CCTCCCCAAGTAGCTGGGGATTACAGGCACCGCGCCAACACACCTGGGCTTATTCTAATT 

TAAGTAAGAAAATGGGAAGTCTTACCCATNTTTGGTCAAGGCTTGGGTCTTCGAACCTNC 

TGACCTTAANGGTGATNCCACCCCANCTTTGGCCTCCCAAGCCGTGCTNGGGATTATAGG 

GCATGMGCCCACCCANGCCCGGNCCCAGGATTTTTATATTTAAGCCCTTCTTGCTCTTN 

AAAAAAAAAAAAAGGT 

Sequence 120 

NGTCGCCNCGCGTCCGGGAACCTACCGGTACCGGCCGCGCGCTGGTAAGTCGCCGGTGTG 

GCTGCACCTCACCAATCCCGTGCGCCGCGGCTGGGCCGTCGGAGAGTGCGTGTGCTTCTC 

TCCTGCACGCGGTGCTTGGGCTCGGCCAGGCGGGGTCCGCCGCCAGGGTTTGAGGATGGG 

GGAGTAGCTACAGGAAGCGACCCCGCGATGGCAAGGTATATTTTTGTGGAATGAAAAGGA 

AGTATTAGAAATGAGCTGAAGACCATTCACAGATTMTATTTTTGGGGACAGATTTGTGA 

TGCTTGATTCACCCTTGAAGTAATGTAGACAGAAGTTCTCAAATTTGCATATTACATCAA 

CTGGAACCAGCCAGTGAATCTTAAATGNTCACTTAAATCAGAACTTTGCCNTTAANAAAG 

AAAATTGGGGNGTCCTGGGTTTA 

Sequence 121 

CCNCCCCGCGTCCGATCAATTCCTGGAATTTATGGTTATAACTTCGAAACAGAAGATGGA 

CTAATTTT A I I I I I ATCTAATTTTATTGTTGGAAATTTAGGAATTTACGGAAATACTAAT 

TTAMTTATTTAAAAAGATCATTAGAATCAAGTAAACCAATTTTGATGGCTATGATTGAC 

TCMCMGAGCTMTTATCCAGGTAAMCAATAGATAMATTTTTGCTAAAAAGTTTTTA 

GAMAMCATTTTTAACAACAAAATCCACTTCAAGAATAATTGTCGGAGCATACGATGAA 

GAGATGCTTTCAATTCAAGAAATCCTTGATTTAGCTTACAAATCAAACGTAAAGGTTGCT 

GTATATGGNAGAAATTATGACAATCTTTTAGAAATGAATCAACGATTAGCACAAAAACAA 

AATCTTGAAATACATTATCCAGAATTTTTTGATTTCAGGCAAGCTAATAAAATCGATAAT 

TTTGTMTCTTAATTACATCAACACCTGGAGCGAATTTACCAAAGATTTTTTAGAAT 

Sequence 122 

CGCGTCCGCGAAACTGAGAACCAGTTCTCCCGAAGCCGCGGGTCTCCGGCCGGCGGCGGC 

GGCGGCGGCGCAGGTGAGCAGGGCAGGGGGCAGCCGAGGGAGCGCGGGGAGCGGGGGCCG 

GGGGGCCACGTACGAGGGGCTGCAGGCCCAGCCGGGGCGGGACTCGCCAATCCTGCGTCC 

CCAGCTCAGGACGCGGACGCTGATCCGAAGCCCCTGGCCCCGGCTGGGTCAGCACTGGGA 

GAGCAGGCCCAGGTCCGCAGCCCGGGTGTGGGGCCCTCCCCAAATCCAGGGAAAGGATCG 
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TGGAGCGGGGTGGGGACTGAAAGCCATTTCCTTTCCCGTGAAGAAl I I I i.ATCAGTGCAA 
GTAACAAAATATT 

Sequence 123 

CGCGTCCGCTNAAAAAAATAATACCAAAAAAAGTTTTTGTAAAGACAACGCTCTCGCTGT 
GTTGCCCCGCCACTGTGGCCTCCTTAGCTTCTTCCCTGGGGCCTGCTGGACCTT TCCATA 
CTCCAGAAACTAAAGGGGGTCCAGGACCCTGCTTNAACCCTAGGATCCCGCATCi • ■ ■ ■ • 
I II I I I I I I I I GGACGCAGGGTCTTGCTGTGTCCCTCAGGCTGGAGTGCAGTGATTCACT 
GCAGCCTCAAACTCGTGGGCTNAAGTGATTTCTTTAG CCTCAGC CTTNTAAGTAGCTGGG 
GACTACAGTCATACACCAACATGCCCAGCTMNTTTCCTTTTTTTAATTCTTGTAGAGNA 
TGTTTGAGACGGCTTGGGCTNTGTTGCC 
Sequence 124 

CCNCGCGTCCGTGCTGATAAAACTCCTTTGACCTGACGATTGCT CTAAGTC CTAATTGCC 

ATATTTATATTCCCATAGTAAGAGTGTTTGGAGATAGTGTTTGAGCI I 1 1 I I GCTGGTGT 

TAAAAATGCATAATGAAAGATGGCACNAGAGAGGCATATTATATCCAATTCATGAAGTTG 

TTTGTGTTMCAGAAAGCTTATTTTAATCACTTAACATTGTTGATTTGTCTAATCACAGT 

AGCGCTATTGATTAGGAGCCTGACCTTTANATGGTTGACTTGTGAGTGTATTCAATATGG 

TGAAATAANGGTGTTTGATATATGGCTGCAGATTTTAGAAGGTGTCATTAGCAAAGGTAT 

ACGGAATAAAATANGGGTTATAGTATTCCTTACTCAAATTCTGTATGTGCTAGAGCTGGC 

TGGAGTCTGTTGGCATGCTCATTTGGTGTAAGGNCCGNTAAGGACTATGCT 

SGC|U6ncG 125 

GCCCCGCGTCCGCACTTTGTATTGATAACTTAAAATGGCATCAGTTTATCTTAGACATCA 

GCTTGCTTTTTATCTCCTTTTTTAGTGAGTGAAATAGAGCAACTAGCATGCCTGTGTTCC 

CAGCTACTTGGGAGGCTAAGGTGGGAAGATCAATTGAACCTAGGAGGTTGAGGCtATAGT 

GAGCTGTGATTGCACGACTGCACTCCAGCCTGGGCAATGGAGTGAGACTCCTGTCTCTAA 

AACAGCAACAACAAAAATAAAGCAACCATAGTGCATAAGGGAAA TTAAA TGTTCCCTATA 

GAAATATGTGTATGTCTGTGATAAGTGGTATGCAAATGCTAATTATTTTATAAAATAAAA 

GTTCAGMCTATTCTTATCATTGCCACTTGAACAATTAAAGGGTTTGCTTTATTTCCTAA 

TGTTTAATAGGAACCCTTTGCTTCAAACAGCCTTTGTTGAAATCATGTAAAAATTTGTTA 

ATAG 
• Sequence 126 

CNCCACGCGTCCGGCGGCCCAGCCGCCGCCTCCCGTTCTCCCCTCCGCAGCGGCGGCGGT 

GGCGGAGAAGGAACTCGACACGCACCGACCGCCCTCCCGCCCCAGCCGAAGCGGAAGCTG 

TAGCCCGCTCTGGGCCGGGGCCATGGGCGCCCCGCGCCGCCCGGGTCATGAGGACGGAGG 

CGGAGGCAGCGGGGCCGCCGCTCGAGCCCGGGGACTTTGTGCAACTGCCTGTGCCCGTCA 

TCCAGCAGCTCTACCACTGGGACTGTGGCCTGGCCTGCTCCAGGATGGTGCTGCGGTACC 

TGGGCCAGCTGGACGACAGTGAGTTTGAGAGAGCCCTGCAGAAGCTGCAGCTGACCAGGA 

GCATCTGGACCATCGACCTGGCCTACCTGATGCACCACTTTTGGCGTGAGGCACCGCTTC 

TGTCCCAGACCTGGGGTGTNGACAAGGGCTACAAGAACCAGTCCTTCTACAGGAAGCACT 

TT 

Sequence 127 

CNCGCGTCCGCGGTCGCGTGGGCGGACGCGNGGGTTCGTCCTGGACAAGTCTGGGAGTGT 

GGCAAATAACTGGATTGAAATTTATAATTTCGGNNAGCANCNGGGCGGAG AGAT TCNGTG 

AGCCCTGAMTGAGATTATCTTTCATTGTGTTTTCTTCTCAAGCAACTATTATTTTGCCA 

TTAACTGGAGACAGAGGCAAAATCAGTCAAGGCTTGGAGGATTTAAAACGTGTTANTCCA 

GTAGGAGAGACATATATCCATGAAGGACTAAAGCTAGCGAATGAACAAATTCAGAAAGCA 

GGAGGCTTGAAAACCTCCAGTATCATAATTGCTCTGNCAGATTGGCAAGTTTGGACGGTC 

T 

Sequence 128 

GGAGTCACCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGGTTAATCTTAGGCCTGAGGT 

TTGGGGCCGGGTGACAAGGAAGTTAAACTCGTCCTCCCTG CCAG ATTCTACCCCCTTTCG 

GAGCTGAGCTCCAGCCAMCCTGTGGAGTTTTCTTTGACCATTTTAGGACATGTTACTGC 
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TTQTGAGTTGGCTGCCCCAGCTGCTCAAACAAGACCTTTCTCCTGGGTTCGTAGTAGTGA 

AAAGGAGCAGCAGAGCAACTGAGGAGGAGGGCGGGTGGGAGGCATGGGACTGGGGCTTGG 

GGAGGTCAGGCGAGACCGGGGTGAGAGCTCAGAGAAGCTCCTGTGACTTCCATGCTAAGA 

TCTTGCCAGAGAACTCTGGTCAGTCCTCGGGTGTCTGGATGAAGTAAAGGAGTTAGGCAT 

TTCTTCCTTTGATTCTCTGGCTTACCT 

Sequence 129 

CGTCCGGCCCGCTCCGGGCGTGGGCGTGTTCTCGGCGGGCGTGCCTGGAGGAGGAGCTGG 

GTCCTTGTGCGGCTGCAGAGTCAGATGGGGCGGGGATTTCGGGGCACCGGGTCCTCACCT 

TCACGAGAAAGGCCCCACAGCACGTCCCCACTACCCGACGACTCACTCTTCGTGGCTTCT 

CTCTCCTCCCCAAGAGCAGGGGTGGGCCTGTCTCGCGTTCCCCTGCGGGAGTCAGGAAGC 

GTCCTTCCTACCTACCAGTCCTCCCCTCTGGTGTCTGGGGACACTTCCTGGGGGCCTTTC 

AGGTGGTTGGCGCCGGTGCAGGGCCTGAGAGCCTGGG 

Sequence 130 

GCGTCCGGTGGCATCATGACTTCTGGGGCAGTAGACTGAGCAGCAACACCAGCCACAAGT 

CCTACCGGCCTCTCACCGTCCTGACTTTCAGGATTAACTACTACCTCTCGGGAGGCTTCC 

ACCCCGTGGGCTTTCACGTGGTCAACATCCTCCTGCACAGTGGCATCTCTGTCCTCATGG 

TGGACGTCTTCTCGGTTCTGTTTGGCGGCCTGCAGTACACCAGTAAAGGCCGGAGGCTGC 

ACCTCGCCCCCAGGGCGTCCCTGCTGGCCGCGCTGCTGTTTGCTGTCCATCCTGTGCACA 

CCGAGTGTGTTGCTGGTGTTGTCGGCCGTGCAGACCTNCTGTGTGCCCTGTTCTTCTTGT 

TATCTTTCCTTGGCTACTGNAAAGCATTTAGAGAAAGTAACAAGGAGGGAGCGCATTCTT 

CCACCTTCTTGGGTGCTGCTGAGTATCTTTCTGGGAGCAGTGGNCATGCTTGTGCAAAAG 

AGCAAGGGATCACTTGTGCTGGGTTTAAMTGCCGGAATTTGACAATCTTTGGGTGATAG 

GC 

Sequence 1 31 

GTCCGCTGGGGGCCCTGGGGCTCTCTGCGTCGAGAGCGCTCGAAGACCCGGGATTCCTGG 

CCCGATCGCGGGCGGGGGGAGACCCCAGCTCCACCCCAGCTCCCGCCGGCTCGGGGAAGG 

GGCGGCCCCTTTAAGAGCGCGCGGCCCCGCCCGCCCCCTCCGGGCAGGATCCGAATTCCA 

GGGAGGCGGGGCGGAGACGGCGGCGAGGAGGAGGCCGCGGCGCGGGACGCATAGAGCTGC 

GGCTCGGGCGGCGCCTCCTGCGGCGGCCCGGCCCGGCTCCGGCCCCCGCTGGGGCAATGC 

TCCCCGGG 

Sequence 132 

TCGCCNCGCGTCCGGGCACACACATGCCAGGCTATTTTAAGAACTACTACAACTATGATA 

AAGCTGTGAATATGTAGCCATGAACCAAAACAAAGTCTCTGTCCTTGTGGAACATTTGTT 

CTGTCAGAGAAGACAGTGTGTTGGCTCACATTGTGGTCAGTGCTGTTGAGCAAAATAGGT 

CAGAGTAAGGGGGATGGAGACTGGTGGGAGGAATGCTGCTTTATCCAGGATGGGCAGGGA 

GGACTCGATGGTGTCAGCACTGAAGGATGTAAGATCTGCTGCTCTGGGGAGAGGAGCAGC 

ATGGAAGGAGTAGAGTGCAGAGGCCATGAGGAAGGATCAGGCTTGACTCCTTTGAGCAAG 

GGGGATGGGAAGAGTGACGGNAGAAAGAGGGACAGGCCACATGGCCTGGTGGCCTGTGCT 

GAGGCCTTGGGCTTTTCTTCAAGTGAGATGAGATGCCATTGGCCAGTTTGGGCAGTGATT 

TNATCAGACTTGGTTCAGCAGGACCATNCTGCTTGGCAATGTGGAGAGCANGCTGAAG 

Sequence 1 33 

CGCCNCGCGTCCGAACAGGCCGGGCACCAAGGCGCAGGATTTCTATAATTGGCCTGATGA 

ATCCTTTGATGAAATGGACAGTACACTAGCTGTTCAACAGTATATTCAACAGAACATAAG 

AGCAGATTGCTCCAATATTGACAAAATTCTTGAACCACCTGAAGGCCAAGATGAAGGTGT 

GTGGAAGTATGMCATTTAAGGCAGTTCTGCCTTGAGCTAAATGGACTTGCTGTCAAACT 

TCAGAGTGMTGCCATCCAGATACTTGCACTCAAATGACAGCMCTGMCAATGGATTTT 

TCTTTGTGCAGCTCATAAAACTCCAAAAGAGTGTCCTGCTATAGACTATACTAGACACAC 

ACTTGATGGTGCTGCATGTCTTCTGMTAGCAATAAATATTTTCCCAGCAGGGTTAGCAT 

AMGGMTCATCTGTAGCGAACTAGGATCAGTATGCCGTAGGATTTACAGMTATTTTTC 

ACATGCTTATTTTCATCATCGGCAGATATTTGGATGAATATGAAAATGAAAC 

Sequence 134 
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GCGTCCGCGAAAGCTGGGAAGCCAGGTCTACCTGCCCCAGACGAATTGGTGTACCAGGTG 
CCACAGAGCACACMGAAGTATCAGGAGCAGGMGGGATGGGGMTGTGATGTTTTTAAA 
' GAMTCCT TTGAAGATGATGCTGCTTTTTACAAAGCATCGTTTTAMGCACATGGCCTTT 
TTTTITTTMTTATTAGTGGTAGTMTATATAGAATGTATTACA 

GGTTGGGGAAAATGTGGTGACTGAGGTACAGGAMCTACTMTCTTGCCATCTTGCTTTA 
AGGTGTTATGGTGGCACAGTTACTGCTCGCCTGTTMATTTCAAATGTCCTGTTTGATAC 
TACTGGAGAACACTATTTTTAATACAGAAAAAGGTCCCTATAATGCACTTCAGAGAAATT 
AA 

Sequence 135 

TCGACCCACGCGTCCGGGAGTCCCCCCTGCCCCCCATCAAATGCTTCCTGCAATACTTTG 

CACACCAGAGACTGGGCCTCCCCAGATCCAGGGGGACAGGGGTCCCTGGGGGAGTCCCCA 

GGGCCAGCCCCTCCAGGCCAGCTGCACACACTTGACACTGATTTGCACAGTCTTGCACAA 

ATAGGGGGTAAGAGCCCAGTGGCTGGGGTGGGCAATGGGGGTAGCCTCTGGCCTAGGGAG 

TCCCCTGGCACTGCCAATGGGCACAGTCCCGAGCACACACCCCCTGGCCCTGGACCCCCA 

GGCCCCTGCCCCACCAAGCGAAGGCTGCTTCCTGCTGGAGAAGCCCCAGATGTCAGCTCT 

GAGGAAGAGGGGCCAGCCCCTCGGAGGCGCCGGGGATCCCTGGGCCACCCTACTGCTGCC 

AACAGTTCTGATGCCAAAGCCACACCCTTCTGGAGCCACCTGCTGCCTGGGCCCAAAGAG 

CCTGTTTTGGACCCAACAGACTGCGGTCCCATGGGGCGGAGGCTGAAAGGAGCCCGTCGC 

GTGAAAGCTTGAGCCCCCTTCGAAAGCCTNCGGAAGGGGCCAGGCCTGCTGAGCCCCCCC 

AGT 

Sequence 136 

CGACCCCGCGTCCGTGAGAATTCAGCTTTGGAGTCCCGGGTGAGGGGTTTTAGATAAACC 
CATCAATATCACCCACATTCTGTGACTCTTTGCATCACTCGTGTTATTTATTTATTTATT 
T ATAT TCTGCCTTGTTCCAGAAAAGTGTTTAAGGCAACAACGCTTGI I I I I I GGTGTTTT 
CTTTTGACATTTGMMTTTAGTACATTC 
AAGAGAAGGAACAATTGTTTTTAGTMGTTTTCTTTTTC^ 

TGAAATTAAAAGTTCTTGAGAGAAGGAGAGGAAGATACAGCAGACATAGGACTGAGCCAA 
GGAAGAGTCTGCCTGAGAGAGACGCTTGGCCTGTGCTTTGCTGCCATCCGTGCGGCCTTG 
GCCACA 
Sequence 137 

TCCGATTTTTAMTCTATTGGCCGTGrTGTCCTACCTGMGTTCTTCAACTGCCAAAAGC 

ACAGCCCTTTTTCTCTGAGCTGGTGGTTCTGGCTAACACTGACAGGGGTCGCTTGTTCCT 

GTGCAGTGGGCATCAAGTACATGGGTGTGTTCACGTACGTGCTCGTGCTGGGTGTTGCAG 

CTGTCCATGCCTGGCACCTGCTTGGAGACCAGACTTTGTCCAATGTAGGTGCTGATGTCC 

AGTGCTGCATGAGGCCGGCCTGTATGGGGCAGATGCGGATGTCACAGGGGGTCTGTGTGT 

TCTGTCACTTGCTCGCCCGAGCAGTGGCTTTGCTGGTCATCCCGGTCGTCCTGTACTTAC 

TGTTCTTCTACGTCCACTTGATTCTAGTCTTCCGCTCTGGGCCCCACGACCAA 

Sequence 138 

CGACCNCGCGTCCGGAAGGACCCTCTGAGCTATTTTGCGGCATACGGGAGCAGCAGCTCA 

GGCTCCTCGGACGAGGAGGATAACATCGAGCCGGAGGAGACGAGTCGCAGAACCCCGGAT 

CCGGCGAAGTCGGCGGGCGGCTGTAGGAACAAGGCGGAGAAGCGGCTCCCGGGACCTGAC 

GAGCTGTTTAGGAGCGTGACTCGCCCGGCCTTTCTCTACAATCCGCTCAACAAACAGATA 

GACTGGGAGAGGCACGTCGTCAAGGCGCCTGAGGAGCCTCCAAAGGAATTCAAAATATGG 

AAGTCAAATTATGTACCACCTCCTGAGACCTACACCACTGAGAAGAAGCCTCCGCCTCCA 

GAGCTTTGACATGGCAATAAAATGGTCTAACATATATTGAGGACAATGGTGATGATGCTC 

CACAGAATGCTAAAGAAAGCTAAGGCTTNTACCA 

Sequence 139 

CGACCACGCGTCCGGGCTGGCGAGCCCGGCTGAGGAGCCTCTTGGGTCGCACTTACCGCC 
GCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGCGACGCTGAAGCGG 
CCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCC 
Sequence 140 
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CGTGTCCGGTGAATGGGAGCGGAACTCACAGGACACAATGAGCCGGGTCACTGATGGCCT 

TGCTTTCTAAGAATCTCACAGTGAGCCCTAGAACTCTCTACGTGGTAACACTGTGTGCCT 

TTTTCAGAGAAGAGCCTATCTTAGATCTTAGCCTAACGTTGGGTCTATTGTGTTGCTGGA 

GAGACCAGCACTGACATTCATCTCAAAGCACATGGTATGTTTGACTCCTATGTTGACTCA 

ACTACCCATCTTGTACTGGGACACTCGC I I I I I I I 1 1 1 1 I I 1 1 I I GAGACGGAGTCTTGC 

TCTGTCACCGGGCTGGAGTGCAGTGGCACGATCTCGGCTCGCGGCAGCCTCCGCCTCCCG 

GGTTCAAGTGATTCTTCTGCCTCAGCCTCCTGAGCATGTGGAGCTCAGGCTGAAGGTGAT 

GTGGCCGCCC 

Sequence 1 41 

GTCCGATTGATTCTTCTATGATGCGTGTTCATTATACAATACACATTCTCGGAAAGGCAG 

AATTATTTGCTTTTATGATTGI I I I I I GTGACCTAAGAAACGGTTCTTCCCCCATTTTCC 

TATCCCAGGCCAMAMTATTCTCTCACGTTTTTCTGTAAGAGCTTTATAAGTTCAGCTT 

TTATATTGAGGTCTGTGATTCATCTTGGATTATTATGTGTAGGTTGTAAAATAGGAATCT 

AGGTCCAG 1 1 I 1 1 CGATGTGGATATTCAGTTATTGTGGGGCCAATTGTTGAAAAGTCTCC 

CCAAAGAACTGCTTTCTTATCAGAAAGATAAGATATATTAAAAGTGTATATCTAAGTCTG 

GGTTCTCTTTTCTGTTCTMTGGTTGATATTTTTATCCTTATGCCAGAACCACACTGTCA 

TGATTGCTGTAGCTTTATAATAGTCTTGMTCMGTTGCTTTTCAGTTTTGTATTTTTCA 

MATTGCTTTCGGTATTCTAAGTCCTTTGCATTTCTGCTAAAATTTAGAAGCAGCATGTC 

TACCAAAAGGAAAAAAAAAGCC 

Sequence 142 

TCCGGCGGAAGAAGGTGCGTCCGCGGCTGATCGCGGAGCTGGCCCGCCGCGTGCGCGCCC 
TGCGGGAGCAACTGAACAGGCCGCGCGACTCCCAGCTCTACGCGGTGGACTACGAGACCT 
TGACGCGGCCGTTCTCTGGACGCCGGCTGCCGGTCCGGGCCTGGGCCGACGTGCGCCGCG 
AGAGCCGCCTCTTGCAGCTGCTCGGCCGCCTCCCGCTCTTCGGCCTGGGCCGCCTGGTCA 
■ CGCGCAAGTCCTGGCTGTGGCAGCACGACGAGCCGTGCTACTGGCGCCTCACGCGGGTGC 
GGCCCGACTACACGGCGCAGGTGCGTGCACCCCGTCCGCACCCCGCCCCTGCAGCCGCCT 
GGTCTCCCCGCCTCCCCTCCTCCCTGCAGGTTTGCGTGGCTGAGGCTCCCACCTCCTGAC 
CTCGGGGGCCGAGAGCTTTGCGAGCTGACCCCGCTTCTTCTGGCTTTGCAGAACTTGGAC 
CACGGGAAGGCCTGGGGCATCCTGACCTTCA 
Sequence 143 

CCCGCGTCCGAGATCCTGTAGGTGAAGTTCTCCTGTGCTCCACAGCCACCCAGAGGAATT 

CCAAAACCAGCAGTGGAGGACTTGGGGAGGACAGGAGGGAAAACATGGCGAGTTCATCAG 

CTCTGTTTCCTTTATTAAAATATTTCTGTAATTGGTGGTGGGAAATTGAAGAAATCAAGT 

GATTGCATCAGCGCTGGAAAAAGCTGCCAGCACTTGGCAGTGGAAGAGAATATATGCTTT 

ATACTGGACTTTTTGAAAAAGAGGCTGAGTTTGGCCAGATTGCCGACCAGCAATGGAAAA 

ACTAATTAGGTGCCTTGCCTGTGAGCCAGACGCCCAGCAGGGCTGTGGCGCATGGCTCCC 

GCCGCCTCTGAAGAGGACACTTTCTAGTGAATTCAGTTCGTGCTACCCTTGAGCAGCCTG 

TGCTACAGCAGGCACATTTGTGAATCTCCCAGCCTGTGCCTGGCGTCNGAACTGTAGCTT . 

CCCAAAGAC 

Sequence 144 

CGCGTCCGAGTAAATCTGTTCTGCACTAAATTATATCAAAATAAATATAAATAATTATTT 

CGTATACCATTCGTTATATACATTTCAGTGTTTTTMTMTGTCMTTTCTTATATTTTA 

AGGAGAACTCTAGTTATTTTTATAAATCTAATTGACTTAATTTTGTGGGAATATAAAAGA 

AGTGATTAAAAACCTTTGGTTTAAAGTAGTTAATCCTGAAATCAAGCTCTGTAAATATTG 

TGTAGGGATATGGAGAAATCCTCAAAAAGAAAGAGCTAAAGAAAATGGCAGGGATGGCAT 

CTTGGGAGTATAACTGAAAGTAGGAAGATGTGGATAGAAGAGTCTTATTTTTAATCACAG 

GGCATATGTGCTATTTGAATTATTTTGACAAAAGTATAAAAATATGGAATTATGCATTGT 

GTGTGTGTGTGGTTTCATCTGTTTAAGAAATATGTGATGGAGACTGTCCTATCATCAGGA 

AATTATTCCAGT 

Sequence 145 

GTCGACCNCGCGTCCGCGGAGACACCGACCGCGGCGGCAGCAGCAGCAGCAGCAGCGAGA 
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GGCAGAGGCGGCGGCGGCGGGGAGGACAGCACGGCCGAGGCTGCCCAGAGGCGCCTCCTC 

CACACCCCCCGCCGCAGCAGCACCGGCGACAGATTTTTTAAAAAATGGATTTGGCCAACC 

ATGGACTTATTCTACTGCAACAGTTAAACGCTCAGCGAGAGTTTGGTTTCCTGTGTGACT 

GCACGGTTGCMTCGGCGATGTATACTTCMGGCACACAAATCAGTTCTTGCTTCATTCT 

CCAATTACTTTAA 

Sequence 146 

CCACGCGTCCGATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAACCATGTCCAGCCCTACA 

TCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGGCTTCACTGAGGTGCGAG 

ATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCATCAACGAGGGCGGCATTG 

ACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTGGCGTACCACCCCCTGAAGATGC 

AGAGCTGCTATGAGAAGATGGAGTCGCTGCGACTGGACGGGCTGCAGCAGCGATTTGATG , 

TGTCCAGCACGTCCGTGTTCAAGCAGCGAGCCCAGATCCACATGCGGGAGCAAATGGACA . 

ATGCCGTGTATACGTTCGAGACCCTCCTGCACCAGGAGCTGGGGAAGGGGCCCACCAAGG 

AGGAGCTGTGCAAGTCCATCCAGCGGGTCCTGGAGCGGGTGCTGAAGAAA 

Sequence 147 

NACCACGCGTCCGCCNCGCGTCCGCTTGCACCCGGTGAAGAGCGTGCGTGTGCTGAGGCC 

GGAGCCGCAGACGGCTGTGGGGCCCTCGCACCCCGCCTGGGTGCCCGCGCCTGCCCCGGC 

CCCCGCCCNCGNCCNCGCCCCCGNCCNNGCTGCGGAGGGCTTGGACGCCAAGGAGGANCA 

TGCCCTGGCGCTGGNCGGCACAGGCGCCTTCCCGNTGGACGTGGAGTAC 

Sequence 148 

TCCCAAGAGCTGCANGNNNCAGCCGCGACAGCAAGAACCGGNAGAGCCGGCAGACCGCGG 

CGGCGGCGGCGNCGGAGGCAGGAGCAGCCTGGGCGGGACGCAGGGNCTCCGCGGGCGCAG 

GAAGGCGAGCAAGAGATATNCTCTGAGAGCCAAGCAAAAGAACATTAANGGAAAGGGAAG 

GAGGAAANGAAGGCTGGATACCGGNGCAGTGAAAAAAGGCACTTCCAAGAGNTGGGGGCA 

CTCAACTACGCCACNAGACTCTGACCGGGTGGCCCAATCAAGCCAATGAAGAAACCTATA 

ACCCGGNTTAACCTNTNATTGGCCTGCCTTCTNNNTNGGGGGGTGGGGGCCCAAGCCCNC 

TTMCCCCCCAACCTTCNNTTTCAAACCTATCCCAACCTTATTCNAAAANGAAGGGGANC 

CTNAAGGATGGGGNTTTCCCCCAAGGCNAAAAAGGAAAAAAAGGGGGCCCCCNNGGGAAG 

CCTTCNTTCTTGGGGAAAAACAAGGCAAAAAAAATTGGAAGCCTGAAAACCCGGCTTCAA 

AAAAAAGGGG 

Sequence 149 

GGCCGAAAGGGGGGCAGGTGGTCGGGCCGCGCAAGCGGAGATGGAATGGGGCCCGGGCTC 

AGACTGGTCACGGGGGGAGGCTGCCGGCGTGGACCGCGGGAAGGCGGGGCTGGGGCTCGG 

CGGGAGGCCACCCCCACAGCCGCCCCGGGAGGAGCGCGCCCAGCAGCTGCTGGACGCGGT 

GGAGCAGCGGCAGCGGCAGCTCCTGGACACCATCGCAGCCTGCGAGGAGATGTTACGGCA 

GCTGGGCCGCCGGCGCCCGGAGCCGGCTGGTGGCGGGAACGTCTCANCCAAACCTGGAGC 

G 

Sequence 150 

CACGCGTCCGGCCTGCTGTTNACCTGCGGGACCCCAGGAACCTGGACTTGTTTCTCAAAG 

TGGTTCATGGAGATGTCACCCCCTACGACCTGGTGCGGATGAGCTCGATGCAGCTGGCCC 

CCCAGGAGCTGGCCCGCTGGCGGGACCAGGAGGAGAAAAGGGGCCTGAATATCATTGAGC 

AGCAACAGAAGGAGCCGTGCAGACTTCCAGCCTNCAAAATGACCCACAAGGGCGAAGTGG 

AGATTCAGCGGGACATGGACCAGACACTGACCCTGGAGGATCTGGTGGGACCGCAGATGT 

TCATGGACTGCAGCCCACAGGCCCTGCCCATCGCATCAGAGGACACCACGGGGCAAGCAT 

GACCACCACTTCTTAGACCCCAACTGCCACATCTGCAAGGACTGG 

Sequence 151 

Mill CCTTAGMTCTTCGAGAAAAAGATGAAGGTATTATTCTCAGTTTCGAGATCAGGA 

CTCCTCACCACCAGGCGGGGGCTTTAAGGTAGACACTACAGGGAATCTGATCTCAGGGTG 

ATCCTCTCCCTTCCACTTGCAAAAAGAGAGGAGCAGGTGGGCCACTGCTCTCTGAGATGT 

TAACACCCCTCACACTCCACGGGCATGCTTTGTCATTCTGCACACCGGTGTAGCTGCAGC 

TCTGTGTGMTTCAGATCTCAAGAGAAATGTAAATCMAAGTATGAGTTTCTTTCTTTCT 
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TGGGTGCCACAGTAGGAATGAAATGATGGGGACTTTTGGAAGCCCCTGGACTTGTGGCCC 
CTGTAGAAGAGCAGCTTGGGCAGGGTGTGATGGCCATCTCTGTCTCTAGGGGCCCTGTGG 
A 

Sequence 152 

TGGCGAGAGCGCTGGTGCCGAGTGAAAGATAACAAGCTCATTTTCCACAAGGACAGGACC 

GACCTGAAGACCCATATTGTGTCTATTCCGCTCCGTGGCTGCGAGGTGATCCCGGGTTTG 

GATTGTAAACATCCTCTGACGTTCCGGCTGCTGCGCAACGGCCAGGAGGTTGCAGTATTG 

GAGGCATCTTCTTCTGMGACATGGGCAGGTGGATTGGGGATTTTACTCGCAGAGACGGG 

GATCGTCCACAGACCCGGAGGCTCTGCACTATGACTACATTGATGTGGAAGATGTCTGCA 

ANGTGTCATTCAGACAGGCCAAACAGACCTTTCTGTTTTTATGAACAGGGCGTGTTATAT 

GTGCTAACCCATATCTAGGGGGCACCTTCAACGGGTTATTGCCCACCCAGCGGGACGGCA 

CTTTAATATGACGATGTTTCCGTNCCATAAACCGGNTTGNTTAAAGGGTAAAA 

Sequence 153 

GTTCGACCCACGGGCGTCCGCGGGACCGCCGTGGGNNNNACATACTATGCGNACAGGCGC 

GTTGNACACAAANGGCCCATTCCTGTAGCCTCACACTTGACTACACATGGGGGANTCACT 

CGGATTCGGNTCTCCACGTGGNNGNTCTTTGTTCTGTACTCTACGTAGCTTTGGCTTTTG 

TTTTCTCGTCGCAACAGGGCATGAGACTTCGTCGACCTTNGGGGTCTGTATAGTCTTTGA 

CTTACTACGTGTAGGTCTCAATACAAAGTGGGANATANTCATATCCGTCCGCGAAAAGTA 

ATTCTTGGAMAATTTACCCTTGTCTCCCGCNTTATGAAACGTGAACTAAGTMCTCACT 

TTGCCGCTGGGGCGCCTCTNTTTAACANTGTTCTTTTGNCGAAATCATCATAACCTTCAA 

CTGAAAACAATGTGGTCAACAAACTGACTATGGAGGTCTTAGGCTCNGTCTCTAAGATCT 

TTAACCTTGTTTATCGGCGCGTGCGGCGTNGTCCGAACGAAGAGACTATAACCCGCACTA 

TMCNAAMCTCTTTTTTAATCCCACCACCTCGTGAGGGANGGGCCCTAAGACTGAAACT 

GTAGTAAGTCCTATTGATTTGCGTAGGAGGANTTAGGAAA 

Sequence 154 

NCGCGTCCGATAGTCTACCAGCCTTACCTGGTTGATTACACTTGTAAAAGAAAGATTAAA 

AGCAGGCCAGTGACTCTGGTCTGCTTGAACATGTGAATGTAGTGGTTTGAGCAATCTGGA 

GTTTGCCCTAGTGTCAAATTCCAGACTGTCCATAGTGTCCAAAACCTGAGGCAGACACTA 

ATGTTAACCCCCAGCACCCCGTGATTGGAAACAAACCTAAATACGTATTGGGAACTTAAT 

AGCAATT1TMGCATTCTGATAGATTTTTTGTAGGGATGGGGTCATGCCATGTGGCCCAG 

GCTGGTCTGAAAACTCTGGCCTCAAGTGATCTCAAGCTTTGGCCTTCTAAAGTGTTGGGA 

TTACAAGGTGTGAGGCATTGCACCTGGCTTAGCCGTCTTGATTTGACATTGTAATGAAAA 

AGTGTGAGTCTTATTCTACCAGGGGCCCTTTTTTGTCCTCTTGAAAATNGAATAACCANG 

GGAAGGGGGAA 

Sequence 1 55 

CCNCGCGTCCGTCCATCACAGCCTCCGAAGGTGCTGGGATTACACGGCATAAGCCACTGT 

GCCCAGCCTGTTTTTAATA ATGATAT TMGTGGGTTTGGTTCATGTGTTATTAATCAGTG 

TTAATAATCGTAC I I I I I I I I I I I I I I I AAAAGAAACCATGGGTATTCTAAAATCAGGAG 

TCCAAATAAAAGAAAGTTCTCGGCTGTGCGTGGTGGCTAACACCTTGTAGTCCCGGCACT 

TTGGGG 

Sequence 156 

CGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGGAGGT 

AGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAACGGG 

AGGGAGTCTGAGGGTTGGGGACCGTCTGTGAGGGAGGGGAACAGCCGCTCGAGCCTGGGG 

CGGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGGCGT 

TGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCGAGGCCCGCGCTTGAGGGATCT 

GMGAGGTTCCTAGMGAGGGTGTTCCCTCTTTCGGGGGTCCTCACCAGAAGAGGTTCTT 

GGGGGTCGCCCTTCTGAGGAGGCTGCGGCTAACAGGGCCCAGAACTGCCATTGGATGTCC 

AGAATCCCCTGTAGTTGATMTGTTGGGAATAMGCTCTGCACTTTCTTTTGGCATTTCA 

AGTTGTTAAAAACAAATAGGA 

Sequence 157 
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CGCGTCCGGGTGTCNAGGCCATGGGGCAGCCCTGGGCGGCTGGGAAGCACGGACGGGGCG 

CCCGCGCAGCTGCCTCTCGTGCTCACCGCGCTGTGGGCCGCGGCCGTGGGCCTGGAGCTG 

GCTTACGTGCTGGTGCTCGGTCCCGGGCCGCCGCCGCTGGGACCCCTGGCCCGGGCCTTG 

CAAGCTGGCGCTGGCCGCCTTCCAGCTGCTCAACCTGCTGGGCAACGTGGGGCTCTTCCT 

GCGCTCGGATCCCAGCATCCGTGGCGTGATGCTGGCCGGCCGCGGTCTGGGCCAGGGCTG 

GGCTTACTGCTACCAATGCCAAAGCCAGGTGCCGCCACGCAGCGGACACTGCTCTGCCTG 

CCGCGTCTGCATCCTGCGTCGGGACCACCACTGCCGCCTGCTGGGCCGNTGCGTGGGCTT 

NGGCAACTACCGGCCCTTC 

Sequence 158 

CGAGCACGCGTCCGGGGACTCAGGCATGCACCACCACGCCCAGCTAATTTTTGTA l I I I I 

AGTAGAGACAGGGTTTCTCCATGTTGGCCAGGCTGGTCTCGAACTCCTGACATTCGGTGA 

TCCACCCGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCAGCC 

CCTTCCTGTTGAGTAAAAGGAAGAACTTCAGGGTAAGACACTGTACAGTGCCCAGCATCT 

GGAGAGCCGCCAGCATTACCCCTGCCTTAGGAGGTAGTCGTCTCCTCATCACTACAAGGT 

ATTGAAGCCTGAGGGCCCCTGGGCAGGACGATAGAGTGAGATTGCCCTGGGGACTCAGGA 

AAGGAAACATGCCGTATTTNTAGGGAAGGAGCTGCTGCTGCCTCTCAGTGACTCTGGTTC ' 

CAGGAGGGAAGAGCCGAGAGCTAGGGTTCCCTTTCATAGGGAGAAACCCAGCAGGGTTTG 

GGGTGTTCT 

Sequence 159 

ACCACGCGTCCGAAAGGAGTCGCGCCGCCGCCGCCGCCCCCTCCCTCCGGTGGGCCCGGG 

AGGTAGAGAAAGTCAGTGCCACAGCCCGACCGCGCTGCTCTGAGCCCTGGGCACGCGGAA 

CGGGAGGGAGTCTGAGGGTTGGGGACGTCTGTGAGGGAGGGGAACAGCCCGCTCGAGCCT 

GGGGCGGCGGACCGGACTGGGGCCGGGGTAGGCTCTGGAAAGGGCCCGGGAGAGAGGTGG 

CGTTGGTCAGAACCTGAGAAACAGCCGAGAGGTTTTCCACCCGAGGCCCGCGCTTGAGGG 

ATCTGAMGAGGTTTCCTAGAAAMGGGGTGTTTCCTCTTTTCGGGGGGTCCCTCACCAA 

GMGAAGGTTCTTTGGGGGGTCGCCCTTNTTGAGGGAGGCTTGCGGNTTAACAGGGCCAA 

AAAANTTGCCATGGGATGTCCAAGAATCCCCTGTAATTTGATTAAATGGTGGGGAAATAA 

AG CTTTG C AACT I 1 1 1 I I I GCNATTTAATTTGGTTAAAAACA 

Sequence 160 

TCCGCTCCCCTGTTTTCTTCCTTTTTCTTTTTGCTTGTATGCACAACGGTA 

TCGTMGAMCAAMTGCCAGTATTTTCTTAAGCCATGATGTGAAACCAATGACCCTGTG 

ACCACATGGCACAGMCACTAAATTTTGGTCCCATGGCTGAAACTTGAGGGTGACTAAAA 

GTAATGCCTGTGAAACATGATATCTATCTGGGATGGCCATTTGATCTCTAAAAGGAATTT 

TGTNCACTCCACAGMCTCCTATCTATAGTAAAATTTGATTTTTC 

Sequence 161 

CGTCCGGAAMATATTAAACMCTCATTTTAAGATTCAAATTAACTAATTCCTGCATATA 

TGACATTCCTTACATMGCGAACACTAMCAAAMTGGCTAGAAATGTCTTTTTCTTTCT 

TTTCTCTCTTTGTTGTTTAAGGTATTMGCACCGMTTATTACATGAGACTGGCAGATAG 

CTATTAATCCTCTTACAGATTTGAGAAAGTTGATTCTCAAATATTTATGCACCTTCTCCT 

TCATTGTTTTCTTTAMTCTGTCCTCTTMAAAGCTTCTTAAGAGCTCAGTTMTGCTTT 

TGACTTAACTAGGAGAAAAAGGCATGATAATACAGGCAAGATGGCATTGTTAGCAATTCT 

GGTAGGTGGTTTTGGAATGAATCCTAAGAGGCAAGGGGATCTTAAGGACAAGGAAGAGAA 

GAGAGAGGGGGNGGGATCCCTTTGATCTCT7TCTCTGGNAATCTTAMTGCNTMTTTTA 

CTAAAACATGTTCTCAATTCATTCATAT 

Sequence 162 

CCCCGCGTCCGGATTAATGAGTGTATGCCTAGCTCTTTCTCCAGTTTACTTTTAGACCAT 

ATTGTTGTTTGTTTTGAATATCATTCCTTAGGCTATGTTGAGAGTAGAGTGGCTTCCCAT 

TAGGAGMCTMTTTAGGGCATGTCTTTTGCTGAATCCCGTCAGCATATTTAACAAATTC 

CCMTTCTAGATMTTTCCTTTTATTTCTCTAGTACCCTTTGCCAGGGGCTCTACACATC 

AAAGGTGTTCATGAAGTATTTGTCAAAGGAAAGAACAGTAATGACACCTAACACATAATG 

AGTGATTAGTATGTTCCAGGCATTGCGTGAGCTATTTACTGTGAGTGATTTAATGTTATC 
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TTCCCAGCAGACCTCTGAGGTAGGGTACTAGTATGATCCCCATTTCGTACATGAGGAAAC 
TGACACTAAGGGACATAAAATAAGTTTTTGAAGTCACAAAGTGAATAAAAGGAAGACCAG 
GGTTTTAATTGGAAGCCCATA 
Sequence 163 

TAGACTTTTGCAGTGTTAAACACAGCTTCCTTAACTCTTAGAACTGGAAGTTGTAGAGCT 

CTCCTTrTGGTGCCTTTCCAGCCTTTATACACACTATTGTAGCTTTCTTAGGTTTGATAG 

GTAGCGTTTCAAGTAGTTTAGCTGAGACAGNGAATGTATTAGGTTCAACATGACCTTGTG 

TTTTATTTGTGTTTGCCAACAGGATGCCTTATTTGTTTGAGAAAAAGATGTACTAGTGTC 

ATTCTAAACTATCTCCI I I I I I AGGATTCTAAAGAAGTTAATCATCATCCTTTTGTTTAT 

TTTACCACCATTTAGTGCCTTAAATCCTATCAAGAAAGCAGTGTTACTGCTCAATGCCCA 

AATAAGACACGCGGATATTGCTATTGCTTGCTTTTGAGTTAACAGGCCNACTTTTTATAC 

TTAAAACCTCA 

Sequence 164 

GCCCCGAAGTCCCACTGTCCCTGCCGAGGCGCCGGCGCCGTCCCCTGTGCCCTTGACCAC 

GCCAGCCTCCGCCGAGGAGGCGATACCCCTCCCCGCGTCCTCCGACAGCGAGCGGTCGGC 

GTCCAGCGTGGAGGGGCCCGGAGGGGCTCTGTACGCGCGCGTGGCCCGACGCGAGGCCCG 

GCCGGCCCGGGCCCGGGGCGAGATTGGGGGCCTGTCGCTGTCGCCATCGCCCGAGCGCAG 

GAAACCGCCGCCACCTGACCCCGCCACCAAGCCTAAGGTGTCCTGGATCCACGGCAAGCA 

CAGCGCCGCTGCAGCTGGCCGTGCGCCCTCACCACCGCCGCCAGGCTCCGAGGCCGCGCC 

CAGCCCCAGCAAGAGGAAACGGACGCCCAGCGAACAAATCGGCGCATACGGTCGAACACN 

GGAAGCCCCCGGACCCGGACCCAACGCCGGGGCCCCCCG 

Sequence 165 

AGTCCGCCCCACGCGTCCGGTGAGTTTAGCGCTGCTGTCCGGATGGGTTGGTAGCAGACA 
GGGTGGAGTAGGGTTAAGCACACTGGTCACCTTAGGATTGGTTTCCTGGTGCTGGAGAAT 
GGTTAGGACACAGGCCTTGGAAGG I I I I I I GAGTGTGAAATATTACTCAGCGTTTTCTGC 
AGACCTCGCGGGCAATGCCGCTTCTAATTTTATCCAGGCCTTCTTCTGTAGGGAGGGCCT 
GTTAAGAGTTGAGCAGCCCGATTTCTGAACCCCTCTAAAAAGCTGTGGCTGATTGGTGGC 
I I 1 1 I I I 1 1 ICTTGGAGAGGGGGGTGTCAAAGATTTCTTTAAAATCGTTAGTGATGTGGT 
CTCGCTTTA 
. Sequence 166 

ACGCGTCCNAAAATGTGTGGTACATGGAATA I 1 1 I I I ATTATGCTTATTTCGTATTGCCA 
GGTAGATGGCCAGCCTGACATTCAAAATTATTTTATCAGCCCCTAAATGTTAATATTTCC 
CAAACTATTTAMTCAGTAGAAGACATTTTTACTATTAAGAATAAAAAGTTATAATATAA 
AATGGATTAAATGCCAGATTATATGCTAAACAAGTCCTTTAA AI I I I I AACTTAATATTT 
TTAACAGA I I 1 1 I 1 1 1 1 1 GAGATGCAGTTTTACTCTTGTTGCCCAGGCTGGAGTGCAATG 
GCACAATCTTGGCTCACTGCAACCACCACCTCCCGGGTTCATGCGATTCTCCTGC 
Sequence 167 

CCGTCCGCGAGGTTAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCCATCTCTA 

CTAAAAATACAAAAAAAATTAGCCAGACATGGTGGCAGCCTCCTGTAGTCCCAGCTACTC 

GGGAGGCTGAGGCAGGAGAAGGGCATGAACCTGGCAGGCGGAGCTTGCAGTGAGCCCAAG 

ATGGCGTCACTGCACTCCAGCCTGAGGGACAGAGCAAGACTCTGTCTCAAAAAAAAAAGA 

AAAAAAAGTGGCACAGATCAATTATAAATCACTGCTTCAAGGCCAGTGCTCTCACTTTGT 

ACATTAATAATCTCAGGCCCAAATAAGATAAGTGATATGTCAACGTATGTTCACTTTGGT 

CTTTACATGGCAGCTATAGTATACCGGAATATTATAAGCTCAGATCGTCATAGCTACATA 

ACTCCTTTAGTTGGGAAGANACGCCGTAAATGCCCATCAAGANTAGCAAGTCTTGCATTT 

GACT 

Sequence 168 

CGCGTCCNGGTAACTTGAATAAGGATTATGTGCCCCACCCTTACTCTCATTCCTGCTTCC 
TCTTGGGCTCAAACAGGGTATGAGTATGAAGATTTTGCCTTTAGTTCCTGAACTGAACCT 
GCT TGCTA TCCCTTTCCTCCCCACCACTACCTTATTCCTTCTCTGCCTCCAAATTGCCAC 
TTTGTTTTGAGGCTTCCTTCCCTACCTTATTATTCTGAAGGAAGTAGAGATCTTGCTTCT 
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GAAACCCCTCCTAAGAAACTGCCCAGGGACAAGATAAATTACAAACAATTCATGGGAGTT 
TACTACCTAAGTTGCTTCTAGGGCATATGTATACCATACTAGTAGTCTAGATTTCTGG 
Sequence 169 

AGGGAGTCGCCCCCGCGTCCGGACAGGATCTATGGATGGAAGAAGATGAAAACAAACACA 

CAGCCATAAAGGTMTTGTTTCTGGCCMCATCTTTCTACTGATGCTTTGTTTTGATTGT 

ATGTTGCTGTTTATATrrrCTCAAACTTGAGGCTCTATTTTATGAAATGTTGAATATAAA 

TACATTGTATTTAACTTGAAAAATTCCTGGAAATATACCTGATAATTACCACCTGAGGAA 

TCGNTTTATTTTATGAAAGTAACAGCGTGATGAATACTGTAATTACAAAAGAAAATTAGT 

ACTCACTGACTTATACCCTTG 1 1 1 1 1 1 1 I I 1 1 1 1 I I I GGTTTGG 1 1 I I I I I I GGTTTTGN 

TTTAGGNGACA 

Sequence 170 

GGAGTCGNCCCACGCGTCCGGCTGGCCAGCAGACCCTGCAGCAGTCATTGGTGGTTCTGT 

CTTTCCCTAACCCTTATGNGCTGGAGGACAGAGGAGCCACCTCTTCTCACTGGTTGATAC 

TCTGTTCCTAAGACCTCAGGTTACCCAGTGTAGCCTGCTGTGACGCTTCTGCTTCATTTT 

CTGCCAAGTGATACAGAACCTTCGTTCTGCTGCTTTGGGGACAGGTGATCCTAGCCCCAG 

CTTAGGGCCAGTGCTCTGTGCCACTCTGGAGAAATAGGGAAAAGATAGGGGTGGCTCAGT 

ACAGCAGCCCTGTGAAAGTCAAGGCCAGAGCTTTTC I I I I 1 1 A A I I I I I I I ATT ATT ATT 

ATTATTATTT 

Sequence 171 

TTTAGGGAGCCGACCCACGCGTCCGCTTGGCAAACCTCCGGGGACTGTCAGAGGAGGAGA 

GGAGCGAGAAGGCTATGCTTCGCTCCCGCATTGAAGAGCAGTCCCAGCTCATCTGCATCC 

TGAAGCGGAGGTCAGATGAGGCCCTGGAGCGCTGCCAGATCCTAGAGCTGCTCAATGCAG 

AGCTGGAGGAGAAGATGATGCAGGAGGCTGAGAAGCTCAAGGCCCAGGGTGAGTACAGTC 

GGAAACTAGAGGAACGCTTTATGACCCTAGCAGCCAACCACGAGTTGATGCTCCGCTTCA 

AGGATGAATACAAGAGTGAGAACATCAAGCTGAGGGAGGAGAATGAGAAGCTGAGGCTGG 

AGAATAGCAGCCTCT 

Sequence 172 

CCACGCGTCCGCTTAGCCGCTGCCAGAGTTCCATATGTTCTGGAACCCTTGACTCCTAGA 

GTTCAGAACCCAGCCAACTTGCAGTTTTCAGAATGTTCAAGAAACTTCTGACACTCAGAG 

TTGCAGAACCTCCTGGTCCCTGCAGATTCCTGGAAATCAGAATATGGTGGTTGNAAAGAA 

TCTTGTGGCTGGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCG 

GGCAGATCGCCTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAATCCCGTC 

TCTACTGAAGATAACAAAAATTAGCCGGTCATGGTGGCGCCCGTGCCTGTAATCCCAGCT 

CGGCAGGGCCGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAAGAGGTTTGCAAGTGAGC 

CNAAAGATCGAGCCACTGCACTCCACCTTGGGTGACCAGAGTCTTAAAAAAAAAAAAAAA 

AAGG 

Sequence 173 

CGTNCGGTGAATATTCAGTMGCTMTTAAAATTTGTAAGTAAGATTTCCTTATTAAAAA 
AACTATTAAAATAAGANGAAGCCCGTTNAAATAATNATTAGAGNNGGAAAAAAGAAGATG 
ACTATCNMTTACAGCACTTTrTTTCAGCTATACATAAAGGCCTTTTCANT 
Sequence 174 

CGTCCGGTGACCCATTMGTATATTTCGTGACCCANAGTTTGAAAGAGATTATAGGTGTA 

GCTTTGCTAGTTTGTAATTGATATAGAACAGTGACTATCAGGGAAGNTGAAGAACGGCNA 

ATTGAATGTAAATCATGTCTGGATGGTGAAGATTCTAAGAATGCANCTAGGGAAAGGGCT 

GCAAAAAGAAGGTGGCAGACTAATGTAGAATGGTGCAACCAGATGAAGACATGGGTGGCT 

TTAGGAATTCCAAAGTGGCCGTGAAGGCCAGGCACGGTGGCCCACGCCTGTAATNCTAGC 

ACTTTGGGAGGCCAAGGTGGGTGGATTGCTGAGCTCGGGAGTTCGAGACCAGCCTGACCA 

ACCAGGTGAAACACCATCCCTACAAAACAT 

Sequence 175 

GGCGGCGGCGGCGGCGGCGGCCGGGACCCAGCGGGCCAGGTGGGGACGGCGCGTNGCGGG 
TGCGGGAGATGCCGTGCGGGACTGGGGCCACNTTGAGCCGCCCGNCTCGTCCCCGCCTTC 
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TGTGGGAAGGATGTGCGCGC6GATGGCCGGTCGCACAACAGCGGCCCCTCGGGGGCCCTA 
CGGCCCCTGGCTCTGCCTCCTGGTGGCCCTCGCCCTGGACGTCGTGAGAGTGGACTGTGG 
CCAGGCTCCCCTGGACCCTGTC 
Sequence 176 

GAGCTGGCTGGTGTTTGAGCTGTGGCAGAAGCACCTGGGGCTCCAGGGAAGCANGCTGGG 

AACTGCAGGACCTTGCTCAGCCAGGAGCACTTCCCCCTCCTTGAGGCAGGAATACTGAGG 

TGCCTCCCCACAGATGGAGAANGTGGAGAGGAGGATGGGCCTCAGGAGCATCTCAAGCCC 

CAGTAGCAGGANAAAGAAAGAAAGAGATGCCTGGTTTTCACAGACTGGTTCCTGTGGCTG 

GGATGACTGCATCC I 1 1 I 1 1 1 1 1 1 I I I I 1 1 GAGACGGAGTTTTTGCCTTTGTCGCCCAGG 

CTGGAGTGCAATGGGGTGATCTCGGATCACCGGAACCTCCGCCTCCNGGATTCAAGCAAT 

TNTCCCGCCTCAGCCTCCCGAGTAGCTGGGATTACAGGCACGCACCTCCACCGTNCGGCT 

AATTTTGTA 

Sequence 177 

CCTTGTNAGGGGACACAAAGAAAAATTGAATAAACTGTATGATTTAAAAGATTATCGGGA 

GAGTTACCTCCCGATATAAAAGGAAGGATTTACAGAATGTGACCTAAGGTCTGGCGTAAA 

TGTGCACCGGAACCGAGAAGGCCCGGATTGTCATGGACGATGAGATACACCGGAATATCA 

TGGACATATTCTTTAAAGCGCCCTTTATCTTCAAATGCGGCACGGAAACCGGAGGCTTTG 

AAGAACTCAAGGAAGCGCGGCACGATACCGCCCGCAATAAACACGCCGCCAAATGTCCCG 

AGATTGAGCGCCAGATTGCCGCCAAAACGGCCCATAATGACGCAAAACAGCGACAATGCG 

CGGCGGCAATCGGTGCAAGCTGTCAGCCAGCGCCGCGTTCGGTAATATCTT 

Sequence 178 

CACGCGTCCGACCGGAAATGCTGACCTGACCTTTGACCAAGTACGNGCGGTGGGGGGGGG 

GGACAAGTGGGGTGGTGGTATTAACTGGCTCCGGTGGGTCTTCAAGCCCCAGGAACCCTC 

CAAGGGGGAAACAAATGGAGGGCCCTAACGCAAGAAGCTTCAATCGGTCCCTTGGACTGG 

GCTTCTTTCGGTGGGTTGGTGGGTATGGCCCTGGGCTGGCCTTTNCCTNAATCTTTCCTT 

CCCTNCCTGGGGGCAATCTGGCTGGGTGGCCAAGTGGCTTGCCCCGGCAACAACTTTGGC 

TGGCTGGCTTACGGTCAANGGGTGGCCCCTGGCTTGCCCCAAGAACAAAGGTGGCNTTGC 

TTTGCCCCCGGAAGGGCCCCTNGTAATTGCCCGGCCCCGGGCCAAAAAAGCCAAGCCACC 

TTCMGGGTGGTTTCCCAAGCATTTTTAATTGCCCCCCCAAGCAACCCTTAATTGCCCAA 

CCCTTGTTCTTTCCCCGGCCCAAAAGAACCCCCNAACCCCCCCAACCCAAGCTTATTGGA 

ATTTTCCCAAATGGGGGCCCCCCTTGGCCCTTACAAAACCGGGGGTTTACCX)CTTTGGGG 

AGGGGAATTAACCCCCTGGGGAAGGAAACCGGTTTTGGACCANGGGGAAGGTAAGGCTTT 

AAANCCTTGGGGTNGGGGCCCAAAAGGGCTTCCCCTAAATTGTTAACCCCCCTTGGCNTT 

TT 

Sequence 179 

CGTCCGGAAAACTGTTCATCTACTCACTGTAGTGCCCTCCTTGAAATGTGTGTTTGTTCA 

TTCMCTMCMTATTTGGGGATCCCTGTAGTAAACACTGTATGAATTTTACACAGTCTG 

GCCATCAAGAAAGATCACGGAGTATATTCTAGATGGGGAGGCTACTAAGTGAATAGGAAT 

CACCACGCTGGGCTGTTTATTAGGTACAGTAATAAACATAAGTACTGGTTGCAAAANAAA 

ANAANAAAANAAAAAA 

Sequence 180 

CCNCCGCGTCCGMMGACAAGACAGCATACTGTATTTTTCCTCTTAAAATTCAATGTTA 

CMTTAAATGATTGTTNTCTGAGMTAAGTTAGCTTCAGCTTTCTAATCGATGTGTTCCC 

ACATCTACAAATTGATATGAAAAATTATTTTGAAATGCACACTGCAAAATGGTGAGAATA 

TGAAAGTTACCTGGGAATTAAATCAGAACTGTCTCCATATGACTATTTCCAAGTCACAAT 

CATAACTTTCTTAATAGCAATGGTTATATATGTGGCCAGATAGTATTCAGTTTCACAGTA 

ATGTCTCGGTCACATAAAGATAGCANAGCATAGACATAGTACAACAATTTATTATTTCTG 

CTGATTGCCAAATGTGCATAAAACTATAAAGATATATTTTCCAGCCCAGGTGACAGAGAC 

CCTGTCTCCCNTTTNAAAANCTTCATGNTAAAGGTGCGGCCGCTAGACTAG 
Sequence 181 

CGCGTCCGCTAATCAACTTTTAAAAATAATGTTTTACGGCCGGGCGCGGTGGCTCACGCC 
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TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCACGAGGTCAGGGAGATCGAGA 

CCATCCTGGCTAACACGGTGAAACCCCGTCTCTACTAAAAATACAAAAAATTAGCCCGGC 

GTGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTG AI I I I I I A I 1 1 1 I IGAACT 

TTTTTACAGAGAAGGGGGGTCTCCCTATGTTGCCCAGGTTGGTCTCAAACGTTTGGGCTC 

AAGTGATCCTTCTCACCAAGAGTGCTGGGGATTATAGGCATGAGCTGCTGTGACCGGCCC 

AAAATCAAATTTCAAAACTAAAAAAATTCTCAGATAATTAATGAACCTACGTGAATTAAT 

CTACAAATTCCAGTTTGAAAAACACTAAAGATAAACAACTGTCAATGTGGGAAACTTAAA 

AAGTANGTTGGTATTTAGGTTATTGGTTAAAATGGGGGACCGACTGGCATACACAGTCCT 

AAATATTTAAGTCTTAAG 

Sequence 182 

CCCTATCTTNCGGTACTGGTGGGCCAAATTCCTGGGACCAGGTCAAGGTGGGCTGCCTCA 

GTAAGAAGGAAGGACTGGAGAGTGCCCATTTCAGAGGAGCAGGCTGGTGGGGGCCAGCCA 

GAAAGTAGTTCCCTTTGGGGGGAAGATGTTGGACCTTTATTATTTGTGGTAACCAGCCGA 

GGCTGGTTGTCAGGACAGCAGGTGAGCCACTTTAGGGAAGAAAGTGCAGGGGTGGGTGGA 

TGCCCAGATTACCAAGGCCAGCCACCCTGATGGGGTAGGGTCTGGTTATCTGTGTTCAAG 

AAGCAAATCCCACCCCAGCCCCAGCACTAGCTCTCTATGTATGTATTTTCCCTGTACAAT 

GTTTTATAAAAGAGATCATTAATTTAAAAAAAAAAANAAAAANA 

Sequence 1 83 

TCGTCCATTTACCTCACTTATGGGGTAAAAGGTCACTTCAAGTAAGGTTAAAGGTTTTCC 

CTGGCAAAGGACCTAACCAGAGCCCCNAAGGGGGGGAAAAAAGGAGGTCACCTTTGGGGA 

GGTACCATGGCCCNTTTTGGTCTGGCCCNTTGGGCNTCTTCAACAACAAAGGAATATTTT 

ACCAGGCCCTTTTGGAGGCCTTTGGATMTTTCTTAAGAAATTTGGTTACCAGGAAGAAT 

TTAGGCTTCNTNGGNAAAAAGGAAAAATTAGGACCTAGGAAAGGGAATTAAAGGGGGNAA 

GGGG AAATCAATTAAGCNTTAATGGAAAGGGGGTTTTACTTCTGGCAATCAAGAACCGGC 

TrTTTCNTMGTTTCNTAATGGAACTTTAAACCGGTNCCTAATANGGGCTNGTAAANGGG 

GTTGTCNTGGCGGTGGNAMCCACCTTTCTTTTTCCTNGGGCCCTTCCCTTTTCTGGNCC 

CGCMTTTNCCCTTCTTTNAMCCCTTCAAAGTTTGCCCTTGGAGGGTTTTTAATTAATT 

CCCCCTTGGTGGCCANTTCCCTTGGGGGGCCAATTGGGTTTCAATTTCCANCAATTANTG 

GNAAAAACCAAANTCCAAGGGGAAGGGACCCCCTTNGGGCTTAAN I I I I I CCTTTTAAAT . 

CTTCCTGGGAATTTTNGGATNGGGGGAAAAAAAAATAMTTAAAATTCCTTTTTGGGGCG 

CCCTTGGCCAAGTNGGGGGGGAAAAATAATTTNGGNTTTTNGGGGGGGGGGNTTTGGGGA 

ATANCCTTACCAAGGAACCCCTTCCTTGGATNCCTTTTGGGGGGTCTTTTCAAAAAAT 

Sequence 184 

GCGTCCGGTTGTAGTTTCCCTTCCCATTCTCTTGGTGGCCCTGGAAGCTTCTAGGCACAA 

GTGTGCCACCCTGATTATTCCNACCACTCCATCCAACTTTTCTCTCTCTGTGGGTGTCTG 

CACCACAAGCTGCCTACCCTCCAGGTGCCTCAATGGTCCGGCCACCAGTTGTGCCTCGGC 

GCCCCCCGGCCCACATCAGCAAGTGTCAGGCAGGCCTCCACCCAGGTGCCACGCACGGTG 

CCTCATACCCAGAGAGTAGCCAACATTGGTACTCAGACCACAGGACCCAGTGGGGTAGGA 

TGCTGTACACCAGGCCGGCCGCTCCTGCCGTGCAAATGTTCCTCAAGCAGCACATAAGCA 

CCTATCGGGTCCAGGAGCCGGCTTGTGCACATNCCAGGACAGGAGCCCCTGACCGCGTCC 

ATGCTGGCTGCGGCGCCCCTGCATGAGCAAAAGCAGATGATTGGGGAGCGTCTCTACCCC 

CTTATCCATGATGTCCACA 

Sequence 185 

GTCGCCCCGCGTCCGGGCATTTGTATTTTCAACAATTGTTCTCAAATTTAGAAAAGAGAC 

ATCGCAAGATGGTGAAATAGGAAGCCCTGGGCCCTCCTTCCCTCCACAAACACACTGATT 

TGACAACAGTTCATGGACAGATTCCCTTTATAAGAAACCAAGAAACTGTTAAGAGGCTCT 

TATACCCCAGGTGAGTGCAAAATCATCCACATCAAAGATAGCTGGGAAGTTCAAGACACC 

TTCTTTCCGTAATTTCTAAACTGGCACAGTACTATATGATTGAGATGAATCTCCCCACAT 

CCAAGCTTCCTGCTGGGGAGGAGAGGGGAGGGTATACCATTATGTCCAATGTTCCAACTC 

CTCTAGGAGCTACCCAGGTAGGAGGTGGGTCCAGCTCTGTCAGGCTTGTCTTAAGAGCAC 

TGATTGAGGGTCTGGTATTCTCTAAGTGGCCCAGGACCATAAGAGCAGTGGATGGTGCTG 
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GGGNTNNGNNGTGGGTTACCCATAACCCCCTGGTTTTTGG 
Sequence 186 

TCCGAAGCAGTGAAAATAGTGTTTGACTCCATTGACAACCTAGAGGCGGCTCCTCATGAT 

ATCGGCTACGTCAAACAGGCCATGTTCCACTATTTCCAGGTGCCAGATCGGCTAGGGATA 

CTCACTCACCTGTATAGGGACTTTGATAMTGCACGTTTGCTGGGTTTTGCCGGAAAATT 

GCAGAATGTGCTCANCAGGGAGACCCCCTTTCCCGCTATATCTTCAGGAAGGCTGGGGAG 

ATGCTGGGCAGACACATCGTAGCAGTGTTGCCCGAGATTGACCCGGTCTTGTTCCAGGGC 

AAGATTG GACTC CCCATCCTGTGCGTGGGCTCTGTGTGGGAAGAGCTGGGAGCTGCTGAA 

GGAAGGGTTTTCTTCTGGCGCTGACCCAGGGCAGAGAGATCCAGGCTCAAAACTTCTTCT 

CCAGCTTCACCCTGATGAAGCTGAGGCACTCCTCCGCTCTGGGTGGGGCCAGGCCTAAGG 

GGCCAG 

Sequence 187 

CGTCCGCGTCGCTCCCCTGCTCGGGGTCAGCTAGTGTCCGCTCTGCTCGGCCGCGGGCTC 

CCGGAGGACTGCAGGCAGGATGACCGCAAAAACACGGGTGATTGGTGAATGGAAGTTGCT 

ATGGGCCTCTAAGGGCCATCCCAAGCCCAACCCAACGTTAAACGGTCCACAATCCACCAA 

GGAAGTCMGCTTTTGACACAAACTGCTGAAAAGCTGGGAGGGTTTCTTCTGAAAGAAAG 

TTTCTTTTTTCMCCCTGGGGGACACTGGTGCCCTTTCCACAAGCCAGGGAATTGGGTTT 

ATGAAGCAAGCTTGGCTCTAAGGGGGGTGACCTCAAGATATTTGCTGGGGGGTTGTGAGG 

TTTGGTGGTTCTTGGGAAGTGTGTCTCAAGCTTTGGGGGCCCTGGAACTGTGCTTGAAGT 

GCCCTCAGGCCTGTGCCCTTCTGGGGCCGGGGGTCTTGTGGGTGNATNCGCAAGCANGGA 

AGCCTGGGGGCCATTGGTCCATTNAAGAAGGCACCCCGGGGCCAAACCTTGCTTGGCTAT 

ANTATTCCAAGCCTGCTTCAACCCNTGGGCAGGCTTT 

Sequence 188 

TCGACCNCGCGTCCGGCTTCGACGCCTTCCCTAACATCGAGAAGGTGTCCAAGATCACGT 

CTCCCGTGCTCATCATCCACGGCACGGAGGACGAGGTGATCGACTTCTCGCACGGGCTGG 

CGCTCTACGAGCGCTGCCCCAAGGCGGTGGAGCCGCTGTGGGTGGAGGGCGCCGGGCACA 

ACGACATCGAGCTCTACAGCCAGTACCTGGAGCGCCTGCGTCGCTTCATCTCCCAGGAGC 

TGCCCAGCCAGCGCGCCTAGCGGCGGCCCCAACCGGCCGGACCTCAGCAATAAGGCGGCC 

CCCGGACCTCACCCCGCGCCGGCCCCCACCCAGGGGCTGCATGTGGACCCCCCGGGCGGC 

CCAGGGGAC CCCG CCCCGACCCAGGGGCTGTGGACGATGTACAGGCAACAGAGCTACCGC 

AC TCCTTT CCTTTTGGAAGCAAGAAGAAAATACGTGAAAACGGGAAATTAAAGATTTAAA 

AATTTTTTNNNNNTNNANAAAAANNAAAGTGCGGC 

Sequence 189 

CGCGTCCGAAGCCTTTTGTCTCAGAGAATTTATTGTCCTGACAGCAGAGGCCGATGGTGG 

GATCGACTGGCCCTTAATTTACACCAGCACTTGAAGCCGCCTGGAACCCGACTATCAAGT 

GCATCACAGAGGGGCTGGCGGATCCCGGAAGTCAGAACGGGACACCGNCTTTCACTGTAT 

CAGCGAGCCGTGCGCCTGCGAGAGTCTCCGAGCTGTAAAAAGTTCAAGCACCTCTTCCAG 

CAGCTCCCAGAAATGGGCTGTGCAAGATGTGAAACACGTGACCATCACAGGCAGGCTGTG 

CCCACAGCGTGGGGATGTGCAAGTCTGTGTTTGTGATGGAGGCCCGGGGGAGGCCCGCTG 

AC CCCA CCACGGTCCTGTGCTCTGTGGAGGAGCTGGCACTGGCCCATTACAGACGCAAGC 

GGTTTTGACCAGGGGATTCATGGCGAAAGGGTCCACCTTCAGCACCCTTGTATGGCCTTC 

TTNCTGTGGGGACATCATCTTCATNGGATGGGATTCCCGGATGTCTTCAGAAACGCCTGT 

CANGCATTNCCCCTGG 

Sequence 190 

CNACACATGCGGAATCATAGGCCTGCAAAGCTCCTGCTATTCACTATAACTCTGCCATGC 

CTTAGGCACTTCCTAACCTAGAATTCTGAGTGAAGGACAACAATAACTAATACTTTTGAT 

TCAGGTATTACAAAGAAGTTAAGAGTTCATAAGGCACCTAAGTAAAGTCACATTGGTTAA 

GAGTACATGTCTCCAGATACTCTTACATTTGCAAAGNAATTGCATTTCTGNATCTATGGT 

CTGTAMTAMATTGAAGAGTTGNGAGAATAAAAGCATGTTGTCTTTGATAAATTGTTTT 

TACMAACAGGCACAAGAGAGGCTTGMGGGTCCTTGCTATCTTTTAACCTATTTTATM 

TCTTTGCTGCATMGAAACAAATATGCTTATTTACATTCTATACTTAAACATATTATCAA 
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ACTTTTATTCTGAAGATAATACACATGAAGGCTTTAAACTTTAATCTTCAATATAI I 1 1 I 

TAATCTCCCTGATGAATTAGGGAAATAAATATTGGG 

Sequence 191 

TCTCCCCCTTCACCCTCCI 1 1 1 I I I I 1 1 I I I I I IGl I 1 1 I I CTTAAAAGCATTTAGCTGG 

GTGCAGTGACACATGCTTGTAATCCCAGCTATTGAGGAGGCTGAGGTAGGAGGATTGCTT 

GAGTCCAGGAATTTGAGGCCAGCCTGGGGAACATAGCAAGAGCCTATCTTAAAAAAAAAA 

AAAAMGCATTTCAATTATTTTAMTTTTNTMTTACMMCMTTCTTTCCTGTCTTAG 

TTTAGTTTACTTTTTACTCACAAAGTTCCTGGAAAGTAGTTATTACTCAATAATTGAATG 

CATGAGTGTCAACTGCAAAATCTATGCATTATGTAGNGATTTGAATCAATTAGTCT7TNT 

TGATACTCCAAAATTACCCTTTTTCGAGNGTCTTTATCANAAATTTGATAAATCGGAACT 

T 

Sequence 192 

TCCGCCCCGTGAGGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATG 

GTTGAGTCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGT 

GATGATGCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCA 

ACTGCATGGGCAGCTGTGGTGTGGTGTGATGATGCTGGGATAGGCACTCAGATGGTTGAG 

TCTGAGTTTGGCTCTGATATTATATCAACTGCATGGGCAGCTGTGGTGTGGTGTGATGAT 

GCTGGGATAGGCACTCAGATGGTTGAGTCTGAGTTTGGCTCTGATATTATATCAACCGCA 

TCATTTTGGGGAAGACACMTTTCTCAGMTTTTATTTAAGTTGTAAAAATAA 

Sequence 193 

ACACATTGCGGCACCGGGCTGGGCCTGGCCATCGTCAAGCATGTACTGCTGCGCCACCGC 

GCGCGCTTGGAAATCAGCATGTGTGCTGGGCCATGGCAGTACGTTCACCTGCCATTTTCC 

GCCAGCTCAGGTGACGCGCACACGGCTGGTCGGGAATGATGAATAACCTGAAGTGACGAC 

GACCCAATGTGGGAGCTGCTGATACTTTGCTGTCTCCGCCCAGCAGTGGGCACTAGGCAA 

GGGCCCCATCAGCCGCTACATTGGCCGACTTGCGCCTGCCTTTCAGGCCCTCTCGTCACC 

CCTTATTGAACACACGGAACCTGCAAAACCCCATCATGGACCCTTCCCCTGGTATTACCC 

TCGCTACACTCTTCGCCGATTTCGGCATGATTCTTTTTGCACTGATCCTGGTACTGCTCA 

ACGGT7TCTTCGTTGCGGCGGAATTTGCCATGGTCAAACTGCGCTCCACCCGGGTCGAGG 

CCATTGCCCACACCAACGGGCTGGCGCGGGCAGATCCTGCGCACCGTACACAGCGAGCTC 

GACGCCTACCTGTCGGGCTGGCAGCTGGGGTATTACCCTCGCCTTCCTGGGTTTGGGTTG 

Sequence 194 

TGAAATCTCCTMTACACTGNGTTTTTATTGTTATGTATTCTATGTTTTAAAGCTCCTCA 
AGGTATTGTTATTCTTTTGTATMTCAGTGTTTGTTTGGAGCTGTCTGTAAATTTTCCTC 
I I I I I IGCTCTTTAI I I I I ICTTTCAACTCAGACCTTCCAAMTGGGATTATTTTTCTTT 
TTTCTTTAGAGACATTTGCTTGATMCAGAATTTTAATTTGGCGGCTATTT AI I I I I I I C 
AGCACACTGCAGGTATTAMCTACTTGGCTTCATTGTTGGTTTTGAAAAATCAGCTGTTC 
TMTGTTGCTCTTTGAATATMTCTGCCTTTTTCTGTACTTGCTTGNCTTTAAG I I I I I I 
GTT 

Sequence 195 

CGCTCCCTGGTTTCTTGTCTCATGAANAAGAAAAAATCCTAACTGTTCTTGATGATCTTT 

AAGGCTCANAATGATCTGGACAGAGGTATTTACCTTGAAGCTCATAAAGCATAGGCCTTT 

CTCACTTGCAGAGTATCTTTCTGAAGCTGGACTAAATTGGTTAAGGCCACTGTACTTTTC 

CACTTTGTCTTCTTTCCTGTCACACACCCTATCCTTTCAGGCTGTGTTGGGATGGACAAA 

AGCAGTTCTGGAGTCCTAAGGAGAAGATGAGTGGGATATGTTTCTGTGACCTGCAGTCAT 

TTTAAAGTTTAGCTGTTGCTAGCTGACTCCATGTAAGAATACCTTCCAGGAATTTGATGG 

CTGTGCACTCTGGCAGTGCAACTGGCATGGT 

Sequence 196 

CNCGCGTCCGGCCCTGATTGATGAAGCACAGTCAGTAAATCATCTCTTCATTCCCCAGTT 
CTTAAGCCAACATCAGCAACACTGAGAGAACATTAGATTAAAGGCAGGTATAGAAGAGAG 
ACTTAGGGTAACAAGTTAGTGGGTGCCTGAAGGCATGTGGGAAGAGATGTGGTAAAGGTG 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

36/467 



TGCACACTATTCCTTACGTGACTTAAAATGTCAAGGGGCTTGGTGTGGTGGCTCATGACT 
GTAAMCTAGAGCACTTTGGAAGGCCAAAGCTGGAGGATAGCTTGAGGTCAGAAGTTCAA 
GGCCAACTTGGGCAATATGGCAAGACTGTGTCTCTACCAAAAAAAAAAAAAAAAAAA 
Sequence 197 

CCGTCCGGAGAGGCTGTTNTTGATNGATACTCCAGAATAGACACGGGCCCTTGAAAAGCT 

TGTACCTTTGGGGAAAAGTAGACCAGGAAAATCTCAAGCCAGAGAAGGACTATGAAGCTT 

CTTTTTCCAGTTACACATGGAAATAAGAGGTCTTGCATGAGAAACCTAAATCAACCCCTT 

TNTlvrrTTAAGCAGGTTTTGGAGTCTAAATTTATACTAATTGCATTGTGGAGCAGGTCTAG 

CCAAGAAATTAACATGACAATCAGTCTCTAATCTGTCTGTCTATCTACTTACTTACCTAC 

GAGTCAGGCCAAAGAGATTCCCACAAGTAGGTTAAAAAAAGTAAAGTCTGATAACAAATA 

ATTAGTAAGAATGAATGAATAAATAAATAGCACAAAGGAACACTCTACCACAAGGAGGAG 

TACTCATCACATCAAATGAGAGAATACCCTAAGAAAGTGAGATAATGCCAGCAATCTGAA 

AGATATTTTACCNAAACGTGTTTAAAATGNTTAGGTTATTAAAAAAAAAAAAA 

Sequence 198 

CTMGCTGTTATTTTCCCTAAAAATGCTTCCCTTGCATTTATACTTATTAAGAATAGATA 
ATAGCTAACATCTATCCACTGCCTTTGATTCATGAGGCCCTCACGAATTGCCTCATTAGG 
TCTCCGACAGTATGGTACAACTACTCTTTATATTTTACAGATGAAGAAACTGAGGCTGGA 
CTGNTACC 1 1 I I I GCTACACATCCTAATG 
Sequence 199 

AACCTGTTTTGTTAGATGTGAATCTAGGAAATACAATATAI I 1 1 I AATGTAAAAGNACTC 

TTGCTTTACTTGTAAACTGATTTTCG 1 1 I I I I I CCCCTCAGGCCATCAAGCCCTGTCGTC 

CT ATGAC CAACAATGCTGGCAGACTTTTCCACTACCGGATCACAGTCTCCCCGCCTACGA 

ACTTTTTAACTGACAGGCCAACTGTTATAGAATACGATGATCACGAAGTATATCTTTGAA 

GGATTTTCTATGTTTGCACATGCCCCCCTGACCAATATTCCACTGTGTAAAGTAATTAGA 

TTCAACATAGACTACACCGATTCATT 

Sequence 200 

MTGCMGAACATCTGGATNAAATGGCTTTCTAGATGAGAATGGTTGCATTTTTTAATGG 

CTATTCTGGTAGAAAGGACAACATGTGATATTCATCGACCCTCTTTCTCAGACCCTCTTT 

ATAACAGCTAGTATGGAAAAATCTGTCTTTCCTATAAATATTTTCCCTGGGGAATGGNTT 

CCCATTMTGTGTNGGATGCTTTGNTGTTTCTCTCTTGCACAGCGTGTATGTTGTCATCT 

GCACTTAAGAAGTGGAATGGAATTGGGAATCCTGGTTTCTTGCTCTGGCTGGAGGTCCCA 

AGTCCGCATTGTGATTTTGGGAAGTCACCGTTGTTCTTCTAGATCTAATTTAACTCATCT 

GTAMANMNNNGGNTNGAATTCCGACMTTTCATGGGTTTCATTAGATGCCAATATTTT 

ATGACTCATGATCCCAGCNAGACTACACCTATTTATMGGCTGGTTTTGCTTGTTTTTAC 

TAAGAGCAACAATNACTACATATTTTCAGGTTACTCAATCATCAAAAAAATTATAAAATC 

CATAAACACTTTGGATTTGAAACATTGCAACTTTG 

Sequence 201 

CGTCCGAAAAGCAGTCTTTCTTGCTCAAAGTATTAANGGTGAACAATTGAATAGAGTACT 

GTGGTCGGGAGACTGATTTGAGACTGCAGAGCTGATGCTGGGTAGAGGGTCTGGACTTGT 

ATTCATGTTCTGTCTCAGGGCAGCCCCTGGAGCAGGAGATGGCAGAGGCATTTACAGCTG 

CAGAAAACAGGGAGGAATGGAATCTGAGGTAGCCCTGGCCTCAAAATTCAGGCCTGGCTG 

TATCATTTACAGAGATTTTTCTGGAGGGAAAAAGTCTCATTTCTGAGGAAGGCAAGGNGG 

GCTAATCATTATTAA I I I I I I I I AAACTTTTTG 

Sequence 202 

GCGTCCGGTTGAAGAGGGCAGGGAATAGGGGTGGGTGAGCGTGAACAGAGTCAGGCTGAT 

TGCTGCAGGGTCCCTTGCATTAGTTCAGGTGAGAAGAGACCCGAGTAGGCCAGTGAGCCT 

GGAGGAGAGGCTCTCTGTGTGTTTAATTGGTTTCCAGC I I I I I I I CTCTATTCATGTAGG 

TTATACACGTTTCTTCTGTGAA I 1 1 I I A TTTAAATGA 1 1 I I I I GTTGTTACTGGATCTAC 

AAACAGCCCAACTCCAAGGAATCTGGCATCTCTCAGTGGAGCATACAGGTGACTTCATAA 

TCTAACC GCAT TAGTAACTGCCAAAATCGGAAGTAATTTCTCTCTGTTTAAAAGGCAGTG 

AMCAMTTTTCAGAGCAGGTTTCTTCAACTGAACAAAATATTTTGGACCTTAAAGGTGG 
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TATGGCTCTCCTGATCAGGGAGGGACAGTGAAAGGTTTGAGCTCTGACACTGNCCAGCTC 
TCTGGATACAACCAAGTGAC I I I I I I I IG 
* Sequence 203 

CGCGTCCGAGTATATAGAAAAACCATTAATTTAGACTCTGTGAGATTAGGTTGCATGAAG 

MGGTTTTCTGAATATTTGAAGAGTGGATAAATAAATGTCCCCCAAAGCAATAAAATCAT 

AATCCTTTAAAATATAGGAAAAATAACTAATGGGAACTAGGCTTAATACTCGGGATGAAA 

TAATCTGTACAACAAACTCCCATGACACATGTTTACCTATGTAGCAAACCTGCACATGTA 

CCCCTGMCTTAAAATAAAATTTAAAGTAATAATAATAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 204 

CGCGTCCGCTGGGAGGCTGTGGGGTCTGCCACCCAGCAGATCTGTGTCACGGGAGTGGCG 

CTGTCACTCGTTGAGGTGGTGGCCTGGTTCCTTTGGCCTTAGGGAAGGACAAACTTCAAC 

TCTGAGCCTTGATTGAGTGACCTTGGCCMGTTACCTAGCTTTTCTGAGCCTCACTTTTT 

TGGCNATTAGATGAACCAGAGGTTTATTTCACTCAGAATCCTGTTCACGATGCTGGTATT 

TGGACCAGCCTGCGGGTTTATCCTGGGCTCTTTCTG 

Sequence 205 

CGCGTCCGAAAAAGGATGAGAAGAGAGGTGCATTCCAGAAGACAAAAGGTGTGTAGTATC 

AGGATAAGGGGCTTTAAATATCAGATCCAGAGAACACTGCACATGTAGAAATGGGCTTGG 

CCTGGGTCAGGGCATTGAGATTGGTTACATMTCTTTTCAAGGATTGGTGAATGAGTTGG 

AGTATG TGTAG AMCCTACAMGATGACAGTTTMTCTCATGTCATAATTTTTAGACAAA 

TMTGTATTTTAAAACTGGGTGCAGTTCCTAAAGCTGTTCTAAAAGTCAATGCAACTGAA 

TTTGGAATGTAAGCATAGGACAAACAGATGGGAAATAAGTCATGACCTCTGTGGGATAAA 

GTGAGAGTTATCAMGAATGTCAGTGTTTATMCMGGMCAAGCTTGTTTTGGAGAATT 

ACTAGATATTATGGAAAATTTTTTTCTTTTCTACATTTGGGTAACTATAGCTGAACTATA 

GCAGATCATATTGACTTGGCAAAAAA 

Sequence 206 

CCNCGCGTCCGGTTATGTACTTTTGGAGACTTCCATTAGAAATATTGGCAAGTCCCTGCT 

TCGTGGCCATAGATTTAAMGGCCTATCMTTTTAMTGTTTCGGTCATTGAGAGCTAAA 

ACATGTAACATATCACAGTGTTATTCACCAGAAATAAAAAATCAAGAGTCTGCTCAGAGT 

AGGTTMTATGAGTTCCTTTCTTCAGTCCAGCTGATGGTTTTTAGTAAGATGAACTGCCA 

AGGAGACMTGAGCACTGACTTCTCGATGCATGACTTCATCTTGTTAGAAGGTGGGTTGC 

CGGGCCGCGGTGGCTCACGCCCGTAATCCCAGCACTTTGGGAGGCCGAGGCGGCCGGATC 

ACCGAGGTNGGGGGGATCGAGACCATTCTGGCTACACGGTGACACCCCGTCTCTACTAAA 

TATACAAAAAATTGCCCGGCCGTGGTGG 

Sequence 207 

CCNCGCGTCCGATGAATAGTTAGCCCATGATAAAGGAATAAAAGGATGAAGAATATTTGA 

AGAGAAATAAATCTTCCTCACTCCTCAGGTTCCCTTCCATGTGCAGGAGCCTCAACCTAC 

AACTAGCAACCTTATCTCCTGACTCATTCCTCTCTAGAGGAGGAGTAAATTAGTCAACTG 

ATATGCTCTGGAAGAAAAACCCA 

Sequence 208 

CGTCCGGTCTTTCCTCCCCCTAAATAATGCATTACAAAGTGGAAATGCAAATTTCCTGTG 

CAAGCTCTMGTAGCAGGTGGTATTTCCTTAATATATTGTTTTTGACCTTTGGGGAMTT 

GGTATTACGAGCTGACTTTGGAMATTAMATAGCATCMGGTCCTACATTTTAAATAAA 

A CAATC G ATATC TTAATTTTTAAAATCAGACTAGATTACGATACCAGGAAAAGACATACA 

TATTTTGCTTTTATGTGTTAMGTTTTGTAATTCAGGGAGGACAAGAAAAGGGATATGGT 

GCAGCTGMCTTTCTAATTCACTAAGACAGGAAAAAAAAAAAAAAAAAAAN 

Sequence 209 

CG TCCGG GAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGCAGGAAAA 
CTATTTTAAAMTTGACCCAAGAGTTCAAAAGGCATATGGAAGCATTTAATGGGGGTGGG 
AGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTGGTCTGCA 
AAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGACTGTATA 
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CCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCAGATTTGA 

ATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTCTACACCC 

GA TGAGG CCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCCTTAAGTT 

ATATTTTAGAAAGATTTCTGTCTAAMTAGATAAACTTGAAAGTATAGCTCTTCAAAATA 

ACGTATTCCTGTGTTGGCAMTATTTTCCAAAGTCACAATCAACACATAGGTGTATTTCT 

TAGACTACTAGAAGTGGGGACTTACCCCAA 

Sequence 210 

CGCCNCGCGTCCGGGAAAAACAAGGGTTTCCGCCAACAGGCTGAGAGCAAAGGAGGACGC 

AGGMMCTATTTTAAAMTTGACCCMGAGTTCMMGGCATATGGMGCATTTAATGG 

GGGTGGGAGGTATCCTTGTAATAAGAATACCATGCATGTATTCCCACACTGCTCTTGGTG 

GTCTGCAAAGTGATTTCATATGTATTTTATGTCAACACCAGCACAATGAGGTAAGTAGGA . 

CTGTATACCTCAGAGGCATTTGGTGATTTGTCAGAGTGGAGTGTAGTGTGTTGGTGCCCA 

GATTTGAATAGGATCATTTGAGTCTGATATCATCATGTTGCCCACCGCCTACTCAGCCTC 

TACACCCGATGAGGCCAATCTGCAGCTCACTACAGTCAATAGAGAACAGGCAATTAACCC 

TTMGTTATATTTTAGAAAGATTTCTGTCTAAAATAGATAMCTTGAAAGTATAGCTCTT 

CMAATMCGTATTCCTGTGTTGGCAAATATTTTCCAAACTCACAATCAACACATANGTG 

TATTTTCTTAGACTACTAGAAGTGGGGAC 

Sequence 211 

NCGCGTCCGGTTTCNTTGGGATAGATTTTACCTATGMTTCCTCCTTAGAATTCTGAAAT 

TGCTCAGATTTACCCMATGACAGCCAGTTTCTCATTTCACATTTGGGGGCTGTAGAATC 

TTCCAACATTGAGAACCTGTTTTAATCAAAGGATGCTTTGTGGAATCCTGAATGAGGAAC 

AGCATGTTGCAGGAAGMGAGAAGGATCCTGATGCCCTAATGGGACTGATTTCCTTTTGG 

GGGGCAGGAAGATATATATTCGTTGGGTGCTTATAAAAGGTTAATTCCAAAGATTGTGTA 

TGGTTAAAGGACTGAAAGTCACACTTAGCCTCATACTTCACTTAGATGAAAAACAAAAGC 

CTTCCTCTCCATTACCTTGTAAGATCTATTCCTTGTGTCTTGTGCTGAGTGGACCTGGAA 

TAATGGATAGCCCTCACTGAGTACCTAGAAGGGGACTAGGGGTGGGTGATGAAAGGGGGT 

TCACACCGAAGATCTAAGTGCTAGCTTGGGTA 

Sequence 212 

CACGCCCCGTGGCCTTGCTAGAGATCCATATAATGCAGTCATGCTGTTTCTTNCTCCATA 

GTATGTGGGGCATGAGGAGGAGACAGGGAGAGGGTGGCTTCATTGNGCAMNGNGGAATG 

GCTGTGCTTTGGGGCCAAGGAGATGCTGTCCTGCTGTAGCTGCTCTGTGAAAGGTCAGGC 

CTGCCCCTNTGAGGCTCCCTTTATCCTCCTAAATTCTGGGGCATCTACATGACGCTTTCT 

AGTCCACCTTTGCCTNCGCAGATCATGGCTACTAACCTGACCTTTGTCTGTACTTGAGCA 

CCCTTCGCGATTTAACTTNCATGTANCGTCCGACTTCTMTATGGATTTGAATTTNTTGA 

CTGTTACTGCTCANAACAATCACCCTTTTTTGAGCAGNGAGCTGGNAGGATAATTGCCGA 

CAAATGACATTNGGANCCGTTTTNAACCACAGGGGGCATGGGG 

Sequence 213 

CGATCATTTTATTAGAGTNATGTATTTAAGAACTGATAAATCATGGGCTTACCTACACAA 

TGTCTAGACACATGAGCAATGAACAAATAGCAAGGTCTGTGATATCTCATATGGCAATAC 

TAGGACTGAGATTATTTTTGTTACAATTAAATAATTGTCAGTAAAAATCCACAGAGATCA 

TTTAGM TGGG MAAAAGTCTGATATATTTTGTTTCAGATTATAATCATTAATAGGGTAC 

CTGACAGTTTTCAAAGTTGTTTAATGTTTTTTAGGTCTTTMCCTTCCTAATCCT^ 

GTGGTTACAATCCCACATATTATCCTTGTGTCAGGGGTCTCCAGCACCTGCCTTAGGGTC 

AGTGATTTGCTAGAAGTACTTACAGAACTCAG 

Sequence 214 

ATCTGACAGCCTGGAACNGCACCCACACCCCCAGGTGAGAATCTGATGTTCTGGAGCATC 

ACACACAACCACAGGTGAGCATCGGAGAGTCTGGAGCAGCACCCACAACCCAAGGTGAGC 

ATCTGACAACCTGGAGCAGCACCCACACCCCGAGGTGAGCATCTGACCTCCCGGAGCAGG 

ACCCATACCTCCAGGCGAGCATCTGAACCCATGGAGCAGCACCCACGCCCCCAGGCGAGC 

ATNTGACCGAACAGAGCAGCACCCCCTCTA 

Sequence 215 
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TCCGGGAAAGACTCTGAGAAAGGACTCCTGACCTGGCTCTTCCAGAATGAGGATGGATTT 

TCCATATGAAGGAGCCGGGGGCAGGACCTTCTGGGTAGAGCGTGAAAAGAACACAGTATG 

TGTGGAGAGCGGGAAATTGTGTTGAACTGTCTAGAAAACAGAGCCACAGGAATGCGTGTT 

GAGGGCTTGGGCTGTACAGTGGAGAGTGCCCTCCCTGGCCAGGGAGTTGGACATTCATCC 

CACCACAAGACCCCATGAAGAGTTCCTCAACAGCTCTGTGTCCTCATCAAACCTGTGTTT 

GCAGAACAGTGGAGGAAGAGAGCTGAAGGAGGGGAGAGGCCCTGCACCTGCCAGGCCCTG 

GCCTAGACTACAAGGGTGAGCACTGAGCCATGCTCTCGGGGAACCTTCACTGGAGTTGAG 

GGCAGTGAGAATGTTTAAAAAAAAAAAAAAAAA 

Sequence 216 

ACGCGTCCGATGCACACTTGTCTTTTACCACANGGGTGGGGCGTGGGANGGAGGTTTAGT 

TTGGATAGCCACGTAAACGCCTTTCCCTGTGGCCTGCGATGTTCCACACCGTTTATGTGT 

GAACTGGCTGCACCCGCGCCTCCCGGACGGGGCTGCCAGGGAGGAGGGCCCGGGAGACCC 

CATCCAGACCCCGGCCCGCACGCTGCAGAGGTCTGCTCTCAGACATGTGGTGGGCTCCGT 

GTCACGGGTAAGGGGTCGGGGTCTAGACGGCAACAGAGTGTCTCTCTCTTCCCGCTCCCT 

GGTGTGCCCACCTCCCTGTACAGTGTTTCTGTTCAAGCTGCTGCAGGGGACGGGGCATTT 

TTCGTCCAGACTCTATTTTCCTGCAAGGAAGAGCTGCTGTCCTTTTTCTTACTGAAGCCC 

CTGATTCTGTGTCTGATGTTGCTGACCGCCGTGCTTGCTTTCTTGCCCGTGTGCAACTTC 

CAATCCCAAGCACACGTGCTCACTTCCAAG 

Sequence 217 

GCGTCCG GGGA ATGGCTGTNAGTNAAGTTAGAGGTAAAAAMTTTCATGTTMGATTTTG 

GMCTGGATTTTATTTAAAATAATGATGCGAAGCCATTGAAAGGTTTTTGGTGTGACAGG 

ATAAATT TAAAA TATGAACACACCAACGCATACTTCTTTTAAGAAAGAACCTGATTAAAT 

TTGGGAATT TTAAAATAAAAACAGGAAGCATATCGTACTCTAATATAATAATTCAAGGGT 

TTTTATTTTCCTAGMGATCAAGGTCATGTTAATAAAGGGAATATAGTTTTCTTATCTGT 

GTTMGACACTGATGACTTGCAAAGAAAAGTAACACTTTTGTGATATCCTTAGGTAATTC 

AAGAGGAAACGCTTGAGCAATTACTGATGTTGTAAACTGGGATCAGAAGACATA 

Sequence 21 8 

NCCCCGCGTCCGCCCAGTTGCAAAGGAGATGTTGTAGGATGTTAGGTCTCAGCACAAGGA 

ACCCAMCCTTCAGGGGCTCTCCTCTACATTATGCTCCCATTTTTCTCCCAAAATATCGA 

TCTCCACCCACCCTAGACATAGAACTGGAGMTMGTTCCAGTTTCATCCCTTTGCAGAT 

CTTAGGGGGACCCATCAAATTCCCAGCCACTGGGTGAAAATCAGCAGCTTCTTTATAGGA 

CCTGAGTTGCCTTCCTAGAGGATCCTAGAGGAAAAAAAAAAATCTTATCCTTCAAATACT 

GCTGTCTTCCAAAATACGTAAGGACGCCACGGTGAATCATAGTGGACACCCTGCATTGGT 

TGGGTTATTATTTATCCTAGMGCTTGGGTTCTTGGAGCCCTAGCTTATTTAAGCAACAA 

AGTCCCTCACAGCCCACAGGTGAGGAAGTGAGTGTAACAAAGAGATCATTGGACCTAAAA 

TCAAAACACCTG 

Sequence 219 

CCCGAAAGGAGGTTTGTGGAACTTGGAGAGATCCAGGAGGTAACACCAAAAAGCTGCATT 

TAGCMTGCCTGCCTAGCCCTCCTGTACAGCTCATGTTATTTTGTTGCCTTAGGCCATCT 

TACACCAMCCATTCCTATCTCCATGCTTTTGTTTATGCTGTTGCCTCCTTTTAGAAGGG 

CCACGCTCCACCGCTCTGCTGTGTTGATACTACCGACCCTTCCTTTAGTTCCTGTTCAAT 

TCCCAAGCCTTCTGCCAAGCCTTCTTGGACATTCCCATCCCATGCTGACTATTCCTTAAG 

CTAAAMCCCTTAGAGTTAATATGATACACTTGGCTACCTCATTGTTTCATATCAGTCAT 

TCCTGTCTCTAGCTATAATCGGCTCAGCCAAGGAAGATATATTTATATGGACAATGTCTT 

TGTGCCTTGGCTGTAAACAGTGTTGAATAATTAATCTCACTTGATGAGGTCTTACTTAAT 

GATAGCCTCC 

Sequence 220 

CTCGTCCGGGCTGGATCCGTCTGCNCCACTGCAAGGGCAAGATGCAGCTGGTGGCTGACC 
TGCTGCTGCTGTCGAGCGAGGCGCGGCCCGTGCTCTTCGAGGGCCCCGCCTCCTCTGGTG 
CCGGCGCCGAGTCCTTCGAGCAGTGCCGGGACACCATCATCGCGCGCACCAAGGGGGTTT 
TCATCCTTACCCACGACGTGCAGAGCCAGCTCAACATGGGCCGCTTCGGGGAGGCGGGGG 
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ACAGCCTGGTGGAGCTGGGCGACCTGGTGGTGTCGCTGACCGAGTGCTCGGCCCACGCGG 

CCTATCTGGCCGCTGTGGCCACGCCGGGCGCCCAGCCCGCGCAAGN 

Sequence 221 

GATTTTGGCTNCTGCAACCTCCGCCTTAGGGGTTCAATGCAATTCTCCTGCCTCAGCCTC 
CAGATTGGGATTACAGGCATGTGCCACCGTGTCTGGCTAATTTTCTTGTAATTTTAGTAC 
AAAAGGGGTTTCACCACATGGGCCAGACTGGTCTCAAACTCCTGGCCTAAAGCGGATCCA . 
CCTGCCTCGACCTCACAAGGTGCTGCGATTACAGGGATTACAGGCTGGGATTACAGGCAT 
AAGTCACCACACCCAGCCTAAAAATTAAAAGTTTATACTGTATTGTTCACATTGTGATGC 
AACTTTCCATATGTATTTCCAGAATCAACTATGTATCAAGGAATATTGAAAGCATAAAAT 
GAGATCATGGTGAATTCTCTCCCATTGTCATTCTTGNGGTAAGGGAATGAATGGTGG 
Sequence 222 

CNCGCGTCCGAGTTATTNATATMAGAACATTTTTCTGTTTTAGAGAGAACCCATTTATT 

TGTGAMGAAACAGAGTTTTGTTCTTATCCCTATAAACACTGCATTTGCTGTTTCTTCTC 

TAGCTGATGTGACATATTAGGGAAGCAGCACCCGACTGGGACTCAAAAGACCTGGGTTTG 

GAAACTTTTTGATTACTAGCTTAGCTGTGTGAAATCAGGCAGATGGTATAACTTCTCTCA 

TTGATGATGTCAATCTTTTTAAAAATTTTCTAGCAGTGAAATTTTAAAAAATGAAATGTT 

AGGTGAAACCTCAAACTTACAACAGTGGTAAAATTGGAAGGGTTCTGGATGGAGTGAGGG 

CAGGGAGGGGGTAACTGGGCTGAGGGGCCCCTCTTCAGCCTCTTCCTCCCCTTCAACCGA 

AGTCTGAAMTCTCTGGCTMTATGGGTGGATCCCTMGATTTCTTTTGTACTTTTAACA 

TACCGAAAAGNTATTCTAAAGAATTTTGGTATGGTTTT 

Sequence 223 

CGCNTCCGCCGAGCGCAGCAGTCTCCCCCTAACCTCAAAAGCCTCCTCAGAAGTAACCTG 

GTATCAGTGGGCTGTGCAGATTCTTATAGTCTTCTTTGCCTTTATATATGAAAATAACTT 

TG7TTTATGTTTGTTTTCACATAGGTGGAATTATACTGACTCTTACTCTGTGACTTGCTT 

TCTTCACATGACAGTAACTCCTGGACTGTTTCTGTAGCAGCCACACAAGTCTCCTTTATA 

TTGCCAATATCTATCTTTATCCCAAAGACAAGATGAATGTTCAAAAACGGTAATCCAAAC 

TCACTTCTAAAAATGGGTGTGTGTTTAAAGAAGAGCTGCTCAGCTGAGGCAGTTTTGCTG 

CCGAGGGGATTTGAGCAATGTCTGGGGACGTGTGGTTGTCACAACTTGTGAGGGGGCTC 

Sequence 224 

GCNTCCGTCAATTTGCATCAGTTGCCTTAAAGAATGGGGTAGTTATAGGAAGCTCACAGA 

GAGGMAAAMCTCTTCATGTCTACATTGCTTCTTGAACTGATTTTATTTATAAAAGAAG 

AATTGTTGAGCTAGTGATAGAAGTTTCATGAATTCCCTGGTACTAATTATCAGTTAAATG 

AGCTCTCTGAAGTCTCTGGAGAGCGTTCTGTGTCACTAATATTGGTGAAAATTTGAAACA 

AAAATGCTCTCCCATTGTCCACATATTGCTTCTTTGAATTTGG I I 1 1 I CGAGCCAAGAAC 

TTAGGGTGTGAGAATATGT7TGTGGGGAAACCCACACAAATTTTATGTTAGTCTCTGTAC 

ATTTAMTTTTTACCTTCCTGATTACTTACGTMGACTMACAATTTAAGTTTCTAAAAT 

GCCATCACTTTTGCAAATAAAGGACTTTATTAAAGNTGATAAATAACAATNATGGGCCAT 

CAGCTCCACCTATAATTAATATCCTTGCCTGGCACCCCTGGAAGGGACTTAGCTTNTT 

Sequence 225 

CNTCCGCAATTACTGCCTTAGGCTAGATTCCCATGAATANGAATTGCTGAGTCAAAAGGT 

TGACACATTNTTTMAGGTTAGATACATATTAGTCAAAGTTTTTAAGCAANATGCTTTCT 

MGCCTNTTTGATCTTTATAANNCATTGNTCCTTTCTAAAAATATATAACTGTCTTCTGC 

GTNCCMGGATMTTNTTTTTATTAATATGGGGCTTCTTGTGTCTACTTCCTCCCCTTTC 

GTTATTTCTTCAAAATGTTTAACAAACTAATACATTTCAGAACACATTATGCTTNCATCT 

TGTCACTATTTGCAGTACCTTGTATCTCCTGGACTTTATGCAATGCGTGTGTGTGCACAG 

ATGGAATATGTTNCACCATTTGGCT 

Sequence 226 

GCAGTGCCGGCGGCGCAGTTGGGAATGGGAGTGCGCTTGCACAAGCGCCGACGAGCGATA 
GCGACGCTATGCCCTCTCTTGCCCAGGCGCCGGGTGGCGCGGATCAACCTGCTCTCGCAG 
CAGCCCCTGTGGTGGCCGCCGGTGATGATGCAGGAATCGTATTTCGACGGTGGNCAGCTT 
CGCTTTCGATCAACATCTTCGNCGAAGTTCCGCGTGAACCGTNCGGNATCGACATCGTGG 
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CGCTGGCATCTGCCGCTATACCAATCATCGAAACCGCGCTGGAAGGTCGGGTCCAGCGCC 
CTGCAGAAGAGACGCCTCCAGCAGGACAAGGTGGCCTGCTCGACGAGCAGGCTACCCGAG 
TGTCGTTGAACAAGTATCGCCTTGGGTCT 
Sequence 227 

CTGACACCCAGTATTGCCAACCTAAGGACTAGGAAGAAAGGGCTTACATGTTTCTTGCTT 

TAATAATATGTGAATTAGAAAAGGTATTCAAGAAGATACAGGGTTAGAATGTGCATCTTT 

CTTTTATGTACTGAAATGTTTCAACTCATATGCAGCAATGTGGTNACCTTGAAAGATGAT 

TCCAGAGATTTCATGCTTCTGGCAGTTTCAGGTCACATTGGGGTGAAATTTGACATACTG 

TAGTCATCCCCAAGGTAATAATGTTGATGAAATGGTATAAACCCTGATACATTCTTAAGC 

AAAATGAATCTAAMGTTTGTTTCAAAATTTTTMCGTTTTATGTTGCTCTGACTTTCCA 

TATACTGATTTTTACATTACTTAGTGAAAAAAAAAAATTACCTTTTAGCTTNTGGCAACA 

AAAACAI I I I I GGCTATT 

Sequence 228 

CCNCGCGTCCGCAAGACAGGAGATTTCTTAAGGTCATATATGCACGATTTGCTTAAACGT 

CATATAGCATAACTGGAAGAATATAGAATCTACCCAAAGCCTATGCTCTTTCTACTGNAT 

TATTTGCTTATAATAGAGAATGCTAATCAGAATCACCTGAATAGTCTTGTAGAGGGTCAG 

TACACAAGCCTAGGTCCTATCCTTCAGAGAGGTGAAATGGAGGCAAATATATTCTGAAAG 

AGTTTCCCAGTTGATTCTGATGAGCACTCACAATTAAGAAGTGCTGCATTAAGAGCTGTA 

CATGGTGGCTTATGCCTGTAATCCCAGCTATTCCTGAGGCTGAGGCTGGAAGATCACTTG 

AGCC TGGAAGA TCACTTGAGCCTGGGAGTTGGAGACCAGCCTCGGCAACGTAGTGAGACC 

ATGTCI I I I I I I 

Sequence 229 

TTGGGAGATATGGCANGGTGAGAATGTTTGGGAAAGGAAGTAGAGACAGACATTGGATTT 

TTGTCAGTCAANTTTCTTTTGATTTTAAATTATTTATTCATTACATTGTCTACATGGCAG 

TTMTATTCTAAMCTATMACAMATTTAATATAGAAAAAATA7TATGCTTTTC7TTTT 

TGCCTTTGCATGCTCTTATTTCAAAATAGATTTAAAATTCATGGCATATTATACTAGAAA 

ACAAGTCTGTCAATGATCTAAGCTTCTATCTTTAGATACTAGGAAAAAAGAGAAAAACCC 

AAAGCAAGCAGAAGCAAGGAAATAAGAACAGAAATAAATGGAATTGANAAGAACACTTCA 

GAAAATCAGTCAAACTGAAGCTGATTTTTATAAAGATTA 

Sequence 230 

CCACGCNTCCGAGGAAACTGGCAGGGAAACAAAGTATCCCTGGAAGGAATTTCTTAAAGA 

GGAAGAGGCAGATCCCTACAAGTTTAAAATCAAAGAATTTGAAGATGTTGATCCCAAAGT- 

GAAATTGAAAGATGGACTTGTGAGGAAGGAGAAAGAGAAGCATAAAGATAAGAAGAAAGA 

TAGAGAGAAAGGCAAGAAAGATAAAGATAAGAGAGAGAAAGAAAAAGTGANAGATAAAGG 

CAGAGAAGATAAGATGAAAGCCCCAGCACCCCCACTGGTGTTGCCCCCAAAAGAGTTGGC 

CCTGCCCTTGTTCAGCCCTGCCACAGCCTCCAGGGTCCCAGCCATGCTGCCATCTTTGTT 

GCCAGTGCTTCCGGAAAAACTGTTTGAGGAGAAAGAGAAGCCGAAGGAGAAAG 

Sequence 231 

NCTAGACTCCCCTCTCGTATCATGGATCCCAACATCNAGGNATATGGNCATTTACGTGTT 

GGGATCTGCTCTGCCATTGNACACAGCTATATTCNATTGCCCCGGGNGTTGTGTATNTTT 

CCAAAAACGTTGAAAGGGAGGTTCAGMGTATNCAGTTATTNGTATTATTAGTCGTTTTG 

AAACTGAGTNGAAAGACTCATTNANGAAAGNTCCATATGCCTTCTTGTCTGTCTATGGCT 

GGNNTGCTCNNGAGAAAAGTCCNCANTTATACAATTGTA 

Sequence 232 

TTTCCI I I I I I IGGGGGCGGGGTCGCGCTCTGTCGCCCAGGCTGGGGTGCAGGTGGCGCG 

ATCATGGCTCATTGCATCCTCAAATGCCTGGGCTCAAGCAAACATCAGTTTTCTTATCTG 

TGAAATGAGGATAAAAATGTCTCCACTTAAGGGTTGTTGCAAGGAAGGTGTTGCCTTAGT 

CATAAAAGCTAGGGAAGGTGTTCCTAACGAAAAACAATTCGTCAGAGACATGAAGGTAGA 

GGAAGAATTCACACATGAAGGGGGCTGGGGAAAATGATTTAAGAAAAGAAACAGGCCTGG 

CGCAGTTGCTCAGGCTTGTGATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGGATCACCT 

GAGGTCGGGGAGTTCTAGACCAGCCTGGCCAACCGTGGTGAAACCTTGTCTTCTACTAAA 



# 
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AATACAAAAANATTGGCGGGGTGTGGTGGCAGGTGCCTGTATTCCCAGCTACTTTGGAGG 
CTGGGACGGGATAATTNCTTGAGCCCCGGAGGCGGGGGTTTCCGGTGAGCCCGGGATTGC 
GCCCCTTGCACTACAAGCCTGGGGCACANAAGCGAGGACTNTGTCAAAAAAAAA 
Sequence 233 

CCACGCNTCCGGTCTCAAATCNCCTCMGTATATTCAAAATTTGACCACTTCTCACCAGC 

ACCACTGTCATTATCCTGATTCAAGCCTCCATCATCTCTCATCGTTACTGTGACCTCCTG 

ATCATTCTCCTTGCTTCAGCCCTGGCCCCTGCAGGCAGCATTCAGTATTAAGCAGTAGAG 

TTGTTTCTATAAAAATGTAGTCAGCTGGGTGTGGTGTCTCACGCTGTAATCCCAGCACTT 

TGGGAGGCCAAGGTGGGAGGATCACTTGAGCTCAGTTTGAAGACCAGCCTGGTCAACATG 

GTGAAACCCTGTCTCTACTAAAAATACAAAAATTAACTGGGCATGGTGGCGGGCACCTGT 

AATCCCAGTTACTGGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAAGGCT 

TGCAGTGAACCCAGATCACACTACTGNTTTTNCAACCTGGGCACAGAGTGAGACTGCCTC 

AAAAAAAAAAAAAAAAAAA 

Sequence 234 

TCGAGGTCAAGGACGGTTATGGCCCGTAACTACCTGCTGCCGCAGAATTACGCCATCAAG 

TGGACGCGCGGTGCTGAGGCCCAGATCAAGGACATCACTCGCGCCCGTAAGGCTAAGGAG 

ATCAAGTCCAAGGAGGAGGCTGAGCAGATCCGCTCGCAGCTGGAGCACCTGGTCGTCCAG 

GTGACTGTCCAGGTCGGCGAGAACGGTCGTCTGTTCGGGGCCGTGACTCCTGGCGATATT 

GCGCTGGCAGTCAGGAAGGCCGGTGGCCCCGCCCTCGAGAAGCGGTCCATCGAGATCACC 

AAGCCGATCAAGACCATCGGCAAGCACACTGTCCGGCGTCAAGCTGCATGACGCTATTAA 

GGGTCACGTCACGGTCGAGACTGTTCCCGCCGCGTTGATTTGACGTACACGCAGANTAGG 

GGGAGGGGCATCCAACCTGGGTGCCCTTCCTTTGCTTNCGTCACCACCGGCGAAAGGTAA 

ATGACCGAAAGTAATTCTTATTACNGTGCTTAGGGGGGTTTCGCCGCGCCGGGCCGTTAG 

ACTTAGTCTAGAAGAAAAAACCTTCCACACC7TCCCCTTGAACCTGGAAACATTAAAATG 

AATGCNATTGGTGGGGTGGTAAACTTGG 

Sequence 235 

GNGGTAGCTTTGTGTATGTCGGGCACTTCNAGGAATAGGGTGCAGGAGAAACGTCTCAGT 

GTCTCCCCTTCCGAATCTTGGCTTCTGGAGGGAGAGATGCTGGGGTGGGAGTGCTCCTTG 

GTGGAGTACTCAGGAGCTTAGTAAAAGCAGAGGGGGCTGGAGAGGCAGGCCTGGCCTGCA 

GAGCCAGCATGGAGAAGCCTGGTGTAGGGCTCTCCAGCCTGCCAATTTACAGTTAAGAAG 

AAAGGAGATATGTATATATATATATACACACACATACATACATACATACACACACACATA 

TATACACACATATATATACACACATGTATACATATATATACACACACATACACATATACA 

CACACATGTACACATATACACACATATATACGTGGGGTGTGTATATGTATATATGTGTAT 

ACGTATATA 

Sequence 236 

AACTGTCTAATCTCTTGACCTCGTGATCCCCTGCCTCGGCCTCCCAAAGTGCTGGGATTG 

CAGGCATGAGCCACTGTGCCCAGCCATAC 1 1 I I I I I I I I 1 1 I I CTGGNGCATTTCTGAAG 

TTAAATATGTTGAGTTCTCTGCCCATTTGTTCAMTTCTATTGNATTATTTCTGNNGGAT 

TGACAMTGTTATGAACMTTTGGTTTCMAATAGATTTTTCTATTTAGCAACTTTCTAT 

TCCTCTAGGACCTAGTATTAAGGTCA7TAAGATGGCTTAATGAGTCATTAAGCCATTAAT 

GAGTCATTAAGATGGCTGACCAATCTGTGCCATCTTTTTTAGTGACATTTTGTTTATCAG 

TGTCTCTTTTGTGTATTTCTTTGATTTATATCTTGACAGTATTACATAAGCAGGAATAAA 

AGAGACTTTGAGTGGGAATGTCTCGCTCAAATATAAATAACCTTATAAATACAATCTTTA 

CTTCTCAGAGTCCTAAAATCTTCTTGGTTAGTCTGGTCTTCCTGGGTTGATTCAAGGTTC 

TCGGNGACTAAATGGCAT 

Sequence 237 

TGTTNACCTTTGGGATTACACAATACTTGCAATCCAGCTCTGCCATGGGGGACATCATTA 

CACAA AGCACTCCTCACCAGTAGTCAGATGGCACTTGATAGCTACAGGCATGTGAATATT 

CTTTTACTGTCAACTTTTGTTGTGATTTCACCTAAAATATGATAAGCTCCTTGAGGATAA 

AGGCTAAGTCTTACTTCTTCGTATTTCTGGTAACCGCTTGTACCAAACACCTGCCAAGCA 

TTGTTGA7TGCACCTATGAGGAGGTGAGAAAGTGCCAGGCCGTCATGCCCCTCTATACAA 
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ACATGTAACAGTAGACTCTGGTTCATCAAGGAGTAAATGACATGATATGTGATAGACCAC 
ATACATTCGTGGCTATAGATAGGGTGAGTAACTGTAACCATATCCAATTGTGAGCAAAAA 
GTGTTTMTTGGGGNGGTTTTAAACTAATATAAAATTGGCTAAATCATCATTTTTCAAGC 
TCTCGAATCATCTGAAGACC I 1 1 1 1 I IAI I I I I I AANANTTAGGCC 
Sequence 238 

CGCTCCGTTGCAGTGAGCTGAGATCACGCCACTGCACTCCATCCTTGGTGACAGAGCANG 

ACTCCATCTCAAAAAAAAAAAGAAATTGGGTTGTCTAGGCACAGTGGCTCGTGCCTGTAA 

TCCCAGCATTTTGGGAGGCTNCNTAGAANAATCACTTGAGCCCAGGAGTTTGAGACCAGC 

CTGNACAACATAGTGAGACCCCATCTGTACCAAAAAAAAAAACACCAGACCTGGTGGCAC 

ATGTTTGTAGTCCCAGCTACTCAAGGGCTGAGGTGGGAGGGATCACTTGNNCCCAAGAAG 

TCAAGGCTTCAGTGAGCCATGATTGGTGCCACTTGACTACAAGCTGGCAACAGANCCAAG 

ACCCTNTTTAAAAAAAAAAAAAAAAAAAAATNGGNTTGGGTTATAGGNGAAGTTNCCTGG 

AAAACCTTACAAAACAAGAACCATGTTTGGGGGAANCAANTGGANGGGGTNAAANTGNCC 

NCTTGANGCTTNTCCTTTNGNGGNCCNTAATTAAAAANGTTTCCCNGGTNTTGGCCCNGG 

GNCCGGAAAATNCCTTNGAATTTCTTTAGCCCCTTTTGGGGNGGGCAAAAGCCAGGGNGG 

AATCCCNCNNGGGNTCAGNGATTTTNGNAACCNNACCTTGGCCAAAAATNGGGGAAAACC 

CCNG I I I I I I I INTAAAAAATAACNAAAAANTTTNCCCCCTTTNCTTTNGGGGGGGCCCA 

CANNNNCCCCCTTTTTTNATANANAANAANAANCCCCCCCCCCCCCCCCCC 

Sequence 239 

CATCTCAAAAAAAMCTCACAACATATTTTATGTACACCTAGTATCAGTGAGATTTTTTA 

AT AATTAG ATTTA7TATCTTCCATATTG N NT AC AA AACTTTC7TCAACTTCATAAAAAGT 

CAAGTAACATACTGTACTTACTTCTCTCACCTCATTTTAAAATAGTGGATAAATTGTCAG 

ACACAGTGACTCTGCACCTGTAATCCTAGCTACTTGGGAGGCTAAGGTGGGAGGATTGCA 

TGAGGCCAAGAGTTCAAGACCAGCCTGAGCAACACATAGAGACCCTATCTCTTTAAAAAA 

AAAAAAAAAAAAAAA 

Sequence 240 

TGTCGACCCACGCGTCCGCGCTCCTGTCATCTCCCTTGGGTCTTCATTTAAATGCCACAC 

CAGAGAGGCCCTCCCTGGCCACCCTAATGAAAACTTCAACATCCTCAACCCTAACATTTC 

CTGTCCCCTGGGTTATTCCTCCCCTTGGTATTTATCACCATTTAATGTACTATCTGGCCG 

GGCATGGTGGCTCGTGCCTGTAATCCCAGCACTTTGGGAGGCCCAGGCGGGCGGATCACC 

TGAGGTTAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAA 

TACAAAAATTAGCCAGGCATAGAGGCATGCCGCCTGTAATCCCAGCTACTTGGGAGGCTG 

AGGCAGGAGAATCTCTTGAAACCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCAAGCCAC 

TGCACTCCAGCCTGGGTGACAGAGCAAGACTCTGTCTCAAAAAACAAAAACAAACAAACA 

AACAAACAAACAAACAAAAAAACCCAACAAACCCACCAAACATACCATATTTGGTTTCTG 

TAAMAMTAAMMGAGAGAGAGMTTTTTAAAAAACAAAAACCATACTATCTAATTTA 

CAI I I I IAATCT 

Sequence 241 

CCCCGCGTCCGCTAAGGCATGTGAGCGCCTTGTAGGGCACGTTCTGTCTTCTGACTACAT 

MGCAACACTTTTGCGACCCTAAGTCACAGCACCCACAGCTTTTGGCAGGATACTTTAAA 

ACAGAAAAAACACATCATTGATCCTGGCAGGATTTTTTAAGGGATTGTGCTTAAGAATGT 

TTAACTTTGGTMTCAGAGACCACCACTGGTGTTTTTCTCCAAAATCACATCTTTAAGTT 

AATTAATACTTGAAGTTAATAAATAATACTGCTCAAGTGTATTAGTAATGATGCCATAAT 

ACC ATGT GAATTTATGCTGATTCAAATGTTGI I I I 1 1 I CGTTTGATACTCATATGGCCTT 

CTGTTTTGAGGACTTCAGATTATTTAGCAACTGATTTAATCTGGTCAAGAAATAAACTTT 

GCTTCAGCTGGAAAGCGTGAGGCTTGAGAAAAGCAAGGTTTTGGACAGGGGACCTATGAA 

GCTCATGTTGAAACTTAAGTGTTTTAAGGCTGTATGGGAACTTGGAATGGGAGTGAAAAG 

MCCAAGCCGTCTACTGNCAAGGTmTCCCTCCTCCCTCTAAAATTTATAAACCTCATT 

CTTAGAAGTGGCAAAMGTTGGGMCCTTTTCCACTGNTTCACTTCTNTTANTNAGGGGA 

TTAAAGGNGGATNGGNAMNGGAAGTTTTCNTGGTTTTTTAAAAAAAANNGNANAANGGG 

GGGG 
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Sequence 242 

GNCACGGCGGGGCAACCTGGTTCCCCTGGGCGGTAGTCCCGGCAGCTGTGCTTCCCCCTT 

CACTGCAGTGCGTGTTCAGTTGGAAACACGGAGATTTCACCTTTACTCCCTGGCTATAGG 

GGGA TGGTGAGAANGGGGCTGCGTGTNAGGNGANATTCAAGGCATAAATCTTAGNTTGTC 

TTTTAMGMTTGNGTGCCGGGACGTGNTTGGAGGNTGGGTGCANANAGGAAGAGGCGAN 

CTNCTGGAGGANGCAAAGGACAGGTCANTNTGCGNANTTCNGNNNGATGCTGCTGANTGA 

ACCCCNGNTGCTGTNGCCCCAACAAGAAAGGCTNNGCTNNGATCAAACATTTACTGNACC 

TANAAGGCCACATTCTNGCTTAANNANNCCAGGAANGCNCNTTCTTGTTNCCCATTACCA 

MATCATTNACACTTGGTTTTTGGCCCNACATAC^ 

NNAAANGGCANTANACCNNNGNATNTNNCNTCATNACANGAATAAAAAGCCCNCAGAACN 

CNACANANGNGCCCATCACTANNNNTANAGATNTCAAAAGGGGGCCCNGGATCCTTNACC 

ANAGAANACTCTGGTCCNNAGNGNAAGAAAGNAAAACCCCNCNANANGCACTGGGACNNT 

NTNATANTTCTNNTTNAAACAAAANCTTATAANNNAAAAAAAAMTTTTT 

Sequence 243 

AATTMTAAGCCTCCTGGGTGATTTGGCTGCAGGTGGTCCTGAGACCACACTTTAAGAAA 

CACTAATCAAGAATCAAGGCCTTATGACTCTTAGTCCTGTTTCTTCCCAAAGGACTATGT 

TGCCTCCCTACATTCGAAMCATTCCAAACAATACATAAGAGTCTTTGTAATTCACATTA 

G TGCT CGTATTTACTGACTCMCAATTMTTCCTGATTGAGTCATTCATTGAGTMTTTT 

CTTTTACTCACTCAGCCTCTGTGGTTTGCCAGGAGCTACAAATAAAAGATTAAAACACAG 

TCACACCTCCAAGGAACTCATATAATAGTAACAGAAGTAACTACCATTTCTTGAGCACTT 

TATTTATACATACCGTCATACAGTTTATCTCATTCACCTATCTCATTCCCTGCCTTTCAG 

TMGCCTGTATACTATTATAACCGCATAGTTTGGGAATATTTCATATATATATATATATA 

TATATATATATATATATATATATATATGTATAATCACATATATTCTCTTCAACTTGTNCT 

TTACATTTAATATGTCTTAMAGTATTTCTTTGTCAGTTCTTGCTTTATATATGTCTAGT 

ATTTCTTTATMGGGCTACAAGAAGTATCCCTCAAGCATTGGATATTTCAATCAAATTAC 

CCTTGGGGGGGTNGGGGGAAAAAAATNNGGGGGGGNTTT 

Sequence 244 

TCGCCCCGCGTCCGGGGAAGGCTAAGGCGCGAGGATCCCTTGAGCCCAGGAGTTCTAGGC 

TGCAGTGAGCTATGATCACGCCACTGCACACCAGTATGGGCAACAGAGCGAGATCCCATC 

TCTAMCAAATTTAMACAAACGAACAMTGAAAMTATTGCTGTCTTAAGGTTGGGAAG 

GGGCAGAGACCCCTTTGCTTGCTCATCACCAAGACACTTCTGTGAGGCCCCAGGGCTCTT 

TGGAGMCGTTTTGAMATCACGGTTCTAAGTAATTATAGTTACTGTGACTGAACTAATT 

TAGCCCTAAGCTTCCTACAATCAAGATAGAGATACACTATGGACTGCATTTCTCCGCTTC 

AGATTAMAAAAAAMGTTTAAGTCAGAATGTAGTTATATTTTCAGGTAAAATGCTCAAT 

ACATTTTCAAGATGAAGCTGCTCAAAATTAAAGCAGTGAGTCCAAGGGTTAATCTGNAAA 

AAAAAGTACAATTTACTATCTCCTGGTTNCANACTTATAGACCCTCATAGGTGCATTGGC 

TMTACAAGGGGCCACTMACACATTGTGGCATTTACNGGATTTTATTGGTGAAGNGCTC 

T ATMGTTTT ATTGGTGCCAGGTAAAAGAAANGCCTCNTATAMAAAATGGTGGNGGGGG 

G I I I I I I I I I NCCCCCCTTTTT 

Sequence 245 

TTCCGTACCTMGAATTGTACCCTTTCATAACAGCACCACTTGGATATGTAGAAAGAGTT 

TGTTGTGAGATCAGATGTAAAACAAATAAAAGTTATTCGTGAAATGATATGGAAGACTGG 

GATTAGAMCTGTGGCATTCAAGAAGCCAGTTAAGCTGTTCTCAGAATTGACAGAGATTC 

TAGAGATGGTGTAGTGCAGATGGTGTAGTGCAGGGATTCTCAGCCTCACTGCACATGGTA 

ATCTCATAGGAMATTTTAACAAGGACAGATGCAGAAATTCTTATTAAACTGTCAGGACT 

GAAGTCCAGAGGTCACTATATTTTAAAAAGGTCTGCAAGTGATTGTAACATGCATCTATG 

GAGGAAAACATCANCGTAGGAGAAAAAGGGAAAGAAACCAACTGGAGTAAAGGCTCTGTC 

TTGAGCACTGTGCTGGCTCCTGTCATTCTCCATCTCTTTTTATCTTCATAGTAAGTGAGA 

TGGGTTTGACAAATAGGGA 

Sequence 246 

GCGTCCGGNGTMCTTGAACNAAAGTATTCTCCTTCTTCTGTATATTTGTTCTCAACCCC 
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CAATTCCTCATAACTTTCTAAGTAAGTAGGGTGAAAAGAAAGTCAGGGCTGCCTAGGGAA 

GCTACATTTGCTTCTCAAGGCTTGCACCTGCATATTTCATGCCCGACATGTTGCTGAACA 

CAGCCGAGTAAAGTGCCAGAATAAAATCAGCTCTGTCCACGTGGTACCAGGAATGCTGCT 

GAAATGAAGGTAAGTATAGGATTCCACAGGATCTAAGACAGTCCAGGGAACTGGGACCAA 

TTGTGGGAAACCTCAAAGCTGAGCATATTTGAAGAAAATGCAGAGTAAAATGGCTATGGG 

CTGCCGGGAGGCCTGGATTTGAGATACCAAGCACCACTGATATAAACACTGATGCTCAGA 

AAMGAMTGATGAGTGTGTATTTGTGTCCCTTGAGACCTAATTTAATTTATAATTATTC 

ACCAGAGATAAGAGGCAAGGGATGTCTACTTGCTGGATATCTGAATTTACACAGTTCTCT 

AGTCATTAGTATTTTTTACAAATATACCACTCAGACTAGGGGGGCAGAGGATCTGGGGTT 

TNAATTTCAAGACCTGTCACTTATCTGCATGAATTTTGNAGCTTATAAGAACTTACAGGT 

TAACCI 1 1 1 I I 

Sequence 247 

TGTCGACCCCGCGTCCNTNGTATTTATAAANATAATNCTGNTAGATAAATAAGTGATTCA 

TATTTTGTCMANCTATTTTMMTTTCAATATTTAAMTATTNTTGAATCACTGGGTGT 

CGNTAAGTGGCATCATNNATGAGATTTGATTCCATGTACCATATAATNTTAGATTGGTCC 

TMTCTCACCCCTTTTAAACTCCTTCAAGCATTGCTATTACTGGGGTTGCCTTTGGGAAAA 

CTTACTTCTAGATACTACCATATATCTGAAATAGTAGAGGTGGATGTTAATAAAATTCAT 

AAMTNATCATGTATTACTTTTTTTGATTTACCACTGGAAGGAAATACAGNCATGTGCAA 

TATAATGACCGTTTTGGTCATNGAGACCCACATGTGTGACAGTGGTCCCATAAGGATGNG 

GCTGAAAANNTCCTGTTGCNGCCTAGTGACACTGTAGCCATNGNAACNCCATAGCACGAC 

ACGTNACTCACCTNTTCATGGTGATGCTGGTGT 

Sequence 248 

CCCCGCGTCCGATTTTGAATGTATTGAGACTAAAGTTCCTTTGAAACATTAAAGAAGATC 

CATCAAACTGGCAACTGTATATAAGGTCTGATATCTTCTGGCACTCAGAGGGAGAGTTTC 

TGGTGGAGGTGGAAGTGACTTAGGAGTCATCCATGAACATAAATGAGATCACTCGTGTAG 

GGAAGAGTTAATGAGATTAGAGAGCCTTGGACAGAGCCTTGCAGCCCAAACCCATGAAAG 

AGCAGTAATAGTGAGTGAATTACCTTCTGAGGCAGTGCAAAGTAGATAAGAAAACAAAAN 

CTN N AAAAAA AAAAAAN AAN AA AAAAAAA 

Sequence 249 

CCGCGTCCGCAGTGGTNGTGATCTCGTCTCCTGCAACCTCTGCCTCCCAGGTTCAAGCAA 

TTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGATTACAGATTATGTCTTGTGTTANNGAAA 

TCATTCMTTTTTCACCAGAGAATAAGGAAAAGCAAATTAGACTGTGAATAAAATCTCCA 

TGTACCCACTAGGATGACTACAATTCAAAAGGCTGACCTATCCAGGTGAAGATGAGAGTG , 

TGCAGCATTGGAAATTCTTACACATTGCTGGAGATTTAGAAGCTAAAGAAAGAGGACAAA 

TGATTTGGAAGCAAAGATAAGAAGGAAGAAGGAGATGGGGTCTTGCCATGTTTCCCAGGC 

TGGTCTCAAACTCCTAGATTCAAGTGATCCACCTGCGTTGGCCTCCCAAAGTGCTGGCAT 

TATAGGCATGAGGCACCATGCCAGGCCTGTTTTGAAATTTCATACATATTTATAAAACAT 

ATTTTCAGGATGAGAGAATATACCAAGAAGTTACATAGCATTGTGTATCTGTATAGAATA 

AAA 

Sequence 250 

TCGACCACGCGTCCGGGTGAACGTGGTCACCAAGGCCATGGGTACCCTGGGGGTGAGCTT 

ATCCTCCTGCAGCGTCCCTGGTTCCAAACCCACCTTCGAGCTCTCAGCCGACGAGGTGGA 

GCTGGGCCTGGGGATCCACGGGGAAGCTGGTGTGCGCCGGATAAAGATGGCAACCGCCGA 

TGAGATTGTGAAACTCATGCTCGACCACATGACAAACACCACCAACGCGTCCCATGTGCC 

TGTGCAGCCCGGCTCCTCAGTTGTGATGATGGTCAACAACCTGGGTGGCCTGTCATTCCT 

GGAACTGGGCATCATAGCCGACGCTACCGTCCGNTCCCTGGAGGGCCGCGGGGTGAAGAT 

TGCCCGTGCCCTGGTGGGCACCTTCATGTCAGCACTGGAGATGCCTGGCATTTCTCTCAC 

CCTCCTGCTGGTGGATGAGCCTCTCCTGAAACTGATAGATGCTTGAAACCACTGCAGCAG 

CCTGGCCTAACGTGGCTTGCAGTCTTCATTACTGGGCGGAAAGCGGAGCCGGGTAAGCCC 

CTTGCCGAGCCCCAAGAAG 

Sequence 251 
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GTCGCCCCGCGTCCGCAAGAAATGATGTTAGTAAAGATACTTTAATTGCGGGGAGTTCTC 

TGTTGGCTGGTGAATAAGGACGTTCTCTTTGAGTTCCACCTTTCTTTGAGAAATTTTCAT 

GTTTCCTGTACTTTTCCTCTATTATATCATGGAAACTTACTAAATCAGCCTGTGCATTCC 

GCCCACATATTCCCTCATGGTGCCTTCCGGTCCTAGCAGCTGAGCCAGGGCCATCAATAA 

ATCTTCTTCTTTTCCCCCAAGATATTCCTAGTTGCTTCTTTACTATTCATTCTTGTAATT 

CCATTCATCTGTGTGTATCATTTAGGATGCTGGGCAGTTGCATCGCATTTCACCTAAGGT 

CACATCCTGATGGGATTTGGTGGGATCCTAGAGCTGGCTAGGACCACTTGTGAGAGCTGA 

TTGTTAAATTCTCCGAAATGACGTGACTCAAGTTGTTAGACTGCTTAAAGG 

Sequence 252 

CCGGTGAAACCTGGAGCTGAAGTGAATTCTCTTAGAGTATATTTTGAAACTGTACTAGGA 

CTTTAMCACTTTTGGAATTTAAAACAGCCATAAAAATTCTTGTTATACTGAAGGAGTTC 

CTGAGGCAGTGTGCCTCTCATTTTACCACCTAAAGTTGCCATAGAGGTCCAAGGAGACAC 

TGCTGATAGCAGAAAGTCTTCCAGAAAGAAATTAGGCGACCCACACCAAGCATGTATGGC 

TTTGAGTCTTACAGATGGCTTTTTMTAGTTTAGTCTCTTAACCTAAGGAAGTTTCTGAA 

GTTCCGGTCAGAGAGTCTAAAAATTCACATTTTACCTAATAAATGATAATGAGGCTATTT 

ATCTTGTCTGTCTGGATTTTTTCACTTGACATTTAATGAAATATCCCATATTACCTATAA 

TTTTTATTTGAAG 

Sequence 253 

CCCCCCGCGTCCGAGATAATGCTGTTTGCTTCCGGCCGCTGTAAATCATAGGTGAAAACC 

AGTAGCANGTGCTCACTCAGTGCCTCCCAGAAGCGGTCTGCGGGTCTCAGCTGGGCTGGG 

GGCAGTTTTCATTGGGCAAGGCTTGGGCTTAGCTTCGAAGCANGGGCTGGGAGAGGATGG 

ATGGGGGTGTGAGAGCAAAAGAAAGACCTGGCTTTGCAGTGATGGCANCCCACGTTCAAA 

TNNNAGCTCACCACTGACCNGTCGNNTGACGNGCGCCAGGNGTTAGGAGACTGNMCTGN 

TTNTGNGTNNNGNNTCCGGNCGTNCATNNNNNCTGCTCAGCATACANANCCTNTTNCTNA 

TCNTAATCCTCATACNCATGNCTGNNNACTNTACACTGTTCTACTTATCAATGACAGGTC 

AAAAGTGTTATCATNTGTGACNTAGAATGAGTGAAGTACACNCCCTCTTGGAAACTATGA 

ATGACTTAAAGAATCACCNTTTGCAAAAAATC 

Sequence 254 

GCCTTCGCCCCTGCCTCCTCTTGGCTGCGGCTGGTCATCTTCCCACCTCCACAGTGGGCC 

GAGCTGCCAAACAAGGGGTACACATGTGCCCCCTGACATAGGCCAGGTGGTCTCTGCCCT 

CTGTAGCTCCCATGAGAGAGGGCTCCTCGGACCGAAACAGGAGGCCACTGCCCTTGCGCA 

CACACCGTGGCCGGGTCCTCCTGGGCGGAGCGCTTTCCGTGTGTGGGAAAGTCAAGGGCA 

GCCCCGAGCCTCGAAGCCCAGGCTCCCAGCCCCGCGCCATGTTGCATTCCCGCCTCTACT 

CCTTGGTAGGCTGGNTGCTTTGAGTGGTTCTTTTTMTTCTTTCTGTTGGTTTCTCCTTT 

TCCTTTGCCTGGGTTTTGCTTTAACCTCTCTGTTGCAGAGATGCAGAGCACTCAGAGAGC 

CTATTTCTATCATCGCTTTCCTATTCTCCACCTAGAACCAGNTGACTGGCCGCCCGAGTG 

GNGTCTCTTGTGTGTGTGGTGCCGTCAAAGCTGTGGAAAGAAATGCTTCTGCCTAGGTTT 

CTTCGCGCCCCCCCCTTGCTTGGCTTTTCCTGCCTGCTTACACCCCCGGTTCCTGATCTG 

CCCTGGGC 

Sequence 255 

GCC CACG CGTCCGCAAGANAGCTCCTCAGATTTGTCATAGACTATATTTAAAGAAAGGCC 

ACATTTTTCTTATTTAAAATGCATTAAACAATGCANCCAATTAAAAGAACTGAGNTGGAT 

TTGTACAAAAGCAGGGACTAGGTCTGNTTTGTTCACTGCTATATTCCCAATGCCTAGAAC 

CATGTCTGGCAAACATACTGGCATGGGAAGAACATTTCCATAACCCCTGAATGTTCTGTG 

C CCCT TTCCAATTAATCCCTACCCTCAGAAGCAACCACTATTCTCATGCTTATTACATTA 

GTTTTGCCTCTTCTTGACTTTCATATAAATGAAATCATACATCTAANAAAAAANANAAAA 

AAAA 

Sequence 256 

TCGACCCCGCGTCCGATTTGATATAAATAGTTATGTTACTCATATAGAAATCTCTTCCCC 
ATTACACACATACAAACATTTATCTATGAGTGGCTTATAATTGCAAATAAGATGTAAATC 
ATGCTCATGATCATTGTCAAAATTGTGAAAGA I I I I I I I CTATACCTCTTTTAGGTTTGT 
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TTTGTTTTTGTTTTTGATTTCCAGGTGGCATTAMGACAAGAGGGA 

TACTTCTACTCCCMGTCACTAGTTTGCTGMTTTAATTGAGTTAAAGAATTGTATCAGT 

CTTCTTGGMGTCTAATACAAMCCAGTTCAACACTAGTTATTCATTCTTTGCTAATTCA 

CCAGAATTGAAGGATGGATAAAATGAGAAAGAGAAGTAGTTCTTCATATTATTAAAATAA 

AGAGTTAMTTAGACACTTTGTTGGACTCTTTGGTCTTAATAATTCCTACTC 1 I I I I GAG 

GTCCAAMGTTTTGTCTTTGATAAATATAATTTTAATGGG 

Sequence 257 

AAGTTGGGAAAATAATTCATGTGAACTAGACAAGTGTGTTAAGAGTGATAAGTAAAATGC 

ACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGTCACCTGGGGAG 

TGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGMTTTTTAGTTATATGTGCTGTAAT 

GTTGCTCTGAGGAAGCCCCTGGAAAGTCTATCCCAACATATCCACATCTTATATTCCACA 

AATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGG 

GGCGGCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGC 

CTTGGCTTCTCTTCCCAACTGACAAATGCCAAAGTTGAGAAAAATGATCATAATTTTAGC 

ATAAACAGAGCAGTCGGCGACACCGATTTTATAAATAAACTGAGCACCTT C I I I 1 I AAAC 

AMCAMTGCGGGTTTATTTCTCAGATGATGTTCATCCCGTGAATGGTCCAGGGAAGGAC 

CTTTCACCTTGACTATATGGCATTATGTCATCACAAGCTCTGAGGCTTCTTCTTT 

Sequence 258 

GAGTCGACCCCGCGTCCGCTCTGGAGGAAGCATAGATTAGAATCATGAI I I I I ATCTATT 

TTAAGAGAATAGAAGAACAGAAGGGGTTACAATCTTGCAATATTATGCAACTCTTCTGCT 

CTAATATATCAAAAACTTGATGATCCAAGATCATGCAGAACAGCTGAGAAGAAATCAAAG 

TAAACAGTGTACCTTGCAGCCAACAGATCCTGCCAATATGAGATTAGAACTCTCCATCCT 

AGCAAAAAAAAAAAAAAAAAA 

Sequence 259 

CTGGTACCTGCGAGTCGCTGCAGCAGCTGTGGCAATTGTCACCTTCATCCAGGCCCATCC 

CGCTTTGAGGGCCTAGAGAGAGTGGGCCAGAGGTTAACCCCCGATTCATCTGCCTCCCCA 

CGCTGGGCATCTGGGTGTGCCAGGGCATTCCCCCGCTGGTCAGACAGG I I I I I GGGCCAG 

GGCGGGGCTGACCAGGGTTAATTAGAGGGAACTGGCTAGGAGGAGCTGGGGAGGGGGCTG 

GGGAGAGTCCAGGCCTNCAGAGCCCCTGGGACACAGCAGGTGTGTGCTGCCATGGGCCGG 

GGCTTGAACTCTGCCAGACTCAGGCGCCAAAAACGGCGCTTGCGACCTCAGGTCCAGAAG 

CCCCGGCAGCAAGCTG 

Sequence 260 

TCGACCCCGCGTCCGAAACTCTGTCTAGTCTAAACTATTATTCTATACTTCTCATCTCTA 

TATGTTAAGGATTGATCTCCAAGATAAATTG I I I I I I GTTTGN I I I I I AGGGACAGGATC 

ATGCTCTGTTGCCCAGGCTAGAGTGTAGTGGAACAATCATAACTCACTGCAGCCTCGAAC 

TCCTGGGCTTAGGTGATCCACCTGCCTTGGCCTCCTGAGTAGCTGGAATTCCAGATGCAA 

GGCACCATGCCTGGCTAACTTITTMMTTTTTCATAGAGATGGGGTCTTACCATCCTGC 

CCAGGCTGGCCTCGAGCTCTTCACCTCAAGCAGTCCTCCTGCCTCAGCCTCCCAAGGCAC 

TGGTATTGCAGGAGTGAGCCACCACGCCCAGCCCAAAATAATTN I Mi l I AAAGCAAGAT 

GTAGAMAGTGATTATMTATGTTTCCATTTAGGCAAGAAAAAAAATGGAGAGGACTATA 

CCTGTACTCTCTGNACATAGGATCCACAGAAAACTTCTAATGGATGGTTATCCCTGNGGN 

GGGAMCTGGGGGACAGGGMTGATGMGCAGGAAMTTTTTACTGGATAAACTTTAGTT 

CTGGTGGCTCTTTTTCTTCTATCATGNGNATGGTAACTT 

Sequence 261 

gtcccgcmagcctttaaamgagtccgmtttcacttttcaccttttgtagatgtgcac 
gtgtagctgtagagctcatacttacgtttcacatggcatagttgatggatatgtaggtgt 
aaagtttatggtagtggacaggctgagaatggtgtatctgtgacaaaaaatctgatggaa 
gtgatata tttg atatgamgtgmcattttcttagttgggtgtttataacl i i i i i i ig 
gtaaattgtttttagtttttatccttatttY^ 

tccccaaatcttcttctgmttccaaggamcactgttttagcatttatttgattacttt 
ggttcattcttttctccactcccatttatttgt7ttccattttgtaacttctataaag 
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GATAAMATCTGGMCTTCTAGATCTGACCTTCATGCCTTGTCTTTTCTATGGTACTTAT 
TCTTTCTGTCTCCTTCTCATTTTGGATTGGGCTTATGAGAGAAATCTTGGGGTTGATCTT 
CCAGCTCACTMTTTGATATTCATTTGTGTCTCTTCAGTTACTTAGCTTGCGTGAAAACT 
I 1 1 I I 1 1 I CAGCAATTGTGTACTTAATTTTCATA 
Sequence 262 

CCNCGCGTCCGTGCCCTGGGGCCCCCCTGGGCGAGCATACCCTAGGCGTCAGGCCTGGAG 

GTCTCCTCGGTGCCTCCCAGCTCTTCTGTGCTCCTCACACTCTGCTCTGNGCANATTGGC 

TGTATTTGTAGGTTACTGCCTTTTATTTCTCACATTTCTTTTTGGTGAGAGTATATCAAT 

CAATCAATCATCCTCAGACCTCTATGATAACACTGTGCCCCACACACAAGAAGCTACTCA 

ATTAATGTTTGTTTGTTGAAATGAGAGAAAATATTTGTCCTTAGTACAGAAAGAAGATGA 

AGCCAACCTCTGATAGAAGCCCACCCATAGACTAGGGTGTTGAGCTGCCTTCCAA 

Sequence 263 

NCCCCGCGTCCGGTAGTTGCAGAAGCATGTTCTTGAACTTATCAGTCCTGACCTCAGATT 

TCATCTTCTTCTGGTAGATGTAGCATACATCTCTGAGTGTTATTAGAGGACCAGNCTAGA 

GCCTCATCGTACTCCTTCAGTTACTTCATAATCATCCAGCTCTTTATATTAAATG I I I I I 

CTGTTTAAAATGGCAGCAATGTTTTTTATTTTTTGAAATGGTCCCTGACAGCAACAGATC 

TTCTCGTGTTTAATTATTGAGTCTGTGCTGATTACACAGAATTAAGGATATAGTTTCTAA 

AGTACTTCCATTTTTATATATTTTAGCATTATTCTGAAAGGCCTGGAAAAAAACTATTTT 

TTATTCGATTTGAAAGTGAAGTGACATAGGTGGGTCGCTATAGCAAGAAATTACCCTGTA 

TTTTTCCATCTCTATCATCACAGGCATCTCACAGAATTAGAAGTCGGACATTATTGATGG 

ATATATTAGTCATGAATAATTAAAATACATTAAATATAAAATGGGTCAGATACGGGCAGA 

TTT 

Sequence 264 

CNCGCGTCCGGGAGAAAAGAGTTTTATACCTAATTCTTNTGCAAGTGATTACAT AI I 1 I I 

ATACCCAGGATGTTCAGCAAGATGMCTTTATTTTTTAGATGTATCTATGGNTTTTCCAC 

CCATTTTATTAATTTTTAGAAATATMAAGTGCCTTTAAAATTTAGCTGGGTTAAGATAG 

TAAGTTTTAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGCGGAGGTTGCAGTGAGC 

CTAGATCGTGCCATTGTACTCCAGCCTGGGCAACAAGAGCGAAACTCAGTTTCAAAAAAA 

AAAAAAAAAAAAAAAAAAAA 

Sequence 265 

GATCCTCTMGTNCCCMTGATCNGAGMGAAATATGAMGGGMTTTTAAAATATTTTG 

AACTGAATGAAGATGAAAATGCCACAGATAAAACTTTGCATGGGGNAGCTACATTAGTTA 

GCTTAGAGGGAAATTTATATTTTTAAATTCTTATATTAGGAAATAAGTCTTAGATAAATA 

ATGTCAGCTTCCACCTAAGAAGTTCAAAATAAACCCACAGTAAGCNGAAGAAAGGAAATA 

ATAAAGCTGAGAAAAAATCAATAAAGTTTAAAAGAGAAAATTAATAGATATCAATGGAAC 

AAAAGTTGCCTTTTTAAAAATATCAACAAAATTGATAGACCTTTAGCCAGAATGATCAAA 

AAAAAAAAAAAAAAAA 

Sequence 266 

CCCGCGTCCGCCTGAATTTCAGAGTCNGTTGAGAAGGTAGGGAGCAGGGATCTCTCAGAC 

ACCMCATGTTCAGCTTTATTATAGATAAGGCTGAATTGATTTCCTGGAATNGACCTTAC 

TTCCCCACCTCCAAACTGCAGGCCTTCCCCTTGCAGGTTCAGAACATGTTAAGGTAAGTT 

CACTGCAGAGATAAACAGGGTGAGGACAGGCCAGTCCTAAAAAAAAAAAAAGAAAAAAAA 

A 

Sequence 267 

CCGTTCTTTTTCCNAAACTAAAGAATGCATAGGACATAAGTTAAAAGTTCATACATAACC 
TGGCTTCAAATCCAGTTCTACCACTACCTGAAAACATCAGTTTATTTCTCATCAATGGGT 
TGTTATAAAGTACCTAGCATAGGGTATTGCTTAAATGTTAATACTCCCAATCCTGACACT 
AATGTTTCAGGGAAGAGTGAAAGAAATCACATTAACTCCACATTATTGAACATCTTCTGT 
GTCAGGCTGAATACATCTTTGTATCCATTGTCTTCTGGTTGTTTAAAGACTAGTGTAGAA 
GCCTGACATGTAAATCGGTGATTATATAAGATAGTACTGTCTTGTAATGTGTTGTGCTAG 
AGAGGTTAAGTATTAATTATTATGGGAGCTCAGGAGAGGCACCCACTGGCTGCTGGGAGA . 
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AGAGCAAAAACCTATATATAAAGCMTTTTTMGATTTTGGTTCTATAAGGGTAGAATGT 
TCGATCCATTAGTGTATTCCCAAAGTCCTATACACATGTCAAGATCGGGAAAGCTCACAC 
ACACAATAATGCCCAAG 
Sequence 268 

CGTCCGGCCCCATGCCCAGCCAAGTTATGTTGTTTGGTCMTTTTAGATAATTAAAAAAA 
AAAATCCTATGTAGTAACATCCTACTAGGAAAGAAGATAAACTCAANTATCCACTGAGTG 
GTGACCfGAAACCAAAGAGACATGGAGGGGCAGCCAGTCAAAAGCCACTGCACTCCAGCC 
TGGGCAACAGAGCAGGACTGTGTCTGGAAAAAAAAAAAATCTAGAAACATACTCAAATGT 
AAACATAAGAACTTAGCATATGGTAAAGTTGTGACTGTGAGTCAGTGCAGAAAAGAATGA 
ATCMTCAGTAMTGGTCCTGGGAAMCCGGCTATTTATTTTAAAATAATAACAAAATAT 
GCTGTATATCATCAATGGATTAAAGATTTATATACCTTCAGCATATGTCCTTAAATGTTC 
TAGAATAAAATATATATGAATTCAATGTAATTTTTGATGTGAGGGAAGGCCTTTTAGACA 
: TGACACAAATGTCAAAAAGCATAAAAGGAAAGATTTTGAATAAATAATAAAC 
Sequence 269 

GCGTCCGACAGATACACCATCATTATAGGTGGCCAAATCATTCAAATCGCTGTTTGCTTC 

TTCTCGGGC TCTTA TTCAAGCTCACGATGCTATTCTCATAGTGATTGCGAATCTCACTGC 

ATMAGGTGATTTTGAAAACATCACTTTTCCTCCACACTAATGTGTGAGCTTAGCCAGAG 

TAGGAGATATATCTTACTTAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 270 

CCCTTTTTTNCGGCCNTNCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAA 

AGGG CTGTG TTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCAT 

CTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAG 

GTCCTGAAATAAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCA 

TTCGGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAAGA 

TGCTGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGGCAT 

TGGGATCATTGCATTGATATTAGCACTGGCCATTGG 

Sequence 271 

ACCGCGGNGGCTTCATGCAAGCTGTGGGCATGGNCAACGATCACGAAAATCATTNTTCCT 

TTAAATAAAATACAATCCTATNNAAAGGAGTNCTTCCATGAGCAACAATCAAATACGTGC 

TTATGCTGCGATGCAAGCAGGTGAACAACTGGTTCCTNATCAANTTGACGCAGGCGATTT 

ACAAGCCCATCAAGACGAAGTCAAAGTTGAATATTGTGGATTATGTCATTCAGATATTTC 

AAGTGATTAATAATGATTGGCCGATCATCTACTTATCCTGTAGNCGCAGGTCATGAAATC 

ATTGGAACCATCACTGCTCTTTGGTTCATGAAGCGAAAGGACTCAAAG 

Sequence 272 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

AAAATTTAAGTATGAAAGAAAGAAAAAACAAAANGTTGAAAAAGAAAAACAATCTTTCAC 

AMCAATAGAGAAATTCGTTTATCTTTTGGAATCAATTTAAATGATATAAAAATCAAAGC 

AAAAAMGCCMAGMTTTTTATTAGATAACCGACAGAGTMAAGTGGCTCTTCGTCTTA 

GAGGGCGTGAAAATACAAGACCTGMCAAGGTAMTTAATTTTAAATTCTTTTTTTGATG 

MGNAAAATCGATTGCAMATTAAAGTAAAGMNNGCAATCAGTTTGGTAATTTTTTAAC 

TCTTCATATTGMCGTGATAAGAAAAMTTACCCAMTTTACTTCTTCAAAACAAATAAA 

GGMTTAATTGATTTTGAAAAAACTATTNAAGGAAGGAAGAAACTAATGCCTAANGCCAA 

AAACAAAATCGCACTTTAAAAAAA 

Sequence 273 

CCCTTAGCGTGGTCNCGGCCTTTGTACGMATTATGACTGTTTTAGCTNCAGGAACAGAT 

TTAAMGTAATTGMGACGTTTTAAAATCTGTTTTTGATGCTMGMCATCGAAAAAATT 

G AAAM CTTGAMGMCAGAGTTAGCATACGMATTAAAAAACACAMCMGGMTTTTT 

GTTTTAGCTAACTTAAAATCTGAGGAMGTTTAATCGAAGAATTTGTCAGAAGAGTAAAT 

ATTCTCMAAMCAAGTTTTAAGATTTTTAGTTATTMTCTAGATTCTGAAAGAGGAATC 

CACAAMCTTTCAGACCTAGAAAAAATGATAMCACAAATTTTTCTCTTCTAAAAAACCA 

ACANCTTCAACAGAAGAAGGTAAAAGTTTTCAAAAACCATTTGTCAAAAAACCTTTTGTT 
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AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGANCAAGACAAAGTGCTAAGAAAA 
CCAAAAACTGTAAAAACCAGCAAAAGATCCTAAAGTAGCTCACACAGCAAAAAAAAAAAA 
NAAAAAGTCCTGC 
Sequence 274 

CCCTTAGCGTGGTCGCGGCCGAGGTACNAMTTATGACTGTTTTAGCTCCAGGAACAGAT 

TTAAAAGTAATTGMGACGTTTTAAAATCTGTTTTTGATGCTMGMCATCGAAAAAATT 

GAAMACTTGAMGAACAGAGTTAGCATACNAAATTMAAMCACAMCAAGGMTTTTT 

GTTTTAGCTAACTTAMATCTGAGGAAAGTTTAATCGAAGAATTTGTCAGAAGAGTAAAT 

ATTCTCAMAAACAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCTGAAAGAGGAATG 

CACAAAACTTTCAGACCTAGAAAAAATGATAAACACMATTmCTCTTCTAAAAAACCA 

ACAACTTCAACAGMGAAGGTAAAAGTTTTCMAAACCATTTGTCAAAAAACCTTTTGTT 

AAAAAATCAGAAGAAACAGATTCTTCAAAACAAAATGAACAAGGACAAAGTGCTAAGAAA 

ACCCAAAAACTGTAAAACCAGCAAAAGATTCTA 

Sequence 275 

CCCTTAGCGTGGTCGCGGCCGAGGTANTTNCCTGANCAGTCGAAGTGGATGCCCAGACCA 

ATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAAT 

GGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATT 

TTC AAAT CATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGA 

TCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCT 

ACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCAC 

AAGTGTTACTGGCTTCCCATAAACACAAGCCCTTTCCT 

Sequence 276 

CCCTTTCGAGCGNCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

MTAGMTACATCAGAGATGTTAAAGATTTCCCMTTGAAGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTT 

AGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACC 

TACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGG 

AGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAAC 

TAATATGTTGAAAAMGGCCAMAAGTAGCMTTATTGATGATGTTTTAGCTACTGGCGG 

AACAATGAAAGCGATTATTAACTTAATCGAATCTCAAGGTGCTGGTTGNTCATAAAGTAA 

TCTTTTTACTTGAATTANGATTTTTAAACCGGAATTGAAAAACTTAAAA 

Sequence 277 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTTCTAANGTTAAATCCTGAGGTAAGTCAACC 

MMCAGGATGTGMTATCCTMTAATAATTCTAAAGTCTTATCTTTTAATGTTGCTCTG 

TMCCMCACCTTTGATTTCTAATTCTTTAGAAMTCCTTTANAAACTCCGGTTAACATA 

CCTTGTAATGATGAATTTGTAGTTCCGNGTAATTGNTTAATAI I I I I I I CTTCTGATGTT 

CTTTTAGTAAGTAGAGTGTTTTCAACTTTTTCMTTGAMTAAATGATGAMATTCTCTT 

GATAATGTTCCTAATTTACCTTTTATAGNTACTNMGMTTGTTAATGTTTACTTCAACA 

CCTTCTGNTATTTGTMMCACGATTTCCGACACGAGACAATATTAATTATCAAATGTAA 

GCAATGGATTTCACCACCTACATTTTTCCCTTTCTTGCNTTGTATAATTAGGTTNAATAA 

ATCCTTT 

Sequence 278 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGATAGTNNTCTCAGTAAAAGGTCTATTATCT 

AACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCA 

CAGCTTGAAAAGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGC 

AGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGANCCAACAATAAAAATAGCACT 

AGCAATAANAAGAATGCCATCCCATGGAGCACACCATAAT 

Sequence 279 

CCGAGGTACTAAACTCTNTTTAATGNNCTAACGTCATA I I I I I I AAGTTTTTCAATTCCG 
TTTAAAAATCCTMTTCMGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCG 
ATTAAGTTAATMTCGCTTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCT 
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ACTTTTTGGCCNTTTTTCMCATATTMGTTNGGATTTCTMGAGTTGATTTACCATATT 

CTAAATCATACTCAAAACTAATAACGTCTCCTGGGTAATTTTTTAGGGTTTNCTTACCAT . 

AMTAAAMGGGGTTTTTTTAAAAAAAAGCTGGCAGTAAGGGTGTCCCAAACAAAGAAAA 

CCCTCTTGCGGTCTTGGGACCTATAATAACAATCTGCATCTTTAAGCTAAACTCAAGCCA 

TTTGGGATTAATTGGTGTAAATTTAAGCACTTTCTCCATTTGGCTAAAAAGTGGGGNGAA 

MTATCTTTAAAAATACAATCCCTTCAATTGGGGAAAAATCTTTTAACATCCTCTGGATG 

GNATTTCTATTAAAAATTGGATTTGGA 

Sequence 280 

TATTTGGTGAAGATCAGCGTTATCAGCATTTTCTACGATTAAACGCTGGCCATGCTTTGA 

CTGATGAAATCCGCCMGCTATTCAGCAGTTGGCGGATTGGGTCAGAGAAAGTCTCAAGT 

CGAAATTGAGTTAGTCGAAGTCTGCTTTCAATACGATGCGATAACGTGCTTGACCTGAAT 

G AAGTCG CTCAATGGCATCGTTGAGCTGTGACATAGGATAGAGTTCAATTTGCGGGGCGA 

TATTTTTACGTGCTGCAAATTTGAAGAAGTTGACGAAGTGCTAAAGGAGAACCCGTCGGT 

GAACCTGTTACTGATTTGGCACCATCAATCAAAGCACCCGACTGAAACTGGGAATAGGTT 

CTAAAAGTCAGAGCTAGAAAAATG 

Sequence 281 

NGGTACCTCCACCGCGGTGGCGGCCGAGGTACANCCTCTCGGCCCGGCTAAACATCATCG 

TCTTGGTAGGTCATTACCNTACCAACTAACTAATGNTCCGCACCCCCATTTTTAAGTGAA 

GCTGNGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAAGCT 

ATTGATTCCAATAGTTATCCCAGNCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 282 

CGANAAGCANGATTTTNAATTNTTGCAGCCCGGGGGATCCGNGGAGNGGGGAGAGCCCAC 

CGCGGGGGAGCGCCAACACGACNNAGAGCGAGNCGNAATACGCGCGCACACNGACCGCNG 

ANNAACAAACGNNGAGACGGGGAAAACCCNGGCGNCCNCAACAAAAACGCCCGNGCAGCA 

CAGNCCCCAAAAGCCAGCNAGGCGGAANANCGAACAGGCCCGCACCNAGCGNCCGNCCCA 

ACAGNAGCACAAGCCCCGAANGGCGAAAAGGG 

Sequence 283 

AAACCCCCTNAAAAAGTNCCAAGGGGCCGCCAANNCGACTANAGCGAGNCGCATTACGCG 

CGCNCACTGTTTGTCGTNNTTAACGCCGCGACCGGGNNGAACCNGGGCGNNACCCAACAG 

GAAGCGCCGCGCANCACAACCCCCNACGCCAGCAGGCCGAnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnAGNGGAGGAAACAGCGCAGCAGAGACCGNNACACANGCCAAC 

GCCCNAGCNCCCGCNCNNNACGCNAGCAACCCCACCNAACANCGNCCACGNACGACNGCN 

NGCCCCCGCNCAANCNCNAAAACGGGGGCCCCCNANNANGGGGACCCGANAAANCGCNAN 

NACGGCACCCNCCGACCCCAAAAAACNNGAAAAAGNGAGAAGGGAGCACGNAAGNGGGGC 

CAANCGCCCCNGAAAAGAACNGGCGNCCCGCCCNNANGAACGAANGGGANGACCCACGNC 

CCNAAAAAANAGGGGGACACCCGAGNNACCCAAACGGGGNAACAACACGCAAACCCNAAG 

GCCGGCCGAAACGNGAANGGAACCAACAAGGGGACACANGCCCGAAAACCGGNCCAAAGG 

GGGAAAAAAAAAAGAAGCCGNAANNCAAACAAAAAANCNNAACGGCGAAAGNNNAAACAA 

AANAAGAAACCNCNAACAACNCAAAGGNGNCACNNNNNCNGGGAAAAAGGGGCCCC 

Sequence 284 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATNGGAAACAATAGCTAATAC 

CGGATATAGNTATTTATCGCATGATGAGNAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGT 

TNAGCCCGGGCCCGAGAGGCTGTACCTCGGCCCGCCACCGCGGTGnnnnnnnnnnnnnnn 

nnnnnnnnnnAGNCGTATTACGCCGCGCTCACTGGCCGGCGnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnACTTAATCCGCCTTGCAGCACAATCCCCCTTTTCGCCAG 

CTGGGCGTAATAGCGAAAGAGGCCCCGCACCCGAAC 

Sequence 285 

CGTCCGTNAGGCCTGTNCTTACGGCTGGGTTTAGAAACCAGCCCATTCAGAAAGACTGAA . 
TCAGAACATGGATNAAGTGAACTNATTCTAAGATGACTCGNNTATCCATGTNGATTAATC 
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TNCTGGNTCATAATAGGCCTCTTCCCTTTGATTGAAGGGTCACGTNTAAGTATANAAAAC 

ATAAAACTGTAAGGTAGAGGAAGCGAAGGATAGCTTNGTATTAATGTTGCGTTAAAGCTT 

CAGAGACAAGAACAAGAACACTCCTCCCACGTGACAGCATTTGAATAGGAGGCGGNGGGT 

GCNGCAGCCTGGGCAGCTTCAGTCCCGATTTACAATAAAGTACCTTGNGNGTNATTAGTT 

CTTAAATGTTTATTTAGAAATGGCATTGATGTT 

Sequence 286 

GCGTCCGGTCACCGCACTGAACTTCGGGACCAAAAGCTTCCATGCCGCGGNCCCCAGACA 

AAGGCAGCTTCCCGCTGGACCACTTCGGTGAATGTAAAAGCTTTAAAGAAAAATTCATGA 

GGTGTCTCCGTGACAAGAACTATGAAAATGCTTTGTGCAGAAATGAATCTAAAGAAGTAT 

TTAATGTGCAGGATGCAAAGGCAGCTGATGGCACCGGAGCCGCTAGAGAAGCTCGGCTTT 

AGAGACCTAATGGAAGGGAAGCCAGAGGCAAAGGATGAATGTTGAGAAGGGAGCCACAGG 

ACCTTGTCCCCAGCCTGGAGCAGAGCTGAGCCCTTCTGCCCACAGNGCAGGGGGACCTGA 

CACTCAGCCCGTGCTGGCCCGTGCAGGGGCTCTCCCTGGG 

Sequence 287 

CGTCCGGTGCACTGCAACTTNTATATNTAACCCTAAACTCCAATAAAACAAATTCCAGGG 

AAAACCAAGGGTGTAATGGGATGTGCGTGTTTATCAGGAGTGTGCTCTCACGTGGATGCT 

GAATGATGGAGGACAGCGGACTGCATAAGCCAGAAACCTGTACGGGTGCTGGCTGTGGAA 

AGACGTGTCTGTCTCTATCTATGTACAATAGTTNATTCTGTCAGGCTGAAAAAGTATGGT 

CTTTAGGACCTTGCCCCTCTAACTATAGAAACTTAAACAGTGTACTGCTATTAGATATAT 

CTGATATTAATAGAACATGCCAAGTGCAGGTCCCAAATGCGTATTTGTGAAGCACACATC 

TGAGTAAATGGCTTAGATGGAAAGCAAGTCATCATGAGTAAAAATTAAGCCTCAAACCTG 

CCGGTGCTCCTCACCTCTTTGTCACCAGGTAAAGGTCACACTGTGTGTTGCTTTTGNTGT 

CTTCTCTTCCTAACCTAG 

Sequence 288 

CNCGCGTCCGGGGCTTGTTTACTATGGCCGATGATCTGGAGCAGCAGCCTCAAGGCTGGC 

TGAGTAGCTGGCTGCCCACGTGGCGCCCCACTTCCATGTCTCAGCTGAAGAATGNGGAAG 

CTAGGATCCTCCAGTGTCTCCAGAATAAGTTCCTGGCCAGATATGTGTCCCTCCCAAACC 

AGAATAAAGAT CTGG ACGGTGACTGTGAGCCCCGAGCAAAAGGACCGCACCCCCCTGGTG 

ATGGTGCACGGTTTTGGGGGCGGCGTGGGCCTCTGGATCCTCAACATGGACTCACTGAGT 

GCCCGCCGCACAGTGCACACCTTCGATCTGCTTGGNTTTGGGCGAAGCTCAAGGCCAGCA 

TTCCCGAGGGACCCGGAGGGGGCTGAGGATGAGTTTG 

Sequence 289 

NGGAAAGCCGGCATAAGTGACATNGTTTGGGCAGTTGCCNGCTGGACTGAAGGGCNAACC 

CANACCACCTTAAGCCATAAAAGCCCGTGACACTGCTANCNAAGGTGCCTTGCCCACCGC 

TTTGCCACCNGTCCCGGAAATGNAAAAAAAGTCGCGTGCCNTAAAAAAGCTGCCGGAAGG 

NCCTGGGTGNACNTTTGGGCCACCCCCACCCCGCTGGCAAGGNCTTGAATTGNGTNACNC 

CAAAAGACGCCCANGCCGGACCTTGGNAAANNATTGTTNTTTNGGGANAAAAAAAAAATG 

GAIMCCCGNTGGGGGAGGCCCTTGGGGGCATTGNGNAAGCCCCCGGAGGGTTCCCCGNTGT 

TGGCNGGGGTCAAATTCAAAGGCCAGTGGTNGGCCACCCCGGGGAACCTGGNNGCTGTTG 

CAAGNACCNNGGTGGGGAACCGTTTCAAGGAATACCCCAACCAACCCANGTAGCCACCTT 

AAGGTAATTTGGCCCACCTTGCCCACCANAANGNGCCATTGGGGAAGAAACCACCAAAAA 

CGTTCCCCCCGGGT 

Sequence 290 

CCCCGCGCTCAGTATGACTCTTTAGTCCCAGTTTTCATGGGTAGTCTCTAAATTCTTTAC 

C TTTA TGTGATTGTGAGTTGGGAGGTGGTGGGCATCATCTTAGTCCATTTACCTTTTTCA 

GTTTTGTAATT ATCG TCTTCATCTACTACCTTTATATAATATAAAGGGAAGGGGTCTTCC 

TTTACAAATAGTTTTATCATCCTTCTCTTTTTGATGTCTATATCTTCTA I I I II I GAGGA 

GAATATATGTTGTATAGACCTACACGTGGGTGGAAGAAGAGTCATGTATGTGTGATTGTG 

TGAGATCCAGMTGTTGGAACTTTTMTTrCTTATTTTGTACTTATAATTACCTGCTGGA 

ATACCTGGTTGCATTCTGTATATTGTACCCTCATTTAAAAGTTCATGGAGGCAAAATAAC 

TCTGTTGCACATAAGGCCGGGGCTTATGCATGTCCTATCGGATGTGGGCTCAGATCACGG 
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AAGACCCGAA 
. Sequence 291 

TCTTGCAGACTCAAGCTCCGCGCGCAGCCGCTCCTGGTGCGGGCCCACAGCAGCCTGGGC 

CCCGGNTCGGCCCCGGAGCCCCCTGGCCTGCGACGACTGTTCCCTTCGATCGGCCAAATC 

CTCCTTCAGCCTCCTGGCGCCCATCCGCAGCAAGGACGTTCGCAGCAGGAGTTACCTGGA 

GGGCAGCCTNCTGGCCAGNGGGGCCCTGCTGGGGGCAGACGAGCTGGCCCGCTACTTCCC 

AGACCGGTACGTGGCGCTCTTCGNGGCCACCTGGAACATGCAGGGCCAGAA 

Sequence 292 

NGGCCCGCCCGGGCAGGTACTAAACTTTAATTAATGAGCTAACGTCAT AI I 1 1 I IAAGTT 
1 TTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCAC 
CTTGAGATTCGATT AAGTTA ATAATCGCTTTCATTGTTCCGCCCAGNTAGCTAAAACATC 
ATCAATAATTGCTACTTTTTGGGCTTTTTTCMCATATTAAGTTTGGATTTCTAGAGTTG 
ATTTACCATATTCTAAATCATACTCAAAACTMTMCGTCTCCTGGTAATTTTTTAGGTT 
TTCTTACCATMTAAMGGTTTTTTTAAAAAAGCCTGCAGTAAGGTGGTCCCAAACAAGA 
AAACCCTCTTTGCGTCTGGGACCTATAATAACCATCTGCATCTTTAGCTAACTCAAGCCC 
ATTTGATTAATTGGNGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGGTGAAATAATCT 
TTAAAATACAATCCCTTCAAATTGGGGAAATCTTTTAAACATCCCCCGGCGGTACCTCGG 
GCCCGCTTCTAGAACTAGTGGGATCCCCCGGGCCTG 
Sequence 293 

GCTCCCCGCGGTGGCGGCCGCCCGGGCTGGTACGCGGGGAAATGCAAAAAATCAAATCAA 

TTTMTACGMTACATCATGAGATGTTAMGATTTCCCAATTGAAGGGATTGNNTTTAAA 

GATATTTCACCACTTTTAGCANATGGAGAAGTGCTAAATTACACAATTAATCAAGTGGCT 

GAGTTAGCTAAAGATGCAGATGTTATTATAGGGCCAGACGCCAAGAGGTTTCTTGTTTGG 

GACACCTACTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATT 

ACCAGGAGACGTTATTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAAT 

CCAAACTAATATGTTGAAAAAAGGCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTAC 

TGGCGGACAATGAAAG 

Sequence 294 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCTAATTG 
ATCCTGTTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGA 
TTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATG 
CACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGTTTCTTTT 
GCACATGCTACATTGTGCTTTAGACCGGGAGATAATACAGGNGCCTTACCTTACAAATTN 
ATNT TTNTGGCMCNCNAATTNTNTNGAMTTTTNNTTTAATTTNAAAACCCCAAACCAA 
TTTTCCNNCNCAAAAA I I I I I I I I I IGGGAAAAATTAANTTCTTTAAANNNAACCCCCCN 
AAAAATTATNGGNGNNAAAAGGGNGCCCCNTTTTGGGCCC I I 1 1 I 1 1 1 I 1 1 I CNCGGGNG 
GGGNAAAAAATTTNAAAAAAAN I 1 1 I I I I I GGGNNCCCCGGGAGAAAAANNTCCCCTNTT 
1 1 I I I I I ICNGGGGTTTTAAAAANGGGGGGNNAAAAATNTTTNNTGGCCCCCCCCNTTTT 

Sequence 295 

TATAGGGCG AATTG GAGCTCCCCGCCCGNGGTCCCAAATGGAAGTGTGAAAACCANGGCC 

CATCCCCCNNTTTTNTAGAGGGGTGGTAAAAAATAAACCCANANATCAAGGGGAGAAAGG 

AAAAGGATGAAAGGACAAACTGCCAAAMMTTTNCCCAAAGTGGCGACTTTTTTAANTN 

TGGGAGCCAGAATTCTGAGGGCTTTGCATTGTCTTTGCAATTCNCTCAAGGAGCCTGAAA 

TTGAAAAAAMTGCCAACAAGGCCAMTNAACTACTTTTTAGGAGGGGGTTTTGGAGGTC 

TTGGGAAGCCTCATTCCCNTTCAACCNNTCNAATTCTGGGAATGGGGGAAATGGAAAGAA 

TAGAAGATGTTGGGTGCCCACTAGGCTACTGNTTGAAAAGGGGAAGCTTGAAAANTTNCT 

NCACCCAAGGTTGGGTATTCAAAAATATTGTAATGGACTGGGTATTGGCAAAAAGG 

Sequence 296 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACAGGTGGGTCCCTTTTCAGAGGT 
TGGGCCTTCTAGACCTCACCTGTTCTCACTNCCCTGGTTTAAATTCAACCCCAAGCCATG 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

54/467 



GCCAATGGCCAAATAATAGAAATTGGTTCCCTACCCAGCTGGACCAGGGGAGGGAGGTCT 

TGTGCAGTTTCTTGACCACTTTGTTGGTTGGACCATNGGCTTAAATACCAATGGGGTATT 

CGGCTTGAGACCTAMGTTTGTAAGAAAATTNAACCAAAATGGTGCCTGCTTGGGTTAAA 

AATGGGCTACCACCTCMTCTGGACTTCMTTCTTTTMTTCTMTTTTAAGTTTGGGTT 

TTGGTATTCTTTGGCCTAAAGGTGGCGGTAGTCCCMCCTCTTTGGGTANTTACCCCTTC 

CTAAATAGGTCAATACCTAGGTAGGTCAATACCTCCCTGGGTGGTAAGGNGGTATTTCTT 

CTTAAAAAAGCCTTTTAAAA 

Sequence 297 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACT 

CCTACGGGAGGCAGCAGTMGGNNTTTTCCACAATGAGCGAAAGCTTGATGGAGCGACAC 

AGCGTGCAGGATGAAGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAA 

AATAANNAAAAAAANGGTACCT 

Sequence 298 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACGCGGNGAAATGCA 

MAAMTCAMTCAATTTAATAGMTACATCAGAGATGTTAAAGATTTCCCAATTGAAGG 

GGTTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGMGTGCTAAATTACACAAT 

TAATCAAATGGCTGAGTTAGCTAAAGATGCANATGTTATTATAGGTCCAGACGCAAGAGG 

TTTCTTGTTTGGGACACCTACTGCAGCTNTTTTAAAAAMCCTTTTATTATGGTM 

ACCTAAMAATTACNAGGAGACGTTATTAGTTTNGAGTATGATTTAGAATATGGTAAATC 

AACTCTAGAAATCCAAACTAATATTTTGAAAAAAG 

Sequence 299 

CCGGGCAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCAGCA 

GTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGAAG 

TTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTACCT 

Sequence 300 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTCTAGGACAATCAGGAAG 

TAATCTTAAAAAAATAATTGAAGATGTTAAMAATACGTTAAMATMAAAACTTGTTTT 

AAACATAGATGCAGTAGAAATTGAAAAACCAGATTTAGATGCAAAATTATTAGCTGAATC 

AATTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTTGCAATTCG 

TTTAGCACAAAAAGCCGGAGCTMAGGTATTAMACTAMGTTAGTGGTCGTTTAAATGG 

TGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACACACACTTAG 

ACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTTGGAGTTAA 

AGTTTGAGTTTCATTAGGCGAAGTATTT 

Sequence 301 

CCGGCCAGGTACGCGGGGAAATGCAAAAAAATCAAATCANGNTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCAATTGMGGGATTGNATTTAAAGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATNACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTAT AGGT CCAGACGCMGAGGTTTCTTGTTTGGGACACCTACTGCAGC I I I I II AA 

MAAACCTTTTATTATGGTMGAAAACCTAAMAATTACCAGGANACGTTATTAGTTTNG 

AGTATGATTTACAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCMMAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAMGCGATTA 

TT AACTT AATCGAATCTCMGGTGCTGNTGTTCATAMGNAATCTTTTTACTTGAATTAG 

GATTTTTAMCGGNATTNAMMCTTAAAMMTATGACCGTTAGCTCATTAATTAAAAG 

TTTAGTACCTCGGCCCGCTCTAG 

Sequence 302 

AGGTACTTTGATATCTNCGCCCTCTCGTGTGTTCCTTGTGGNGNTAACCAGAGGCAAGAT 

GCCCGAGGAACTTCATGTGTATGTCTACCAGGATTTCAGATGATCTCTAATAATGGAGGA 

CCTGCTATTATTTGTAAAAAGTGCCCAGAAAACATGAAAGGTGTTACAGAAGATGGCTGG 

AACT GCAT TTCTTGCCCTAGTGACTTAACTGCCGAAGGAAAATGTCACTGTCCCATTGGC 

CATATTTTAGTGGAAAGAGACATTNATGGAACATTGNTGNCTCAAGCAACTNGNGAGCTC 
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TGNGATGGAAATGAAAACTCTTTTATGGTAGTAAATGCTTTAGGAGACAGGNGCGTNCGA 

TGTGAGCCAACATTTGNTMTACCAGCAGGTCCTGTGCATGTTNCGAACCTAACATTTTA 

ACAGGGGGATTATGTTTCAGNAGCACAGGGMTTTTTCCTTGTACGTANMTTTCACCTG 

CACGTTATGGAGAAGTTTGGCAT 

Sequence 303 

CCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAG 

ATGTTAMGATTTCCCAATTGMGGGATTGTATTTAMGATATTTCACCACTTTTAGCAA 

ATGGAGAAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAGCTAAAGATGCAGATG 

TTATTATAGGTCCAGACGCMGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAA 

AAMACCTTTTATTATGGTAAGAAMCCTAAAAMTTACCAGGAGACGTTATTAGTTTTG 

AGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAG 

GCCAAAAAGTAGCAATTATTGATGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTA 

TTMCTTMTCGAATCTCAAGGTGCTGTTGTTCATAAAGTAATCTTTTTACTTGAATTAG 

GATTTTTMACGGAATTGAAAAACTTAAAMATATGACGTTAGCTCATTAATTAAAGTTT 

AGTACCTCGGCCGCTCTA 

Sequence 304 

GCGGTGGCGGCCGAGGTACCTTNTCCGAATGCACCTTNAAAGCGGGTATTAGCCTATACA 

GGCTGTTTTAGTCGAATGCAGACCATCAAGGAAATTCNNGAATATCTATCTCAAAGACTG 

CGCATTAAAGAGGAAGATATGCGCCTGNGGCTANTCCANAAGTGGAGAANTACCTTACTC 

TTTCTGGGNTGATGAGGAATCATAAATCTGGAATATTTNGAAAATCCAGGATGAACAACA 

C 

Sequence 305 

GCNGNCGCGGGGAAATGCAAAAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTA 

MGATTTCCCMTTGNNGGGATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAG 

AAGTGCTAAATTACACAATTAATCAAATGGCTGAGTTNAGCTAAAGATGCAGATGTTATT 

ATAGGTCCAGACGCMNGAGGTTTCTTGTTTGGGACACCTACTGCAGCTTTTTTAAAAAA 

ACCTTTTATTATGGTAAGAAAACCTAMAMTTACCAGGAGACGTTATTAGTTTTGAGTA 

TGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTAATATGTTGAAAAAAGGCCA 

AAMGTAGCAATTATTGATGATGTTTTAAGCTACTGGCGGAACAATGAAAGCCGATTATT 

MCTTMTCGMTCTCAAGGNGCTGGTGNTCATAAAGTMTCTTTTTTACTTGGAATTAG 

GGATTTTTNAACCGGAAATTGGAAAAA 

Sequence 306 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAATTA 

TGAAATTATTGCAGAAAGAAGATTCACTCTCACCTGATGAATAAGTGTTCATAGGTNAAG 

GCTACAAAATACTAATTTGTTATTATTTTTAATAATAA I I I I I GTTTTGCTG AGAAAGTG 

GATTTACCACTTTTTTATTTTTTAATCCAAGGAGGAAAAATTATTTCC 

AAAAATT TTTCAC GTTCTAMCCAGTTCMGMCATTGAGTAMCAGAAATATTCCATTT 

GTCMAGTTTTTCTTATCGGCTCAGATAATGAAAAAATTGGGATAATTGAAACAAGAGAA 

GCTATTGAAATGGCAAAAGAACAAAA 

Sequence 307 

CGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGCAGCAAGCGGACGTGAGC 

GATAATGGCGGATATGGAGGATCTCTTCGGGAGCGACGCCGACNGCGAAGCTGAGCGTAA 

AGATTCTGATTCTGGATCTGACTCAGATTCTGATCAAGAGAATGCTGCCTCTGGCAGTAA 

TGCCTCTGGAAGTGAAAGTGATCAGGATGAAAGAGGTGATTCAGGACAACCAAGTAATAA 

GGAACTGTTTGGAGATGACAGTGAGGACGAGGGAGCTTCACATCATAGTGGTAGTGATAA 

TCACTCTGAAAGATCAGACAATAGATCAGAAGCTTCTGAGCGTTCTGACCATGAGGACAA 

TGACCCCTCAGATGTAGATCAGCACAGTGGATCAGAAGCCCCTA 

Sequence 308 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATGTTATTAATGTGACTGACA 

AGTAATTAGAMACTGGAAATTAMTTTTACAAACATTTTTAAAATCGCTNCAATTAAAA 

AMTTCMGATGGTTACATTATGAATATGAATGAAATGTCATTAGCGACTTCGTTAAATG 
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TTTATTTTTGTTTTTTGAGATGGAGTCTCGCTCTGTTGCCAGACTGGAGTGCAATGGTGC 
GATCTCCGCTCACTGCAACCTCCACCTCCTGGGTTCAAACGATTCTCCTGCCTCAGCCTC 
CCGAGNAGCTGGGACTACAGGCACCGNCACCACGCCCGGCTAATT 
Sequence 309 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGACCCTCCTTGATGGTTTACTTTGCAAGCTA 

TGGTGACCTCCGCAAGTTGTGTCTGGGCCCATCCAGGGCTCTGACTAATTGTATTCAAAT 

CA AGGCA GGAGCGGGCCAGCTGGCGTTGACTTAACCAAGCCATTTTATAAGCCTCCCGAT 

CATTTTTAAGCCACTCTAAGTCGTGTAGTAGGATCTGGTCAGAGTTATGTATACTCTGAT 

GGGCATGTGCTGTGTCGTCTAAAATGTCCAGMGTTCTGAAACACTTTTAGATCTTCCAG 

AATTTCTTGAGGA AGTCT GCCTAAGTAACTGATGCACATCAAGTTCATCACCGGAGGAAT 

CAAAAGMTTTCCATTTTCTATTTCTCTACAGAAAAGMAAGGATCTTCCTTTAAGATGG 

AAATATTATTTCTCTTC 

Sequence 310 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTTGAATTATCTA 

TTGAAAGMCT ACTACAT CGAGTTTTTGTTCTTTTGCCATTTCMTAGCTTCTNTTGGNN 

NAATTATCCCAATTTTTTCATTATCTGAGCCGATMGAAAAACTTTGACAAATGGAATAT 

TTCTGTTTA CTCM TGTTCTTGAACTGGTTTAGAACGTGAAAAATTTTTAGGATTTGGTT 

TGGAAATAATTTTTCCTCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAG 

CAAAACAAAAATTATTATTAAAAATAATMCAAATTAGTATTTTGTAGCCTTTACCTATG 

AACACTTATTCATCAGGTGAGAGTGAATCTTCTTTCTGCMTMTTTCTAATTGCCCCGN 

GTACCTGCCCGGGCGGC 

Sequence 311 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAAGGTACCTGACTGTGGC 

TCANATCTGCGTCGCAGCAGCGAGAGAAGAAATCACTCCATATCCGATGAGAGGAAGGGT 

GGCACAGANATGGTGTCTACAATTAGAGACATTTCTGACTCCACCTTAGCCTAAGCAAAC 

TTTATATACTGAGTMCATTTGAAGGTTGTCTTTTAATGGTGGGGGGTGNTTTTTCCT^ 

TTAAACTACAGT 

Sequence 312 

CCAAAANGGGNCCTGGGGCGTGGTCACGGCCNAGGTACAAAATTATGACTGTTTTAGCTC 

CAGGAACAGATTTAAMGTMTTGMGACGTTTTMAATCTGTTTTTGATGCTAAGM 

TCGAAAA AAATT GAAAAACTTGAAAGAACAGAGTTAGCATACGAAATTAANNAGCACAAA 

CAAGGAAl I I I IGTTTTAGCTMCTTAAMTCTGAGGAAAGTTTAATCGAAGAATTTGTC 

AGAAGAGTAMTATTCTCAMAMCAAGTTTTAAGATTTTTAGTTATTAATCTAGATTCT 

GAMGAGGAATGCACAAAACTTTCAGACCTAGAAAAAATGATAAACACAAATTTTTCTCT 

TCTAAAAAACCA 

Sequence 313 

AATNGGGCCTNGCGTGGTCACGCCCAGGTACNAAATTATGACTGTTTTAGCTCCAGGAAC 

AGATTAMAGTATTGAGCNTTTTMTCTNTTTTGATCTAAGACATCGAAAATTGAAAACT 

TGAAAGACAGAGTTAGCATACNMTTTAMNGCACAACAGGATTTTTTGTTTTACTAACT 

TAM TCT GGGGM GTTTAATCNMGATTTGTCAGAAGAGTAAATATTCTCAAAAAACAAG 

TTTTAAGATTTTTAGTTATTMT CTAGAT TCTGAMGAGGATTGCNCAMACTTTCAGAC 

CTAGAAAMTTGATAMCACAAATTTTTCTNTTTTAAAAAA 

Sequence 314 

NGGGCCTNGGNGTGGNNACGANCCAGGTACTTTTACCAAAGAATCTACTAGAACTCTCTG 

CTATTCAAMCMAGAGCTCATACTTGTTGGAGTAGGGAAAAMTTAGAAATTTGACCAA 

AAGATAGATTCAATCAACTACAAAGTCAATTCCAAGATGCTGNITAACATCGAAACTCTTG 

AAAAAGMCTATTANMTCTGGAGTTGMCTiTMTGGATCATATACCTGNTTTGCTTGA 

TCAAGCGGATTGNTCAGCTTAATATTAAAGAAGATGGNATCTATTTAGATCTTACTTTAG 

GACGNGGNGGNCATTCGAGNCAAATTTTAAAAAAACTTACTA 

Sequence 315 
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CCCTTTCGA GCGGCCGCCCGGGCAGGTACTTGTCCATMTTTGTGAATATATTAACTAAT 

TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGMAAGCTTTGAGCT^ 

CTCATATCCAAACGAGCTTGTTTGTTGTATGAAAATCCTCTATTGGCTTGATCTAACTGA 

GGTGMGATACTCCTAAATCMGTMTACACCATCTACTTTGTTGATTTGMGTTTTTTT 

AGTTC TTGATCAAM TCTTTAAAATCAGATCAAATMATTCMTATTTGAAGAMTTTTT 

MGAGTTTTTCTTTTGTTTGTTCAATTGCTTGTT^ 

Sequence 316 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACC TATCT ATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 317 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGAATTATGAACTTAAAAAGCAAATTMGAAGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTT TGCT TCTTGCCAAACAAGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTMTAGTAGAGAAGGAAGGTTTTTAGCATTAGATCAAA 

TTGTTAAGCTTTGAAATGAAAGTGAAAAAGTTMMACAMGAAGAAGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 318 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 

CAGGATTCCTCATTGGTGAGTTCMCCATACATTTTGGGTAACTCCTGAAGACATCTGCA 

MTTGTGAGTTAGTTGGTGGGGTCCATTMCATTTGCATATGTTATTCTTTCTACTGAAG 

TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 

TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 

CTTGGTT 

Sequence 319 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAATACGTAGAGATMTMCAGTTTTT 

TAAAMACTTMTATTTGTTATTGMTGTATATTTTTGAGTATTGCATCTTTTCTATACT 

MTAAGGAGGTGTMT7TGAACGCTTTTAGAAAGAAAAAAGAAGAATTAGTGCAAGAAAT 

TAAAGATTTGATTMTTCTTCTTCTTCATTAGTTATAGCTGAATATCGTGGATTAACAGT 

TGCTGAMTTGAMCTCTTAGAAATGMGCTTAMGMGCAGGTGTTTTTGTMAAGTTT 

ATAMMTAGACTATTTAAAATAGCATCTAMGAAGCAGGTTTCGGAGATTTAGAACAAT 

CACTAGTTGGTCCAAATTTTTITGCTTTTGGTTCTACAGATGCANTAGCTCCAGCTAAAA 

TTATTTCAAAATTCGCTAAAMCAMTCCAGTMGTTGTATTAAAAGGCGGTATTTTTTG 

A 

Sequence 320 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACGGAATTTTAAAAATAGACTNTAAAAAACCN 

TC AATTGC AN AACT AATTTTG NTTG ATTTACTTCAAATTG A AGG AACTCTTTTAG GTTTT 

GTTGNAATAGTAAANGAAATTCGATNAGTAATTCAATCAAAGAATAATGAATNGACAAAT 

TCTGACATAAAAACAGCCAAAGAATATCATTTAGAACAACTAANAGCGCTTAAAAAAACT 

TCAMTTAACAMTA TAAT TTATAATTTATGGCACTATGAAAAAAAAGNGACTTTATTTA 

ATATCAACTTTANCATTTTCMCMTTTTATTAGCTATTNCTNGTGGTAAAAATCAAACT 

ACTCCTGTAGGCACACAACCANTTGATANTCCAAGCCTGGAGGATTCATCAAGNGAATCA 

ANAGTTCTGGATTTAAATTAATAGAAAAAAAG 



CCCTTAGCGTGGTCGCGGCCGAGGTACAATTTCTAATTGATCCTGTTCACATTCAGTGAA 

ATGGCATTGCATATTTATATGTTGCTTACAGCTTATTGATTTAGGTAACTATTGTGTCTT 

CCTTCACTATCTGACCTGAAAAGCACTCTCTTCTCTATGCACTCTTATATTCTGCCTTTC 



Sequence 321 
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TGCCTGGAGTTTGAMTACATGTCTCm 

AGACCGGAGATAATACAGTGACTTTACCTCACAAATCATATTCTGTCAACACAAATCTAT 

GMTTTAGTTTATTTAMATCAGAACAATTTCCTACMAATTTTTCTGGAAAATAGACTC 

CTAACAGACCTACCAGAATCATGCTTAAAGGGCTCCCTTGACACTTATTCTATACTGAAG 

GATAAATTTTAAAAAAT 

Sequence 322 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I I I I I I I I 1 I I i I II I I 1 1 I 1 I I 1 AGTTAA 

ATATAMTTTATTTATTAACTTT7TCAATACCTTTTGAAGGTCAM 

TTAGGTTCTTTTGAAAGTAAATTCCCACAACTGAAAGTGCAGCTAGAATAATAAATACTG 

CATATGCAGCTTCTGTTGATGCTGATTTAATTCCTGCAACTAGAACAGTTGCTATAGAAT 

AAAATGCATMCCTMTCCTCANATMGTCCAAATTGATATCCAACTTTTTTAGGATTTG 

ATCCTTTGTATTeGGGAGGMGATTTAAGATAACACCTTGAATTCCTCAAAGCATTACTC 

CCATTAAAAATCCTAAAATATAGAATAATCAAGTTCCATTACCTA 

Sequence 323 

GCGGTGGCGGCCGCCCGGGCAGGTACAAGGTAAAGCAAGAGCTGGCTCTCTACGTTCACC 

MTTTTTGTAGGTGGTGGTCGTGCATTTGGGCCTACAAATAATAAAAATTACAAAATTAA 

ATTMACAAAMAGTTCGCMATTAGCTTTTGCCTCAGCTTTTMGTCAACTTGCTCAAA 

ATAATCAAGTACCT 

Sequence 324 

NNCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTTAA 

TAGAAT ACAT CAGAGATGTTAAAGATTTCCCAATTGMGGGATTGTATTTMAGATATTT 

CACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATTANNCAAATGGCTGAGTTAG 

CTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA 

CTGNAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAMCCTAAAAAATTACCAGGAG 

ACGTTATTAGTTTTGAGTATGATTTAG 

Sequence 325 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTCCATAATTTGTGAATATATTAACTAAT 

TTTTCTTTTGAGTATTCATTTACTAACTCATMGCTGMAAGCTTTGAGCTTGATTCA 

CTCATATCCMACGAGCTTGTTTGTTGTATGAAMTCCTCTATTGGCTTGATCTAACTGA 

GGTGAAGATACTCCTAMTCMGTMTACACCATCTACTTTGTTGATTTGMG7TTTTTT 

AGTTCTTGATCAAMTCTTTMMTCAGATCAAATAAATTCAATATTTGAAGAMTTTTT 

AAGAGTT I I' ICTTTTGTTTGTTCAATTGCTTGTTTGTCTTTATCAAAGACTA 

Sequence 326 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGAACTGCCAAAGGCAACAGAAATTCTTTCTC 

CCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCMCAGCCGCCTCAGAAACCAGATTC 

AGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGGCCAGGGAAGGAGA 

TCAAGGTAGTCAAGCGGGGGTCGTCTCAGATCTGGTTGTGCTCGAGCTATGCAAATGCCT 

CTCATGGAGATGTGAGGACCTATCTATTATGATGACCAGGGCCACATCCGGAGGGGGCAA 

CAGACTTTCATCTATCAGCCCTTTTCAACCACTGATCTACTAAACTGGAAACACTCTGAA 

Sequence 327 

NGGGCCTTGNGAGCGGCCGCCCGGGCAGGTACGCGGGGGTTTAAAAAAATATTTAAAAAA 

ATGGAGGAATTATGAACTTAAAAAGCAAATTAAGMGCTTTTTAGGTGCTACTGCACTTG 

TGTTGCCAGTAGCATTTTTTGCTTCTTGCCAMCMGATTTGNNNAAGTAAATGACCATA 

MTTAGTGATTGCTCACACTTTTMTAGTAGAGMGGAAGGTTTTTAGCATTAGATCAAA 

TTGTTMGCTTTGAAATGAAAGTGAAAAAGTTAAAMCAAAGMGAAGGATTTTATCCAA 

TAACACTAAATCGACAATTTGCGCAAACTTATGCAGAA 

Sequence 328 

CAGGTACTTTAGCTCCAAAATCAGTTTGATGAGATACAGTTGCCCCTATATGAGAATGCA 
CAGGATTCCTCATTGGTGAGTTCAACCATACATTTTGGGTAACTCCTGAAGACATCTGCA 
MTTGTGAGTTAGTTGGTGGGGTCCATTAACATTTGCATATGTTATTCTTTCTACTGAAG 
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TGTGTGAGGCCACAACTGTCCCATTATGCATATCANAAACAGAAATTTGTTGAGGATAAT 
TTTGGATATTCAGCAGNGGCTGNGAAACTGGATTTGAATTACCGGGATACATGCATGCTT 
CTTGGTT 
Sequence 329 

ATAGGGCGA ATTGGA GCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGAGC 

TMCGTCATATTTTTTMGTTTTTCMTTCCGTTTAAAMTCCTMTTCAAGT 

TTACTTTATGMCAACAGCACCTTGAGATTCGATTAAGTTAATMTCGCTTTCATTGTTC 

CGCCAGTAGCTMAACATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATATTAG 

TTTGGAT TTCTAG AG TTGAT TTACCATATTCTAAATCATACTCAAAACTAATAACGTCTC 

CTGGTAAI I I 1 I I AGGTTTTCTTACCATMTAAAMGGGTTTTTTTAAAAAAGCTTGCAG 

TTAGGGTGGTCCCAAACAA 

Sequence 330 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACCTCTGATTCTCACTAGTTGAATGCAAGAAC 

TTGAMGGTTCAGGTMGTGTTTTGAAAMTTTTGACTTTCCAMCTm 

ATCTGAMCTCAGGAATCMAAMTACCGACAGGCACTGTTACTTTCAAAATTCTTTCTA 

TMGTTGAGAATGGGACAGATTTGCAGAGCAAGGGAAACTTGAACAGTTACTTCTAGTGG 

TAGGAAATGAGGTGGCTAGGATATTACCCAGCTGGTGGGTGACTTGGGCAGTGTGTTCCT 

GCTTTCAGTGGTTAGCCTTTAGCAMTCTGCTTTAGAGTGAGAGTAGAGGGCAGGCTGTT 

GTATTACAGTGCTCTTGTTTTTGTAAAATTTMTTCACTCTACTGNTATTTrGTCTCC^ 

GGGTAAAGNGNTATTTAA I I I I ICT 

Sequence 331 

ATTATTGCNGAAAGAAGATTCACTCTCACCTGNTGAATAACGTGTTCATAGGTAAAGGCT 

ACAAAAT ACTAATTTGTTAT TATTTTTAATAATMTTTTTGTTTTGCTGANMAGT 

T TACCA CTTTTTTATTTTTTAATCCMGGAG 

Ai m iCAC GTTCTAAACCAGTTCMGMCATTGAGTAAACAGAAATATTCCATTTGTCA 

MGTTTTTCTTATCGGCTCAGATAATGAAAAAATNGGGGATA 

Sequence 332 

CCGCGGTGGCGGCCGCCCGGCCAAGGTACGCGGGGGCAGAAGAGGAAGATTTCTGAAGAG 

TGC AGCTG CCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGGTA 

AATATTTTCCCTTCGTATTCGGTAGTGCTGTTGAGTCATCTTGTCCAATGCAAATCCTGA 

GMGCTATGTTCCCAAAGAGGGCCAGCTCCATTTAGTGTTTGTTTATAGCCTTACTATGC 

CTCTACC TCTGG GGGTTGTAAATCTGTTNTACCCAATGGGNGGGTTTGTTNCCCTTCCTG 

MNCAMTTTTTCTGCTTNNACACTTGGGCAMCCNTTTCCTAAATTTCATCCTCCCANA 

ACTTTGCNCCNCCNTTGGGGGGAGGTTTGGGGTTTTCAACTCCNGAANAAAAAGAGGGGC 

CCCCCACCNNAGGGNNTNNTTTNTTTTAMTTNGGGCCNNGGGGGNTTNNANTTAAAAAA 

NNGGGNTTTTNGGGGGACNNTTTTTNTTTTNACCCCCCCCCCCCTTTT^ 

Sequence 333 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACGTTGACCTTCCTTG 

CCCAAGMTGAATGATTMAAAGTTGAAAAGCTGTGTTTGTAGATCGTTCAGGCCTTCGT 

CTGCMCTATCAGGAMTATGGGTMCTCTCTTCCAAGAMCTTCCCCACTCTTTTGATA 

AGCATCTCGAAAATGTTGTTCGAACATCMTGGTGGTATTTTCTGAAGATGAAGAATTAA 

AGCAGTTCTCAAAGAAAGAAGTTCAGGGAAGTTGAAATACGCATACTCGGCATCCTTGAA 

GAAAACTATGGTGAATTGTCCTCCTTTGCTAATAAGATCCACAAGATAGCCGTTCAACCA 

GATAGAAAAAATGGGAGGTNCCTGCCCCAGGCTTTAAANTGANTNNCNCCNCAGCATCCC 

CTTGGGGATNNGGTAANNNCCNCCCCCCCNANCAAAMAAAAAATTTTTCCCCNCCNACC 

NMTTNTGAAAAACCNNGNGGGGGTTTAAAMTTTTGNGNTCNNAATTTNAAAAAAAAAA 

AN 

Sequence 334 

CCGGGCAGGTACTAGGAGATATTGATTCTAGTCAATTAGGCATTGTAGACTGTCATGACC 

ACTTAATAAAAAATTATGGACCTGMGCTCACGAGCATCCAGATTTTATTATGATGTCAA 

AAGATGCTGCAATTAAAGAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTA 
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CAATGGACCCTCCTAACGTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAAT 

TAGMGGAAAAGCTMCATTATTATGGCMCTGGTTTTCATAAAGCTGCATTTTATGACA 

AAGGTGCTTCTTGACTTGCTTTGGCTCCAACAGATAAAATTGNAAAAATGGTTGTAGCTG 

AAATCGAAGAAGGAATGGATGAATATAACTACAGCGGACCAGTTGTAAAAAGATCTAAAT 

CCAAAGCCGGAATTATTAAAGC 

Sequence 335 

CTCCCCGCGGTGGCGGCCGAGGTACCGCGGGGAMTGCAAAAAAATCAANNCAGTNNANT 

CNMT ACAT CACAGATGTTNMGATTTCCCMTTGMGGGATTGTATTTAAAGATATTTC 

ACCACTTTTAGCAAATGGGAGANGTGCTAAATTACACAATTAATCAAATGGCTGAGTTAG 

CTAAAGA TGCAGA TGTTATTATAGGTCCAGACGCAAGAGGTTTCTTGTTTGGGACACCTA 

CTGCAGCTT TTTTA AAAAAACCTTTTATTATGGTAAGAAAACCTAAAAAATTACCAGGAG 

ACGTTIvrTTAGTTTTGAGTATGATTTAGAATATGGTAAATCAACTCTAGAAATCCAAACTA 

ATATGTTGAAAAAAGGC 

Sequence 336 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATGCAA 

AAAAATCAAATCAATTTAATAGAATACATCAGAGATGTTAAAGATTTCCCAATTGAAGGG 

ATTGTATTTAAAGATATTTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTACACAATT 

AATCAAATGGCTGAGTTAGCTAAAGATGCAGATGTTATTATAGGTCCAGACGCAAGAGGT 

TTCTTGTTTGGGACACCTACTGCAGCTTTTTTMAMAACCTTTTATTATGGTAAGAAAA 

CCTMAMMTTACCAGGAGACCGTTATTTAGGTTTTNGGGTNNAGATTTAAAATTGNGG 

AAANCCCCNNT TNTAG AAATCCAAACTAATTNTGTTGNNAAAAGGGCCAAAAAGTNCCAA 

TTATTGGTGATGTTTTAGCTACTGGCGGAACAATGAAAGCGATTATTAACTT 

Sequence 337 

CCGCGGTGGCGGCCGAGGTACCAATAATAGCAACCCTGTGATTTGTCCAAGTGCCCGGGA 

GTGGAGGCCATCCTGACAACAGCTCTATGATTTTCTATGCCAATGACACAGGAGCCCAAC 

AGTTTGAAAAGTGGTGGGATAAGTCCAGGACAGTCCCCTTTTATCTTGTAGGGCTCCTCC 

TCCCACTGCTCAATTTC AAGTC TCCTTCATTTTTTTCAAAATTTAATATCCTAGGCACAG 

TGTCTGTCCTTTATTTGATTTTCCTTGTCACCTTTAAGGCTGTTCGCTTGGGATTTCATT 

TGGMTTTCATTGGTTTATACCAACAGAATTTTTTGTACCTGCCCG 

Sequence 338 

CCGGGCAGGTACCTGGAAGACTTCTCCACCTCGGGGGCCTGGCTGCCTCACAGGTATGAA 
GACAACCACCATAACTGCTACTCTTACGCACTCACGTTCATTAACTGCGTTCTGATGGCA 
GAAGGTAGACAGCAACTGGACAAGGGTGAATTTACGGAGAAGTACCT 
Sequence 339 

ATAAACTGCGGGATCTCAATGGCTTCTATGATCGTATTGAGGCAGTAGTTCCCACACTCT 

GCCCGGTGCCAGCATGAAAGAGAACAGGGAGAGTCAGCCTTTCACAGTCCTTGTCAAATC 

CCAATTACTCTGGTTGCAGATCACTTGAAGCCTGCCTTGTTTCTCTCACAAAGCTCTGCC 

CGAGAGTCCAGCCCCGCGTACCTGCCCG 

Sequence 340 

GCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACAAGATAGTCATNTCAGTAAAAGGT 

CTATTATCTAACTTGCCAAACTTGTTTANNGAGAGCCCTAAGGAACTAAAACTGCCATAA 

TGCCNrTGCACAGCTTGAMAGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAG 

TTCCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAA 

AATAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCAC 

CTNTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAG 

Sequence 341 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTCTAGGACAA 

TCA GGAAG TAATCTTAAAAAAATAATTGAAGATGTTAAAAAATACGTTAAAAATAAAAAA 

CTTGTTTTAAACATAGATGCAGTAGAMTTGAAAAACCAGATTTAGATGCAAAATTATTA 

GCTGAATCAATTGCAATTAAATTAGAAAACCGTGGATCATACCGTATGGCACAAAAATTT 

GCAATTCGTTTAGCACAAAAAGCCGGAGCTAAAGGTATTAAAACTAAAGTTAGCGGTCGT 
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TTAAATGGTGTTGATATGGCTAGATCAGAAGGATATTCTGAAGGTGAAATGAAATTACAC 
ACACTTAGACAAGATGTTAGTTATGCAACAGCAACAGCAAGAACAACTTATGGAGCACTT 
GGAGTTAMGTTTGAGTTTCATTAGGCGAAAGTATTTGCAAAGCAAAATCAAGCATATAA 
TGAAGAAGAACCAACNCACAAAAAAGGGCCAAAAAGAGCAGCAAGAGTTAAAAAAGAA 
Sequence 342 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACC 

TTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGA 

Sequence 343 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCATGAATGCCGTGCTCCAGGTGTTCAC 

AGCTGC7TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGC 

CTGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCT 

GGAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTNTTGCCAGATGACAAGGN 

GACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCAC 

Sequence 344 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATGA 

GCTAACGTCATA I I I I I IAAGI I I I I CAATTCCGTTTAAAAATCCTAATTCAAGTAAAAA 

GATTACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGT 

TCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTTTTGGCC F I I 1 1 I CAACATAAT 

TMGTTTGGATTTCTAGAGNTTGATTTACCATATTTNTAAATCATTCCTCAAAACTAATA 

ACCGCCNTCCTGGGAMTTTTTTTAGGGTTTTNTTACCCCTAAATAAAAAAGGGNTTNTT 

TT 

Sequence 345 

CCGCGGTGGCGGCCGCCCGGGCACGGTACCACTTGAATTATCTATTGAAAGAACTACTAC 

ATCGAGI I I I IGTTCTTTTGCCATTTCAATAGCTTCTCTTGTTTCAATTATCCCAATTTT 

TTCATTATCTGAGCCGATAAGAAAAACTTTGACAAATGGAATATTTCTGTTTACTCAATG 

TTCTTGAACTGGTTTAGAACGTGAAAAAI I I I I AGGATTTGGTTTGGAAATAA I I I I ICC 

TCCTTGGATTAAAAAATAAAAAAGTGGTAAATCCACTTTCTCAGCAAAACAAAAATTATT 

ATTMAMTAATMCAAATTAGTATTTTGTAGCCTTTACCTATGAACACTTATTCATCAG 

GTGAGAGTGAATCTTCTTTCTGCAATAATTTCTAATTGCCCCGCGTCCTTGGCCGCTCTA 

GAACTAGGTGGG 

Sequence 346 

CACTACTATAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGA 

GATATTGATCCTAGTCMTTAGGCATTGTAGACTGTCATGACCACTTAATAAAAAATTAT 

GGACCTGAAGCTCACGAGCATCCAGATTTTATTATGATGTCAAAAGATGCTGCAATTAAA 

GAAATGAATGAATATGTAGCAAAAGGAGGAAAAACTGTTGTTACAATGGACCCTCCTAAC 

GTTGGGCGTGATGTTTATCAAATGTTAGATATTGCAAAGAAATTAGAAGGAAAAGCTAAC 

ATTATTATGGCMCTGGTTTTCATAMGCTGCATTTTATGACAAAGGTGCTTCTTGACTT 

GCTTTGGCTCCAACAGATAAAATTGTAAAAATGGTTGTAGCTGAAATCGAAGAAGGAATG 

Sequence 347 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCGTAGAAGAAGAAGG 

AATACCTAAAGAAACAGACATAGAAATCATCCCAGAAATCCCGGAAACTCTAGAGCCACT 

GTCCCTTCCAGATGTGCTGAGGATCTCGGCAGTTCTGGAGGACACCACAGGCCAGCTCTC 

TATTCTGAACTACATCATGCCCGTTCAGTACCT 

Sequence 348 

TNCTTAGGGCGAAtTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTTTAATTAATG 
AGCTAACGTCATA I I I 1 1 TAAG I I I I I CAATTCCGTTTAAAAATCCTAATTCAAGTAAAA 



# 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

62/467 



AGATTACTTTATGMCAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTG 

TTCCGCCAGTAGCTAAMCATCATCMTMTTGCTACTTTTTGGCCTTTTTTCAACATAT 

TAGTTTGGAT TTCTAG AGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGT 

CTCCTGGTMTTTTTTAGGTTTTCfTACCATMTAAMGGTTTTTTTAAAAAAGCTGCAG 

TAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAG 

CTAACTCAGCCCATTTGATTMTTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGGTGG 

Sequence 349 

TCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAAACAGGT 

GCTCCTGTTAAAATAGAAGATCTTGCAGCTACTTCTAGAGATTTAAATTCTAAACGATCA 

ATAGCAGCGTATCCTGTTCCGGCTTTAATAATTCCGGCTTTGGATTTAGATCTTTTTACA 

ACT GGTCC GCTGTAGTTATATTCATCCATTCCTTCTTCGATTTCAGCTACAACCATTTTT 

ACAATTTTATCTGTTGGAGCCAAAGCAAGTCAAGAAGCACCTTTGTCATAAAATGCAGCT 

TTATGAAAACCAGTTGCCATMTAATGTTAGCTTTTCCTTCTAATTTCTTTGCAATATCT 

MC ATTT GATAAACATCACGCCCAACGTTAGGGAGGGTCCATTGTAACAACAGTTTTTCC 

TCCTTTTGCTACATATTCATTCATTTCTTTMTTGCAGCATCTTTTGACATCATAATAAA 

ATCTGG 

Sequence 350 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCTTCACTATCA 

CTGTCCTGTAAATTTAGTAGCC7TGGCTGGAAACACTGTAGTCGACATGATCTGATATTG 

CTTAATATTTCAGAAAGAGACAGTCTATTTTCACAATGTTTACTGGAAGCATTGGTCCGA 

GAGAMTTAGAAGAAAAGTCTATAGTTTGGGAAGAGCTTGAAAAACTATTCAGCATTTCA 

GGGTCTATCTGTTTCAGGACTGGGTCATGTTCTGTGGATATTCGGTCCATTATGACCCTT 

CCACCTCTGCCAATTCGCCTCCTTGCAAATCCTATACATCTTCTTGGGACTGTAAGTGTT 
GTAAGGC 

Sequence 351 

CTNCTTAGGGCGAA TTGG AGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTGGTGACTGTA 

ATTAACAACAATGTATTTTGAMGTCACTGAGTAAATTTTAAGTGGTTTTTCCAAAAAAG 

CACATAAGGTAATGCACCGTTAATTAGCTATATTGAGCCATTCCACAATGTATAGATATT 

TC AAMCATGTTA CACATGATAAATCCAGTTTTTCTACGTCATTTTTTAAAATTATATTT 

AATi l l l l l ATTTTGAAAGTTTTTTCACAGATC I I I I I I I I AGTATTATTACCTTCTGAT 

ATATGTGTCATTATTGAAGAACCATACCTTTTAAAGGTATTATTTTGTAATTAAGGTATG 

TCAACAGTAAAAAATAACCAGTGGCCCAGGCCATNGGGGCTCATGCCTGTAAATNCCAGC 
AC I I I I I I CGGAGGCCCG 
Sequence 352 

CCGCGGTGGCGGCCGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCT 

TGATCCCCAGAACTGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCT 

TCAAACAACCTCAGCCAGAGTGTCAATTTCTGTGAGAGAACAAAGATTTGGGGCACTTTC 

MAATTMTGAAAGGTTTACAMTGACCTTTTGAATTCATCTTCTGCTATATACTCCAAA 

TATGCAAATGGAATTGAAATTCAACTTAAAAAAGCATATGAAAGAATTCAAGGTTTTGAG 

GTCGGTTCAGGTCACCCAATTTCGAAATGGAAGNCATCGTTGCTGGGTATTGAAAGGTTT 
GT 

Sequence 353 

CCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTTGTTTTATATTCT 

TCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGACAA 

AAACCAAAAAAACAAAAACTTATTTTAGAAAGACTTGGATAATGATTGATTATAAAAAAA 

TGATTGACCATACTTTATTAAMCCAGAAGCTACATCTAAAGATATTTTAAAACTTATTT 

CACMGCTAAAGMCATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATTAG 

CAAAAGAAAMCTTGCAAATACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATTAG 

GTGCATCAAACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACATGGAGCTA 

CTGAAATAGATATGGTTATAAATGTGGGTAAATTCAAA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

63/467 



Sequence 354 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGCATGAAACCACACA 

TACGGGCAAAAGCAGGAAGAGACTTTGCAGACATAAAAACCAGGTAAGTTTTGAGAGTCA 

GGTGATTGCAAGAI I I I I I I I I CTTTCATTGACACTTAGCTAATTTTGAGCCATATTTTT 

AMTGTGTGTCTT TTCAG TTACTTGAAACCTCTTCTCCCACCTCCCATCCTTTCCAAACC 

TCCTCCAAGCCAATTTTCCAGTTGGTTTAAAAGAAATCATTAGGGTGACCCCCACTCCCA 

CTTTCTGGAACCTCAGGCAGAGCTCCTGGGGTTCCTCC 

Sequence 355 

NNI I I I I I I I I I I I I I I I I I I I IGCGTGGAATGCTTCACCTCTTGCTAAAGGAACAGTAA 

TGTCATATCCTGTTAATCTTGCAGGAGGCGCTAATAAGAATTCAAATGCT7TCTCATTTC 

TCTTGT AATAATT TCAGTGAACTGAAAATGATTTAACAGCTTCGTGAACAACTAAAAGTC 

TTCCAGTTTTTTTAACAGAGTTAATAATTGTATCTGTATCTAAAGGAGAAATTGTTCTTA 

MTCMTTMTTCTACAGAGTATTCTCCATTTAATTGTTTTAAAGCAGCTAGTGCTTCGT 

GAACTTGTGCTCCATATGTTACTAATGTTAAATCAGAACCTTCTACTAATACATTTGCTT 

TACCAATTTC AACTTCAT AAATTCCTG CTG GAG CTTCTTGTTTG AATG AACG ATAAATTT 

TCTTAGGTTCTAAGAAAATAACTGGATCTGGGTCGTTAATAGCTGCGATTAATAATCCTT 

TTGTATCATAAGGAGTTGAAGGCATAACAACTT 

Sequence 356 

AGGGCGA ATTGGA GC TCCCC GCGGTGGGCGGCCGAGGTCTAAACTTTAATTAATGAGCTA 

ACGTCATA I I I 1 1 I AAGTTTTTCAATTCCGTTTAAAAATCCTAATTCAAGTAAAAAGATT 

ACTTTATGAACAACAGCACCTTGAGATTCGATTAAGTTAATAATCGCTTTCATTGTTCCG 

CAGAGCTAAAACATCATCMTAATTGCTACTTTTTGGCCTTTTTTCAACATATTAGTTTG 

GA TTTCTAG AGTTGATTTACCATATTCTAAATCATACTCAAAACTAATAACGTCTCCTGG 

TAAI I I I I I AGGTTTTCTTACCATAATAAAAGGTTTTTTTAAAAAAGCTGCAGTAGGTGT 

CCCAAACAAGAAACCTCTTGCGTCTGGACCTATAATAACATCTGCATCTTTAGCTAACTC 

AGCCATTTGATTAATTGTGTAATTTAGCACTTCTCCATTTGCTAAAAGTGGTGAAATATC 

TTTAAAT ACAATCC CTTCAATTGGGAAATCTTTAACATCTCTGATGTATTCTATTAAATT 

GATTTGATTTTTTTGCATTTCCCCGCGTACCTGCCCGGGCGGC 

Sequence 357 

GAATGGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCT 

ATTATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATG 

CCGTGCACAGCTTGAAMGCAATTAGAGTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTT 

CCTCAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAG 

Sequence 358 

AGGTACANGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAATTAAGA 

GTAAGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCAG . 

GAGGGGAAGGGAAATATAACGGAACCCAACAATTAANAAATAGGCAAATAGCCAATTAAA 

GTAAGGAATGGNCATCCCATGGGAGGCANCAACCATTAATTTCTTGGGAACCCACTNTNT 

CCCNGGGATTCAGGGCTTCCATTTGCTTCACNGATGGCTTCACGTCTGGNGCAGCCCGCC 

AACTCTTACTTTGCCAGGAAACCCTCACCTCAACTTTGCCCAGGGTATTTCTNCCCCGGG 

TCTTGGAAANGAAAATGGGCTTCNTCCACCTGAAAAAGGGTTNGAATCCTTTCTTCCCAT 

TACCCAGGCTTTCCNTTTAAAGCCAMAAGGCAAATTCCCTCCTTTTTTGGCTTTTCCT 

Sequence 359 

CTAATTGATCCTGNTCACATTCAGTGAAATGGCATTGCATATTTATATGTTGCTNACAGC 

TTATTGATTTAGGTAACTATTGTGTCTTCCTTCACTATCTGACCTGAAAAGCACTCTCTT 

CT CTATG CACTCTTATATTCTGCCTTTCTGCCTGGAGTTTGAAATACATGTCTCTTTAGT 

TTCTTTTGCACATGCTACATTGGGCTTTAGACCGGAGATAATACAGTGACTTTACCTCAC 

MATCATA TTCTG TCAACACAAATCTATGMTTTAGTTTATTTAAAATCAGAACAATTTC 

CTACAAAAT7TTTCTGGAAAATAGACTCCTAACAGACCTACCAGAATCATGCTTAAAGTG 

CTCCCTTGACACTTATTCTATACTGAAGGATAAATTTTAAA 
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Sequence 360 

CCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGTCGTGCACAGCTT 

GAAMGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACC 

Sequence 361 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I I I GTTTTATATT 

CTTCAAATCCTATTGATGAAAAACTTAAAGATGAATTAGAAAAATCTTATGTAATTAGAC 

AAAAACCAAAAAAACAAAAACTTATTTTAGAAAGACTTGGATAATGATTGA7TATAAAAA 

AATGATTGACCATACTTTATTAAAACCAGAAGCTACATCTAAAGATATTTTAAAACTTAT 

TTCACAAGCTAAAGAACATGGATTTAGAGGAGTTTGCATTAACTCTTCTTGAGTTAAATT 

AGCAAAAGAAAAACTTGCAAATACAGATTTAGATATCGTTTCAGTAGTTGGCTTTCCATT 

AGGTGCATCAAACACACAAACCAAGGTTTTTGAAGCAAAATTAGCAGTTGAACNTGGAGC 

TACTGAAATAGATNATGGGTATAAATGGTGGGTAAATTTCAAA 

Sequence 362 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCT 

TGGTAGGCCATTACCATACCAACTAACTAATGTTCCGCACCCCCATTTTTAAGTGAAGCT 

GTGAAGCTCCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTG 

TTTCCAATAG7TATCCCAGTCTTAAAGGTAGGTTAGGTACCT 

Sequence 363 

CACTACTATAGGGNTNATTGGAGCTCNCCGCGGTGGCGGCCGTGCTGTTGCTTGGCCGCG 
CGCCAGGC TGGC CAGAGGTGCTGTTCCACTGGGGCGTGGCCGTGATGGTGTGCGCCGATC 
GACGTTCTGTTTTAATGAAAGGATGGACATGCAACTGACACTCGCACAACTCTTGGCAAT 
CATGCCCAATGCCCGCTCCAAAGCGGGTATTTTTTTGCCTGCGCTAAACGTGGCCATGGC 
GGAATTCGGTATCAACACGTCGGCGCGCCAGGCCGCGTGGCTGGCCACCATCGGTGTCGA 
G TCCGG TAGCCTGCAGCGGGTAGAGGAAAACTTGAACTACCGCGCGGATCGCCTNCTCGT 
TATTTTCGGAAAATACTTCACGCCGGCGTTGGCCGCAGCTTATTGCCGGCAAGCCGGAAA 
TGATCGNCAACCGTGTTTACGCCAACCCGCATGGGGGAAACG 
. Sequence 364 

AGGTACTAAACTTTMTTAATGAGCTAACGTCATATTTTTTAAGTTTTTCAATTCCGTTT 

AAAAATCCTAATTCAAGTAAAAAGATTACTTTATGAACAACAGCACCTTGAGATTCGATT 

AAGTTA ATAATC GTTCATTGTTCCGCCAGTAGCTAAAACATCATCAATAATTGCTACTTT 

TTGGCCTTTTTTCMCATATTAGTTTGGATTTCTAGAGTTGATTTACCATATTCTAAATC 

ATACTCA AAACTMTAACGTCTCCTGGTMTTTTTTAGGTTTTCTTACCATAATAAAAGG 

1 1 1 1 I 1 1 AAAAAAGCTGCAGTAGGTGTCCCAAACAAGAAACCTCTTGCGTCTGGACCTAT 

AATMCATCTGCATCTTTAGCTMCTCAGCCATTTGGATTAATTGTGTAAATTTAAGCAC 

TTCTCCATTTGCTAAAAGTGGTGAAATATCTTTAAATACAATCCCCTTCAATTGGGGAAA 

TCTTTAAACATCTCNGGATGGTATTCTATTAAAATTGAATTGAA I I I I I I I GGC 

Sequence 365 

CCGCGGTGGCGGCCCGAGGTACCAAAATAAAGGGTATTTGCTACCTTTAATACTTGCCAG 

TTCAGGTTGGAGGCACAGGCAGCAGCAAGAATGGAAAGAAATGTTCTTACAACATTTTCA 

CAG GMAT GTCCCAGTTAATTTTGAATGAAATGCAAAAGCTGAATATTCCAGTTTATTCA 

ATGATTTTGTTGAATCTGAA I I I I I I I IGATTGATGGGGATTCATTACTTATCACATGTA 

TCTGTGAGATATCATTTAAGCCTGGGCAGAACCTCCATTTCTTCTATCTGGTTGAACGCT 

ATCTTGTGGATCTTATTAGCAAAGGAGGACAATTCACCATAGTTTTCTTCAAGGATGCCG 

AGTATGCNTATTCMCTTCCCTGGACTTCTTTCTTTGAGAACTGCTTTAATTCTTCATCT 

NCAGAAAGAATACCCCATTTGATGTTCGAACAACATTTTCGAGATGCTTATCAAAAAGAG 

TGGGGAAAGTTTCTTTGGAAGANGAGTTACCCCATATTTNCTGATTGTTGGCAGACGAAA 

NGCCTTGAACGATCTACAAAACCNCAGCTTTrTAAACTTTTTAAATCNTTCAATTCCTTG 

GGGCAAAGGGAANGNTNAACNTTTGGTACCTTGCCCGGG 
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Sequence 366 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACGCGGGGAAATGCAAAAAAATCAAATCAATTT 

MTAGMT ACAT CAGAGATGTTAMGATTTCCCMTTGMGGGATTGTATTTAAAGATAT 

TTCACCACTTTTAGCAAATGGAGAAGTGCTAAATTCACAATAATCAAATGGCTGAGTTAG 

CTMAGATGCAGATGTTATTATAGGTCCAGACCCMGAGGTTTCTTGTTTGGGACACCTA 

CTGCAGCTTTTTTAAAAAAACCTTTTATTATGGTAAGAAAACCTAA 

Sequence 367 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TMCTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 368 

ACCGCGGTGGCGGCCGAGGTACGGCCACACTGGGACTGAGATACGGCCCAGACTCCTACG 

GGAGGCAGCAGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGNGTG 

CAGGATGAAGTTNTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAANAAAAAAAAAAA 

AAAAAAAAGGTNCCTGCCCG 

Sequence 369 

GGCGGCCGAGGNACAATATAGNCATCGCNTTAAACNGCCNANTNTTAANCNCGCCAAACT 

TGGTTACTGAGAGCCCTAAGGAACTAAAACCGCCATAATGCCGGGCACAGCTTGAAAAGC 

AATTAGAGGAAGCAAGANNAGNNNTTCCTCCCTTCCAGNNCCTCAGCAGGCCTGGCTGAA 

GGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGA 

ATGCCATCCCANGGAGCACACCAAAATTCNGGAACCACCNCTCCCGGANCAGGCNTCCAT 

TGNTCACGAAGCTCACGCNGGGCAGNCCGCAACTTTACTTTGNAGNAACCTCCCCACTTG 

GC CAAGG GAATTCNCCCCCGGGCCTGGAAGAAAAGGGNTCTCCCACCCGGAAAGGGGCGN 

ACC1TTTCCCMAACCAGCCTTTCCTTAMGCNAAAAGCAAACCCNCTCTTTTGGGTTTC 

NCAAAGGGGGCNGNACAAAAGGGAAGGGTTTTGGGGCNGGGGGGGGGAAACAAAANCCCC 

NCATTNGGAAGNTTGCCCCCGGCCGAGGGGAAGGGGAAAAGGTTGGNCCCCGGTTGGGGG 

GGG 

Sequence 370 

ATTGTTGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACANGGAACTGCCAAAGGCAACA 

GAAATTCTTTCTCCCTATGTCCCAGCCTACCCCCACTTTACCGAGGCCAACAGCCGCCTC 

AGAAACCAGATTCAGGAGCTAACATGCCCCAGGTCTCACGAGGATCAGAGACTCCAGAGG 

CCAGGGAAGGAGANNAAGGTAGNCAAGCGGGGGTGGTCTCAAATCTGGTTGNGCTCGAGC 

TATGCAAATGCCTCTCATGGAGATGCGAGGACCTATGTATTATGATGACCAGGGCCACAT 

CCGGAGGGGGCAACAGACTTTCATNTATCAAGCCCTT 

Sequence 371 

AGGTACTATTGACTAMGTCAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGC 
TTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCA 
GAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATT 
TTGATAATAACACTMGATGGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 
Sequence 372 

CCGGGCAGGTACCGGGTCTAGGGAAGATGCAGAACACTTCAGCCTGGCAGAAGGCTCTAA 

AGATCGTCAACCTTTCCTCTGTTCATTTTGMATTTTTAAAAATATGCTAACTTAGCTGA 

AAATCTCATGGAAACCAGGCTCCTCATCAGACTTGAAAGTCAAACCGGTTTCTCAACAAC 

TTCTCTTTATGGTTCTGTATGGCTCCACAGAAAACCAGAAAAACATTTGGGGCAAGAAGC 

TATGACTCTGTGAGGCCACATGGGAGCAGGCAGTCAATTATTACACTAAGGAACACCCAG 

TTAGCATGAAGTATCCCATCACCTCGGTATTAAGCCCTGCATGCATTAGCTATTACCT 

Sequence 373 

AC I I I II I I I I I I I I I IGCTCAATAGAAGTATGGAATAATTCCAGGTAATTTAAAGCATA 
Mill CAATTGGTGTAAGCTGCTCCATGAAGTCAGCTAGCTCCTCTAATTGGGATGGTTC 
TTCATCACACGGCATGTTCTCAGAGTCTGATGACAGAGCATCAGTGTGTGGTCCCAGCAC 
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CCCCTCCTGTGCGGACTTCTGGGCATCCTCCTCCAGATACTCAATACTCTTGAGGGCCTG 

AGGAAAGTCTCTATGAAAGGTCTTGCAATTTTGGGTGCAATGGTTTCCGTGACAGAAGGT 

TCCTGAGAAAGCACCACAAACTCCTCAGCTTTGACCGGGAAGCCAGCATCACGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 374 

CCGGGCAGGTACCGCTACTGAAATTATTAAACATACACTACAGATCAATTATATAANTAT 

GTTAATATCTTTAGAAATCAAGAGTTGCAGCATAAGAGAAAGGGATACAAAAACAAAACA 

AGCAAAGAAGTTACATAAAAACGTAACGTTGTATTGGAAAAACCAGTATGAACTTATGAT 

TTAGTTTTCTTTCTAAAAACGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 375 

NGCGNTGCGCTCACCTGCCCNCTTTCCANTCGAGGNAAACCTGGTCGTGCNAGGGTGCNA 

NTAATGA ATTC GNCCAAACNCCNCCGGNNGAGNAGGCGGTTTTGCGTNATTGGGGCCGCT 

ATTCCNCTTTTCTCGCTCACCTGACTTCGCTGCCGCTCGGTCGNTCGGCTTGCCGGCCGA 

AGCCGGGTAATTCAAGCCTCCACTCAAAAAGGGCCGGGTAAATTACCGGGTTTNTTCCAC 

CAANGAAATTTCAAGGGGGGGATTAAACCNCNAAGGGAAAAANGAAACATTGTNNGANNC 

AAANAAAGGGCCCNNNCAAAAANNGGGCCCANGNNGAACNCNCCGTAAAAAAAAAG 

Sequence 376 

CCGGGCAGGTACACTCCAGCTCCTCTATCCTTTGTTCCAGGTGAGCCTTNTCATTAAATG 

CTGTCTCTTGAGCAATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGAAGTCGGGTAATTTCTT 

CCTGTGCCTGTTCATAAAGAAGGAAGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTGGGTCGAAGCTTGTACACCTnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAAAATNGCNCNC 

TGGGGCNAMTCNNGGGMAATANGCTGTTTTCNTGGGAAAAAAATTGTTTTNCCCCGCA 

ANAAAAATCCCCCACAAAAATTNTANNGNNNGGGNGAAAAAAAAAAGNAAAAAANNGNGG 

GGGGGGGCCCCCAANNGGGGGGNNGCCCNCNCCCCTNATANTTNATGGNGGGGGGGGGCC 

CTCNANNNGGCCCCTTTTTNNNNAANNGGGAAAAACCTTNGGCGGGNCCCNTTTNTATNA 

AN AAAAA ACC CNGCC CCCCCCCCNNGGGGGGGGNGGGGNGGTTTNGNTTATTGGGGGGCC 

CNTTTTTCCNNTTTTCCNAANAAAAANAAAANANNCGGNGNCCNGGAGGGGGAAANGNGG 

GGGGGGGG 

Sequence 377 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGTGATG 

CTGGCTTCCCGGTCAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGAACCTTCTGTCACGG 

AAACCATTGCACCCAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAG 

TATTGAGTATCTGGAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACA 

CACTGATGCTCTGTCATCAGACTCTGAGAAQATGCCGTGTGATGAAGAACCATCCCAATT 

AGAGGAGCTAGCTGACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTA 

CCTGGAAATATTNCATACTTTNTATTTGNGGCNAAAAAAAA 

Sequence 378 

GCCGGCCCGAGGACTCCAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAA 

AAAAAAAMGTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGGGGCCCCGGGTACCCAAGCTTTTG 

TTCCCTTTAGTGGAGGGGGTTAATTTGCCGCCGCNTTGGGCGTAAATCAATGGGTCATTA 

GCTGTTTCCTGNGNTGGAAAAATTGTTATCCCGCTCACCAAATTCCACACCAAACCATTA 

CCGGAAGCCCCGGGNGNAGCCATTAAAAAAGGTGTNANANAGCCTTGGGGGGGGTGCCCT 

ANATN GGAA GGTGGAGCCTAAACCTCACCATTNAATTTGGCCGTTGGCGCCTCACCTTGC 

CCCGCTTTTCCAAGGTCCGGGGGGAAAAACCTGGTCCGNGGCCAAGCTTGCAATTAAATT 

GGAAATTCGGGCCCA 

Sequence 379 




WO 01/070979 



PCIYUS01/09126 



TABLE 1 

67/467 



GAATTGNAGCTCCCCGCGGTGGCGGCCNNCCGGNCAGGTGGAAAGGTGGGTGGGGAGAGG 

GAGGCTTATTTGTTGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAA 

GGTATATTTGGTTAGATCTAGGTGAGTTCTACTTTAGAGGAAATCCTGGTAACTGTTGTT 

TGTTTGTMGTTATAGCTGTMTTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCAT 

TCAGATACTATGCATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTA 

AGCTATTATTCAAGGAGTATCTACCATCATAAAGGTGGTTTAAGTCATATAGGATAATAT 

CMTCAATMTACAGGGAGATGGCAAAMTTTTTTGGGNAAANCCCAATATTANCTTGGG 

TTTNATGACCCCCNMTCTCACACTTTGGGGNCNTATGGGAAAGGCTTTTTTTAAGACCC 

GGGAGTTCAAGACCNGCCCTGGGCMCATTAMAAACCTCCTTTNNNCCAAAANCTTTAA 

AAAA 

Sequence 380 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGMTATCCAATTCATGTGAACTACAGGAMTTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTAATAAAAGTCMATTTTAAATTTGTGCCCMTTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 381 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GCATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAA 

TTGATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGT 

ATTA AATCCTC CAGAGAAGCCTGTAGTGTGGGATGCAAACTATTTTAAGTGTGACCATGA 

GGTGTTTTTTTGTGGACCATTTTAAAGCCMTGATAGGTTCTAAAGCAATCTCAACCTGA 

GTTAGGTAGAATGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTG 

GATTTCMTTCGGCACTTTATGTATTAGCTAAAAATTTCATGACCAGATCTTTTGAAGTA 

TACAMGTMATCTTCAAGGTGGATAGTTTATCCAAGTGTAAAATGTGTTGCACTAGGTC 

AGCTTGGMTTTTGAGATGACTTTTGGCATCATTGCATACATCTGGTTTGTGTACCTGCC 

CGGGCCGGCCGCTCTAGAACNGTGGATCCCC 

Sequence 382 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCA 

GTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACAAA 

TNGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTC 

CCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGAATATTTTGATAATAACACTAAGATGG 

TATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 383 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATCTGCATATACACCAT 

TAATTTTACATCGTTGAGGTAGCCMAACTGCTCGTAAGTGGGCTTTTATTAAATAATAT 

MTGTTCTTAATAGAGGAAAMGGAATTGMTACATTTTTAAAAACAAAATAACAAAACC 

MTC CATTGTC CCACAAA AGAAAATCAGTGGAGACAMAGCAGTTTAATTTGCTGGATTC 

TTTTGTGGCTTATTTTTTGAGTATTATTTACAAMTGTTAGACTMTl^^ 

TAATAATMGCAACATACAACTCCAAGAATAATATAATAAATAATAAACTGCGGACGCGT 

GGTTCGAAGCTTGCTCGNNGGGGGCCGGGNCGCTTCNAGGCCCNCCCGGGCAGGTACCCA 

GTNATCACATAAATTCTGCMTCATNTGGNTATTNAGCTTNACNTGNTTTTTTTATTTGN 

NGAANTTGTTGTTGTATTCAG 

Sequence 384 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACC 

CACG CGTCCGCTTCAGAATATCCAATTCATGTGAACTACAGGAAATTATAGTTTAGATAT 

TTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCT 

TTACCATATCATTMTAMAGTCAMTTTTAMTTTGTGCCCAATTTGGCTGGGTGTGGT 

GGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 385 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC' I I I I t I I I I I I I I I I I I I 
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TTAAGAAATAGGGTCTCACTCTGTCCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTT 

CTGACTCCTAGGCTCAAGGGATCCTGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGG 

CAGGAGATCGCTTGAACCGGGAGGCGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTT 

TCCAGCCTGGGTGCCAGAGCAAAACTCTGTCTCAAAAAAAAAAAAAATAATAATAATAAA 

TAAATAAAAAGGCAAGGAATATAGGGAAAAGTCAAAAGAGATGGACTGTGAGAAGACTGG 

GAAAGCCAGAAGAATGGNGGAAAATGTAGCATGGAGTAAGACAATAAAAATATAAGAGGA 

CTCATTTCGGACGCGTGGGTCGACTCACCTCGGCCGCTCTAGAACTAGTG 

Sequence 386 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGCTTCGACC 

CACGCGTCCGGGAMTTTTAATTAAMATAGGTGMCATTTTAAATGACCTAATACATAT 

TTAGTCCACATTGAMCTTTGGCATTTTGTCATTGCCATTAAMTTTTGATGGCATTAAA 

AT1TGATGCCATTAAAATTTTGATCAGTAGGTAGCATTTTTTTCTTAGCTACM 

I 1 1 I IMTTATMGTATTAAAAATTCATGAAGATGATTCTTnTGTMAACAGTTTTGCA 

TAAAAAGTAAGTCTCATTTTAAAGCAACTACCAACTTACTGGCCACCT 

Sequence 387 

AGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAATGCATCGATATTACA 
GTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGM 

GAGTATGTCC ACTCTGGTTTACTC CTTTGTAAGTATTAATACCTGATAATTTACATCCTA 
CAGCCCTGCCI I I I I I I I I I I I I I I CAAGTTTGTCCCAGCAAGTCTTGGCCCTTTGCATT 
TTCTTAATACATTTTAGTACCTGCCCG 
Sequence 388 

CCGCGGTGGCGGCCGAGGTACAAAGAACAAAGGGAAGCTAAGGAAGAAAAGATAGTCAAT 

AAAAGATGTCTCATCTGGGCTTAGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCT 

GAGGCTCGAGGACTGCTTGAGTCCAGGAATTTGGGCAAGTAGGAAATTACTGAACAGCTG 

CTATCACAGACAAATGCCTAACATTGTGAAGTGCTACACAGGGGAAGGAGACCCACGCTA 

AGAGGAGAGCATGCACCCAGACACAGAACTCAGAGGACACAGTTCAAAACACACATACAA 

GAGGCTTAGGCACCTGTGGGCGTGTGTGTGCTCACAGCCAGCAAAATGAAAAAAAATCCC 

AGCTCTGAAGGAGAGGCAAGTGCATGGCTTCCGTACCTGCCCG 

Sequence 389 

ACT TAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGA 

TGGtTTTTGCAAAAATTGAAAATGCATCGATATTACAGTTMTTTT^ 

GGTATTAGGCTTAGMCTATAACACAGGAAGTTTTTAGAGTATGTCCACTCTGGTTTACT 

CCTTTGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCCI I I I 1 I I II I I I I 

TCAAGTTTGTCCCAGCANGTCTTGGCCCTTTGCATTTTCTTMTACATTTTAGTACCTGC 

CCGGGCGGCCGGCCGCCCGGGCAGGTACNACTACCTCTTTAAAGTTGTCCTTATTGGAGA 

TTCTGGTGTTGGAAANAGNAATCTCCTGTCTCGATTTACTANGAAATGAGTTTAATCTGG 

AAAGCAAGAGCACCATTGGAGTAGAGTTTGCAACANNGANGCATCCAGGTTGATGGAAAA 

Sequence 390 

TCCCCGNGGTGGCGGCCGAGGTACTATTGACTAAAGTCAAGTTGGGGGAGAGAGAGGCGG 

AAGTATATTACTTTTNTGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAAC 

ATTNNATCATTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTNTGAGGAGG 

TTCATGAGTTCAGMTATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGG 

TAGATATTTGCACTGGT 

Sequence 391 

AGGG CGAATTGGAGCTNNCCGCGGTGGCGGCCGAGGTCTTCNACCCACGCGTCCGATGGT 
TTTTGCAAAAATTGAMATGCATCGATATTACAGTTAATTTTTTCAGTGTGTATGTGGTA 
TTAGGCTTAGMCTATMCACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTT 
TGTAAGTATTAATACCTGATAATTTACATCCTACAGCCCTGCC I I I I I I I I I I I I I I I CA 
AGTTTGTCCCANCAAGTCTTGGCCCTTTGCATTTTCTTMTACATTTTAGTACCTGCCCG 
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Sequence 392 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I I I I ACCTGAAAATGCTTATTCTAGCTT 
CACATTTGATTGTTTGGCTAAGAAGAAMTTATTTATTAGACTTMTTTTCCTCACGAGT 
TTAMGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 
CTGATTCTTGCMCTTACMGTAGTC 

TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 
Sequence 393 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCMACTATTTTAAGTGTGACCATGAGGTGTTTTTTTGTGGACCATTT^ 

AAGCCAATGATAG 

Sequence 394 

GGGC GAATT GGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCTACCCACGCGTCC 

GCACATTTTGATGGTCAGTCAATAACTTAAGCAGTTACCAAAATACTAGGTATCCAAGGA 

GCGAGAGGTGGGCGAGCATAAGAAACACATTTCTCATGGCACAGCTCTGCCAAAGCCCTG 

CAGAATCATTTACACATAGGTCTTTGGTTAGTAGCCCCTGGCACAGAATTCTGATCTTAA 

ACAAATATTGTCTATAATCAAGTAGAGCAATGCMTTAAMAAAAMAGCACAGGTTTTG 

GGGCCATGCTGAAATCCCAGCCTTGCTATTTGCTGGCTGTGTGACCGTGGTTCCTTGGTC 

TCATTATGCTTTGGTTCCCGTATCTATAAAACGGACGTAATAATGTCTCCCTCTCATTAT 

TGTGAAGTCGAAATGATGTCTGTAAAGTGCCCAACACAGTACTAAAGGGCTATT 

Sequence 395 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTTTAAGTAACAT 

T CAGAT TTGTGTGTGTGGAGAGGTTGTAGGGAACAGAATTGTAGGAAGGTGCTCACACCT 

G I I I I I GTTTGTTTGTTTTATGTATATATGGTGGGTAGAAAATAAGGATTAATGAATGCA 

GTMGGTATTTGAGCACTCTTGTTTATCTTGTGTAGGTGCCMCCAAATATTTTTATAGA 

GATGTGGTTAAGCCTCTTGGCATGTTCAACTGTGTACCT 

Sequence 396 

CCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATA 
CAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAA 
NGANGGAGAATGAAAACTTCCGGACGCNTGGGTCGAAGCTTGACCT 
Sequence 397 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCGCGCGTCCGCATTT 

ATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATC 

TTMCCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAA 

TCCTC CAGAGAAGCCTGTAGTGTGGGATGCAMCTATTTTAAGTGTGACCATGAGGTGTT . 

TTTTTGTGGACCATTTTAAAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGG 

TAGMTGGGTTGGTTATCTGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTC 

MTTCGGCACTTTATGTATTAGCTAAAAATTTCATGACCAGATCTTTTGAAGTATACAAA 

GTMATCTTCAAGGTGATAGTTTATCCAAGTGTAAATGTGTTGCACTAGGTCAGCTTGGA 

ATTTTGAGAT 

Sequence 398 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCGCCATGTTGGCTGGGCTGGT 

CTCGAAC TCCTG ACCTCAAGTGATCTGTCCTGGCCTCCCAAAGTGTTGGGATTACAGGCT 

ACTAAAGCTTTTTATTTATTTCTGTGGATTTC 

A TATG MGGATTCCTTTTGGTATTTTTTGMGAACTTCTTTATCAGMTTMTTA 

CTTTTGAAAATTTTTAACAATCTAGAAACGTCTTTTT^ 

GGATGTAAAATTTTTGGTTAATTTTGTTTTTCTT^ 

CTTCTGGCTTTCAGTACCTGCCCG 

Sequence 399 

CCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAGATCCGTTTCAGA 
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AACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACCTGATCTTATTCT 

CATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAATTTACTGGAGTTG 

GCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCACCTGAGAATTTGA 

TAGACATAGACCCAGAGTTACTGAGGCAGGTGCTCTGTTTTGGGGACCAGCAATCGGTGC 

TTTAGCMGTTCTTTGGGTGATAGGGTTTGGAAACTACTGCTCTAAAGCATCATCTGTTT 

TGACTTTGCCATGCACAATCTGAACTCACTCCCGTGAGGCCCTGCTCCTGATACTTTAAA 

TCGTCCTGTCTCTTTTTCTGCCTCTCTGTGGAG 

Sequence 400 

CCGGGCAGGT ACAGGC ACCTATAGAATTTMAGGGGAGATTTCTTTATTTTGTATTCAAT 

GTATTAATMGATTTTTAAAACATATTTTGGAGAAATTGCTAATTAGTGTATAATCCTGA 

TGCCAATTCTAAAAAACC I I I I 1 1 I 1 1 I I I GTAGAGACAGGGTCTTATTCTGTCACCCGG 

GCTGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAA 

TCCTCCCACCTC AGCCT CCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCT 

MCCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCTGGTCTTGAAC 

TCCTAACCTCAAGCAAACATCCCTCCTCGTGCTCCCAAAATGCTNGGATTACCAGCATTA 

AGCCTTACAAGCATAAGCTACCATGGACTGGCTTTCNAAAAAATATTTGGTTTAAAATTC 

Sequence 401 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATNNATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGAGTTCTACTTTAGAGGAMTCCTGGNMCTGTTGTTTGTTTGTAAGTTA 

TAGCTGTAATTAATTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAA 

GGAGTATCTACCA TCATAA AGGTGGTTTAGTCATATAGATAATATCAATCAATAATACAG 

GAGATGGCAAAAATTTTTTGTGAAGAGCCAGATAGTANCTGAGTATGATGACCCCTAATC 

TCAGCACTTTGGGAGGCTGATGGGAGAGGGTCATTTAAGACCAGGAGTTCAAAGACCAGC 

CTGGGCAACATTAAAAAACTCCATTTCTACCAAAAACTTTAAAAAAAATTAGC 

Sequence 402 

GCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGGGAGGACCTAGGCAACGGCC 

TGAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAACCTACTGAAGATACGTGGTTT 

TACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGCCATCTGAGACCAGCCATCTGG 

CAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATCCCACTGACACCACGTAGAGCA 

CAAATGAGT TGCCT CCACTGAGCCCAGCCCAAATTGTTATCCTATAAAATCATAAAAACA 

TAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCG 

Sequence 403 

TACTATAGGGCGAATNGNAGCTNCCCGCGGTGGCGGNCGAGGTATTCAACAAGGGCCCTG 

AGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGT 

GTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCA 

GGGAGGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACT 

AAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTT 

TAAAAGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTG 
CCCG 

Sequence 404 

CCGCGGTGGCGGCCGAGGTCAAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGT 

CAAAGCTGAGGAGTTTGTGGTGCTTTCTCAGGNACCTTCTGTCACGGAAACCATTGCACC 

CAAAATTGCAAGACCTTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTG 

GAGGAGGATGCCCAGAAGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTG 

TCATCAGACTCTGAGAACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCT 

GACTTCATGGAGCAGCTTACACCAATTGAAAAATATGCTTTAAATTACCTGGAATTATTC 

CATACTTCTATTGAGCAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 
Sequence 405 
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TTAGG GCGA ATTGGAGCTCCCCGCGGTG6CGGCCGAGGTCAAGCTTCGACCCACGCGTCC 
GGAAGTTTT CATTCT CCCTC I I I I 1 1 I I I I 1 1 I I I I I I I I AGCAATTCAAGNCATGTTTT 
TGTCTACAAGTTTTTCCAGTATTGTATAGATAAATAATAATTTACTAGGCTGCCTTGAGT 
ATACTAGACAAGAGACCTGCCCG 
Sequence 406 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I 1 1 I I I I 1 1 I 1 1 I I I AC 

CTGAAAA TGCTT ATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAAATTATTTATT 

AGACTTMTTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAA 

AGCTGAGMGTCCMTG CCATTCT GATTCTTGCAACTTACMGTAGTCTTTTTTTGTCTA 

GACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATC 

TTTTGGGTACCT 

Sequence 407 

CCGCGGTGG CGGCC GC AC I I I I I I I I I I 1 1 1 I I I 1 1 1 CTCTTATATTGAAGTAAAATTTA 
AAATTTMTACTTTTTATTTTTTMAAGCATGTATGGCATCATTTCAGTCTTATTAAATT 
CTCTCTGCATCCATTCACCCATCCTTCTTTTTGTGTGTGTGTGTAGTGGTCTCTGTGAGA 
' GGGT TCATTMT GTCM TCCTGATCATTTCTTCCTCAAGAGATGTCAGTAGATTTGTTTT 
TTTTGCTTTGGACTTTTATGAATTGATTGAATTTTTATGCCAATTATTTTTAAAGTATTA 
CATAGAAGMCAAATGGACAGAAAAATTTAAATGCAATCAAATCTTGTTGATTTTGAAGT 
ATAGGAAATAATC I I I I I 1 1 I ATTATACTTTAAGTTTTAGGGTACCTGCCCGGGCGGCCG 
CTCTAGAACTAG 
Sequence 408 

CCGCGG TGGC GGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

GATCAGTGGTTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 409 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCAC 

GCGTCCGCTTCAGAATATCCAATTCATGTGMCTACAGGAAATTATAGTTTAGATATTTT 

TAAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTA 

CCATATCATTMTAAMGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGC 

TCATTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 410 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCAGTGCAAATATCTACCCAGT 

TAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTCCT 

CAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTTCT 

TTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGCCT 

CTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 41 1 

ACTT AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACi I I I 1 1 CATTGATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGNNNTGTGAA 

GGAAATATGCCTAATAAGAGATGATAGTTTTAGCAATAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGGAAAATGA 

ATTATTAAAATTCCATGGAATTCC7TTTGAAGTTTATGAAGTACCTGCCCG 

Sequence 412 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTAAGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGG 

Sequence 413 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAC7TGTGATGCTT 

TTGGCAGGAATTACAGAACAACCAATGCCATTCAAGTTGTGGAGATTATACTNGCAGGTG 

AACTCGTAAAGAGAAGATTCTGGAATGCCTATATCTGAAAGCTTGAGTCGACACCTN 
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Sequence 41 4 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGAATGAGTGAGTGCAGAACTGGC 

AACCAGAAGACAGGAACAAGGCCTGGGAATGGAGCGGAAAGGTAGCTGCTATATATAGTT 

CCTTCAGCCAGTAACGATTAGAGCCAATAGCCATCTGGATGATGAATGGCTCCTAATTGC 

CTTAAATTACGGCAGTTAGCTMGGGTTTCTGTTGCTACATGGGTTACCGTAGGCCGCTG 

CACCCTGCATAACTGTCCTCAGGCCTGCGTCCCCTGAGTCTCAGCACTTGGGCCTCCACC 

TGCCCG 

Sequence 415 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAA 
TCACAGGAATGAGCCACCACACCCAGCCAAATTGGGCACAAATTTAAAATTTGACTTTTA 
TTAATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAG 
GCAAATCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTC 

TGAAGCGGCCCCNTGGGTCGACTTTGTAACCTGCCCGGGCGGCCGNTCTAGAACTAGTGG 

GATCCC 

Sequence 416 

TATGGCGAATTGGAGCTCCCCGCGGTGGCGGGNCGAGGTNAAGCTTCGACCCACGCGTCC 
GATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCNTATT 
CTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 417 

CCGCGGTGGCGGCCGAGGTACTCTTGATGTCATAAGATTAGAAAATGTGGTTAATTGTCA 
TCAACCCATTAAGTTCTTAAATGTCATTGAATGGAGTCCTTGTCATGTTACAGAGGAGCG 
TAAAATTGTGGTTAMCATTTTTTrMAGATTACATGGTAGAGCCACAGTTTGTTATGCA 
G AAG G AAAAT7TAG CAAATATTATTTTGCTTAATAG CC7TTAAAAAATCGTATAAATTTG 

ATTTGTAGTrTTATCCCCAGAGTCATTAGATTTTTCCAAAAAAAAAAAAAAAAAAAGGT 
Sequence 418 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCT 

TAA CTTAG ATTGTTACTTATGTTTCTAAATCTGNGGAAGCACATTTCCTTTTNTTNTNNT 

TTTCTTTTACTGTTMTATCCTTATTCTCTATTTTACCAGTGGAGAATGNTTAGTATTAA 

TTTCCA7TTANCTCANGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAG 

CCTACTGTCTTCTGNTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCCCCACTT 

TCCTCATCAGCAAG7TGGTGTGGCAATGGATCATAATAGG7TGCTGGGAGGATGAAGTGA 

GCGGACCGCGTGGGTCGAAGCTTGTACCTN 

Sequence 419 

CCGCGGTGGCGGCCGCACTTTTTTTGTATTACTTCMCTTTTAAAAATTCTAAAGAAAAC 

CATCATCTCAGACCAGCATTTCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 420 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCT 

CCCTCGTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGC 

AGGTGATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTG 

AACTGGAAAGCACCT 

Sequence 421 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGAAGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

AACCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTNATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGGATTTAATACTTTGG 

Sequence 422 

GGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTCGTCCGGATCCTTCTAACGCAGG 
AGTCTCAGACGCAAATGCCGGCAAGGGCCAGGCAGGTGATGTAAGATGCGTGGAGCAGAT 
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GCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTGGAAAGCACCT 
Sequence 423 

TNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATAC 

AGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGT 

AGAATGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACT 

TTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGT 

AGGATACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAA 

GCTATCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACA 

GAAAATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 424 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGAC 

CCACGCGTCCGATTTGGCAGAAATTCAGGGTAATGTCMGGTTCTTAAATCTGAGAGAGA 

CAAGATCTTCCTTCTTTATGAACAGGCACAGGAAGAAATTACCCGACTTCGACGAGAAAT 

GATGAAAAGCTGTAAGAGTCCTAAATCAACAACGGCACATGCTATTCTCCGGCGAGTGGA 

GACTGAAAGAGATGTANCCTTTACTGATTTACGAAGAATGACCACAGAACGAGATAGTCT 

AAGGGAGAGGCTAAAGATTGCTCAAGAGACAGCATTTAATGAGAAGGCTCACCTGGAACA 

AAGGATAGAGGAGCTGGANGNCCNTCCCGGGGGCCGGGGNCNGCCCGCCCCNNGCAGGGT 

CANATGATTGCAGAATTTATGTGATTCCTGGGGT 

Sequence 425 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTACTGAGCCACTTA 

CAATTATT TCTG AAAAATCTCAGAGAACTGAGGATAGATCAGAAAATTTAAAGAAAGCAA 

ATACCAMTTTTCAAACCAGGATAGGAAGTGAATACCTTGTAAATACACTTTGTTAAGTG 

ATGATMTTCTGAGTAAAAATTTAGMGATTTTGAGAAAAGCATTTGAACTTCTAGGGGC 

CAATAAAATACCATGCAGAAGAATGTTTAAAAAGTCATGCCAAATTTGAATCCATTTGAT 

CCTCAACCTCATCAGATGTTATATGCCAAACTACTTATTTTGGCTTAGATAATAATCATA 

TAGAATGAAACTTTCCACAAATAGACTGTGGTCAGTGGCTG 

Sequence 426 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCA 

AAAACMGTTTGGTCCCCCTGTTATAGNGCCTGGTAMGGTTTTTGTTGTTGTTTTGCAG 

GGGTGGGGGAACCAGGAAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGT 

AACTGCTACAGCAAAGGGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGT 

CGGA 

Sequence 427 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGTCAAACATCTCCCTC 

GTCCGGATCCTTCTAACGCAGGAGTCTCAGACGCAAATGCCGGCAAGGGCCAGGNAGGTG 

ATGTAAGATGCGTGGAGCAGATGCCAAGCCACAGGGAGTGGTGGAGACTGGGGTGAACTG 
GAAAGCACCT 

Sequence 428 

TANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCAC 

GCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTA7TACATGTTGGGAGCACAAAGA 

TGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCA 

GGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGCCCTGA 

AATGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCT 

TTACAGCAGCCTCNAAGACTGATCAGGTTACTATACCCTCTCTTNTGTCCACNGTGCATT 
TNAA 

Sequence 429 

CCGGGCAGGTCAAGCTTCGACCCACCGTCCGGCAATGATGAGCAAAAACAAGTTTGGTCC 

CCCTGTTATAGAGNCTGGTAAAGGTTTTTGTTGTTGTTTTGCAGGGGTGGGGGAACCAGG 

AAATCAGATCATCACAACAATATATACTTATCTGTAACTATGGTAACTGCTACAGCAAAG 

GGGCGTATCATACTATTAGCATACTAAGTTTCACTTAAAGAGGTCGGA 

Sequence 430 
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CCGGGCAGGTACACTCCAGCTCCTCTATCCCTTGTTCCAGGTGAGCCTTCTCATTAAATG 

CTGTGTCTTGAGCNATCTTTAGCCTCTCCCTTAGACTATCTCGTTCTGTGGTCATTCTTC 

GTAAATCAGTAAAGGCTACATCTCTTTCAGTCTCCACTCGCCGGAGAATAGCATGTGCCG 

TTGTTGATTTAGGACTCTTACAGCTTTTCATCATTTCTCGTCGMGTCGGGTMTTTCT^ 

CCTGTGGCTGTTCATAMGMGGMGATCTTGTCTCTCTCAGATTTAAGAACCTTGACAT 

TACCCTGAATTTCTGCCAAATCGGACGCGTAGGTCGAAGCTTGTACACCTCGGCCGCTCT 

AGAACTAGTGGGATCCCCCGG 

Sequence 431 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACGATTTAAAATCTC 

CTCCTCCTACAGCGGTCGAGTATTGAAGCAGGTCTTTGAGGATGGGCNGGAATTAGAGTC 

ACCAAAGGAGGAATACCCTCACAGTTTTCTGCAAGAGTCTCTTGAAACAATGGATGGTGT 

TTATGGGTCTGGGGAAGACCCCNCGCCCCAAATGTTGCTCCCCT 

Sequence 432 

CTTAGGGCGAA TTGGA GCTCCCCGCGGTGGCGGCCGAGGTACACACAAACACACACACAA 

AGTTTAATATACTTTTTAAAAAATnTATTGTATTGTTTTCTTGAAATA 

TGTTGCCTAGGCTGGTCTTGAACTCCTGGGATTAAGCAATCCTCCCAACTAAGCCTTCCA 

AAATGCTGGCATTACAGGTGTGAGCTACCACAATCAGTCTCTTAGATTTTGTTTTTTAAG 

MCA ATTCGAAGTTTACTGCAAAATTGTGAAGAACGAACAGACTGTTCCCACATATCCCT 

TTTTCTTTACACACCGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 433 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGCTTGAGGTGGG 

T7TAGGAMCATTTGGTATCTNTGGCAGGGACAGATGTTGACCTGGCCGGTCGGCAGCTT 

TTACAAACCTAAGGACT TCAGG GTCCGGTTGCGCATGAGGACCGGGGAGGACAGAGCTGT 

TTGCMTAGGTGTGGGCTTTTATAGCATTGTGAGCATTTCACGTTAGCGTAAGTGTTGCT 

GCTGTGCAGGTGGTCTCTGGGGCTTACAATCTTCCCCAATGTTCTTCCCCACCCCTCCCA 

CCATTCTGGTGAACAAGCCTCTTGGGATTCTTTGAAAAAAAAAAAAAAAAAAACCT 

Sequence 434 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAATATACAGAG 

GTATAATCTGTAACATCAATAATGTAMGTGGGGAAGGGCAAGGTGGAAAAGGAGNNGAA 

TGCTTGTATGTGACTAMATTATGTTGGTATCAGTTTAAAATATATTATTATAACTTTAG 

AATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAAATATCTGTAGGA 

TACACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTA 

TCAAAATAGAAAACAATAATGGAGAAAATAAGACACCAAAAGCTATAAGACTCACAGAAA 

ATAAATAATAAAATGGCAAAAAGAAGCGGACGCGTGGGTCGAAGACCT 

Sequence 435 

CAGGTACAGGCACCTATATG MTTTA MCGGGGAAGATTTCTTTATTTTGTATTCAATGT 

ATNAATMGATTNTITAAMCATATTTTGGAGAAATNGCTAATTAGTGTATAATCCTGATG 

CCAATTCTAAA A AACC I I I I I 1 1 I I II I GNAGAGACAGGGTNTTATTCTGTCACCCGGGC 

TGGAGTGCTCTGGTATGATCCTAGTTCACTGCAACCTCAAATACCTGGTCTCAAGCAATC 

CTCCCACCTCAGCCTCCCCAGTAGCTGTCTCTATAAGCATGCACCACCACACCTGGCTAA 

CCTTCTTATTATTTTTGGTAGAGACAGTCTCACTATGTTGCCCAGGCT 

Sequence 436 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCAACAAGGGCCCTGAGAG 

AGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAGCAGAGACAGGAGTCAGCACGTGTGA 

GGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACCTGGCGATGAGGAGGCACGGCAGGGA 

GGTGGAACAGGCAGGAGAGACTCTTCAGGAATTGAGGAGATAGAATAGAGGACACTAAAG 

CCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGATCATCGCTTGAGGCTAGGAGTTTAAA 

AGCAGCCTGGGCAACATAGCGAGACCCCATCTCTAAACACAAAAAATAAAAACCTGCCCG 

Sequence 437 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCACGTAGCAAC 
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ATATGAGTATTTCTCTAGATAACI I I I I I I I I GACAAGGTCTCACTCTGTTGCCCAGGCT 

GGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTTGACCTTCCCTAGCTCAGCTGAAC 

CTCCCATCTCAGGACACCATTGCCTCCACTGCCCATCCTGCATCTGCCTGCCTACCCCAA 

AAGTGTTGAGAATACAAGCATGAGCCAGAGCCACGGAACCTGGCCTCTAGAGAGACTTTC 

TATTTTAGTTTTTTCTTCTCTTATTTGTGMGCCTTGAAAAAACTACTGTGGTTTATTTA 

GATTCTGGTTTGTGACTTTTTTAAATAAACTTTTTATTTTGGAATAAATTTATGTTTAGA 

GMTAGTTGCAAACATAATAAAGTGAGTTTTCATAAACGCCTTACCAGTTTCCCCTGNTG 

GTTAACATTTTACATCACCATGCTGTTGCATTGGTCAAAACTA 

Sequence 438 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTG I INI) I GTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

TTAGCTAAAAATT 

Sequence 439 

TCGAGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTC 

AAAATNGCAAGCNGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGA 

TTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTNAGCTAATACATAAAGTGCC 

GAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCA 

TTCTACCTAACTCANGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGNCCAC 

AAAAAAACACCTTATGGTCACACTTAAAA 

Sequence 440 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTAAAACTTAAAGTA 

TAATAGTAATMTAMAAMGAGGTGCTTTTCTCCTAAGTCAACATTTTAGAGGAAAAGA 

GTCAATTCAAGCAATTATCACATATGTGTAACTGAAGCACATATGTGTAACTTTTCAAGA 

GTGATTAGATGGTCTGTTGTCTTTGMGTGATAGTCAAATATCAGGTGTGTTCTAGGGAG 

G7TGTGTMGACTTTTGCTTGTA7TCTCCCGGACGCGTGAGTCGACTCAAGACCTGCCCG 

Sequence 441 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGATTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTNTCTAATGTC 
ATATTCTTCCAAAAAAAAAAAAAAAAAAGT 
Sequence 442 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTACAAGTCGACCCACG 

CGTCCGCTTCAGAATATCCMTTCATGTGMCTACAGGAMTTATAGTTTAGATATTTTT 

AAATGATTTGCCTGTCACCGTATAACACAAGGGTGTCATGACCAAGCTAGATCTCTTTCA 

TATCATTAATAAMGTCAMTTTTAAATTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCA 

TTCCTGTGATTCCAGCACTTTGGGAGACCT 

Sequence 443 

CGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTC 

CAAGCTGACCTAGTGCMCACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTT 

TGTATACTTCAAAAGATCTGGTCATGAMTTTTTAGCTAATACATAAAGTGCCGAATTGA 

AATCCAGAATACAATAGCTATGAAGGGCCGNTAGAGTGCAGATAACCAACCCATTCTACC 

TAACTCAGGTTGAGATTGCTTTANAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAA 

CACCTCATGGTCACACTTAAA 

Sequence 444 

ACNGNCAGGTACCAAGATTAAGGACAGAGTTCCCTCCATTGGTCATTGATTTGNAAACCA 

AAATGTATCTGTGACAGGTATTAATCCGGACGCGTGGTCGAAGACGAAAGGACACGAGAA 

ATANGGACCTANNCCGCTCTANAACTAGGNATCCCCNNNNCTGCAGGAATTCGATATCA 
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Sequence 445 

CCGCGGTGGCGGCCCGCCCGGGCAGGTGCCAMGCCTTAACTTAGATTGTTACTTATGTT 

TCTAMTCTGTGGAAGCACATTTCCTTTTCTTCTTCTTTTCTTTTACTGTTAAT 

ATTCTCTATTTTACCAGTGGAGMTGTTTAGTATMTTTCCATTTANCTCAAGATTCAAG 

AMTGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTACTGTCTTCTGCTTTGGAAGT 

CCCACMCAGCTCTTTMTTTTCTTMGCCCCACTTTCCTCATCANCAAGTNGGTGTGGC 

AATGGATCATAATAGGTTGC 

Sequence 446 

CGGGCAGGTACCCTTATTTTCCCTGATACAGCACAACTCTGCCTATTCTAATCATGACCT 
AGACACATTCAATGAACTACAAGTTCTCCTTTACACGGACGCGTGGGTCGACTCCGGACG 
CGTGGGTCGAAGACCTCGGCCGCCT 
Sequence 447 

CGAGGTCTTCGA CCCAC GCGTCCGCTTTCTTTTGTTGTTAAGGACCCATGACCTGCAGTT 

TCCCTMCATTCATTTTTATACAGGGCAGAGGTATGTGTGCGAGCTCAGATACCTTAAAT 

TCATATGCCTTTAATACMTCAGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGG 

AGAGTGAAAGTGGGCCAGCCTAAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTG 

TGGCTGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATCATGNTCATAGCTGNTTNCTGGG 

TNAAMTNGNTANTCCGNTCANAATTTTCACACANTCNTANNAACCGTTNCCTTNAATTG 

TTNANNCCNGGGGTNCCNAATTGGGNGNNCTNACTNAAATNAANTNGGGTNGCTTNTNTT 

GNNNNNTTTTTANNTNNGTTTANCCTTNTNNNCANNTTNNTTNTTNAAATTNGNNAAANN 

CCNNGNGGGGAGGGNNNNTTTNNNNNNTNNNGNNANNNTNNNNGNNNNNNNGGNNNNNNN 

TTNATTNNNNTCTNTNNNNNTNNTNNNNNNTANNNNANTTTNNNNTTNNNNNNNNNNGGN 

NNNNNNNNNNNNNTNNANNTTTANNNNAANNNNGGNNNNTNNNNNTTNNANTNNNNTTNG 

NGTNGTTNTNTNNNNNNNANNNTNTNTTNNNNNNNNNNNNTNTNNNNTTTNNTTTNNNNA 

NGNNNTTNTTNTTTTTTTT 

Sequence 448 

GCGGCCGCCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATG 

TGAACTACAGGAAATTATAGTTTAGATATTTTTAMTGATTTGCCTGTCACCGTATAACA 

CAAGGGTGTCATGACCAAGCTAGATCTCTTTCATATCATTAATMAAGTCAMTTTTAAA 

TTTGTGCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGA 

CCT 

Sequence 449 

CGCGGTGGCGGCCCGACCAGTGCAAATATCTACCCAGTTAGAAGAGTAAATACCATCTTA 

GTGTTATTATCAAAATATTCTGAACTCATGAACCTCCTCAGACTGTTGCTGGGACTCCCA 

GATATCMTACTCTGAGAACCACTGATCTMTGTCTTTAGTCAGTTTCTATTTGTTCTCT 

AGTATAACCAAGCATAAAAGTAATATACTTCCGCCTCTCTCTCCCCCAACTGACTTTAGT 

CAATAGTACCTCGGCCG 

Sequence 450 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTGTCAAGCTTCGACCCACG 

CGTCCGAAAGAGACCCCTGGGCTAGGGAGCCATTTGCAAGTCCCTCAGATGGCCTCAGGT 

GTGGGTCATGCTCCACTGCAGCCATGGTTACTGGGAAGCCAAGTGGACTCCATGGGGCTC 

CAAGCAGGGGAGGGAGCTCCTGGGAGACACAGTGTGGCCCAGATGCCCTGGTCATGGGGA 

CATCCAAGCACAGGTGAGAAACTTGGGCAGGGAGGGTGCAGAGGAAATTTGGAAATGGCT 

GAGAGGCTGGACTCAAGGTCTCTTGGGCTTGAACTGAGCTCAAGCAGTCCACCCACTTCA 

GCTTCCAGGGTGCTGGGATTACAGTCGTGAGCCATGGCATCCGGCTGGACTCAAGGTCTC 

TTGTTCCCTGATCCTGGGCTTGGCATGGAGAAGGGAGGAAGCTGAAGGGGGCAGGTAAAA 

AGAT 

Sequence 451 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 
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TCCGATTATACCCTAAAAAAGTAGAAAGATGGAATAATTAGGAGTAAAATTAGTGAAATG 
GAMGATATGTTATAGAAAGGACCAAGAAACGCAAAAGTTGGTATTTGAAAAGACTGAAA 
AAATTCATGAACCTGTGAAMCAGTGGTCAAGMGAMAGAGAGGCATATTTTGATCTCT 
GTTTTACATGTTACTCAATGTTCATTGCTGCCTCCCTTGTCCATAAAGTGCCTTTAGTGT 
GTATGTTACTTTAGATTATCTTGGTGTCATCAAGCTTTACTCAGCAMGAACCACTTTGT 1 
TGTCTACTTTAAAACATAAGTTATCTTTAAAAGAATGGGTATCTTTTATAGTTCCATATT 
MTGGCGMGAMCTGCAGGTAACAGTGCCTTACCAGCTGGGTTTTGCTMCTTTTCTC 
Sequence 452 

CCGCGGTGGCGGCCCGCCCGGGCAGGTTTTATA I I I I I I I CCTCTTTAAAAAATAATTTG 

GTTTTGAATATTAAATTTACATATTTCTAAGTTAAATCAACATTCGTAGAGGAATTATCA 

AAAAMACTAGTAAGTCTGAAAAAAAAACCATATTTTATATTCTGAGGTCCCGGACGCGT 

GGGTCGAAGCTTGACCT 

Sequence 453 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGCAGGTACCCAACACAAACTA 

TTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCA 

CCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAA 

AATCAACATTAGCTGGGCGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGA 

GGCAGGAGAATCACTTGAAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATT 

GCACTGCAGCCTGGGCAACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAAT 

AAACACAACAGAATTATTCTGCAAATACAGATATTGGAGTAGCTGAGTTCCATCTCAAAT 

TTGA CTATGCAGGTTGACAGGTGATCTTGGCAMCTACTTATCCTTTCTGAAGTTCAACT 

TTTTCACCAAATGGTATTGGGATACAACACTTGCTCTTGCCTATCTCACATGAATTATCC 

ATTTTGGACAACTTGGTAAACTATA 

Sequence 454 

CCGCGGTGGCGGCCCGAGGTCCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTAAGATCTGAAGCAATCT 

TTAMCTCGTGAGGAAAATTAAGTCTAATAMTMTTTTCTTCTTAGCCAAACAATCAAA 

TGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 455 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTACAGCATCCTGATAA 

CAGCCTCTGCCCTGGGAAACAGACTGTGACCATGCATTTCTAGTCCAGCATATCCTATCA 

GAAGACCAAATGGCTTCATCAAAAACAGAGTGACAACCTCTTCTCTTTGCCTCTTCTGTG 

CTTGTTMCAGGCAGCATTGGGGCAGGAGAGCCTGCAGGCCTTTCACGGCTGCTTGAGTT 

CTCACCTGTTTGTCTGAGCTCTGATTCCTCTGCCCTGTAAGCGTAAAGGAGATGTGCTGA 

GTGGAAAGACCTCTAAACAGGCAGCCAGGAAGCCAGATTTCAGGTCCATCTCTGCCTCTA 

ACTGGCAGCTTTGCCTTGGGTAAATCATCAAGTGGGCAATAGTTTCTCTCCTGTAAAAGG 

MAAGATTGGGTTTMGATTGTTTCTGAAGTTCTCTCTAGATTTAACCTGGAAGGAGTTG 

AAATTGCTAACC 

Sequence 456 

CCGCGGTGGCGGCCGCCCGGCAGGTACAACATTTTACATTTCCAGGGACTGCAAAAATGT 

TAGTTCCTTCCCCCATCATTTAGTTTGMAATTCTTAGATMTTCTTTGCTGGTAAATTC 

CAACAGAATAGTTAGCACACAGGTTCCACACACACAAGTTCTAGATAGGAATCTGAAGCA 

CCACMTGAAMGAACATTTAACATCTTTTAAAMTGTTTAATGTTATCAGAAAGATGTT 

TGGTATATGTGTTCCATGCATGCTCCTGCTGGTTCTATTTGAAAAAGAAGTTTTTACAGT 

TATCTGTTGTCACCATATTGTAAACGGACGCGTGGGTCGAAGACCT 

Sequence 457 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACMTAAATAGCA 
TTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACACC 
GCTGCCTTCTGACGGTAAATGTATTGTAGMTTCATGTTGTATCAGGCTTCAGTTTCCTC 
ATTCTAAAATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTTTTACTTTTAATCAGTGG 
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CATC TCCCATTTA I NN CATTTGAAATAAAACTTTTGAATTTTATCTTCTACCTAAATA 

ACATATTTTGTTTTATGTTTCMGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAAAGC 

AAAGGATCAAAGCTGGGGGAAAATACTGGACCATGTGCTTCACCTCATGGTGCCAAATAA 

AGAGAAAATGGGGAGAAGATAGGGACAGATAAAGATCTATTTGCTCGGATTGNGCTCTCA 

TCCTTGGCAACATGTTGACAATGCCCTGGAAATA 

Sequence 458 

CCGCGGTGGCGGCCCGAGAGCTCCAGGACGAAGGTATAAACACAGCAGAGGGCAGAGCCT 

GATTTCAATCAGGGGCTACTCTAAGAAAGGCAGGAAACTAGATAAATACATTTAAAAGAA 

ATTCCTCAGTGGCAGGGACAGTAGAGCAGCAGGGGGAGATCCCAGCACGGACAGGTAACA 

GTGTGATGTGGCAGAAAGGCTTTGGTTGCAAGTGGAGAACAGATGTCTCTGGCTGCCTCT 

GGCAGCTGCCTCCTTCTGGGCCTTGACTTTTCAAAGCCAGGCCAGGCCTCCCCCACCCTG 

GACCACCTGTAGCTGGTTCAGAAGGCCCCAGGCTGGGCTTCATAGATGAAGACACAGCTG 

ACTCAAGTCTCTGGCTCTGTCGGCTCTTGGCCACCTTGCCGCTCCCATAACGGTGTTTCT 

CAGGTCAACCCCTCTTCTCCATTCCTACTTCAATGACCTCAGGTCAGGCCCTTGCCACTT 

CTCTTCTGGACAAAGATGACAGCCCTTCACTGGTATCCTCGTCTNCAACCTAATTTATNC 

TTCACAGTGCTGGCAGAAGTGACATCTTTTAAACACACAACGACCCCN 

Sequence 459 

GCGGCCGAGGTCCGCAC I I I I I I I I I I I I I I I i I TATTTTTACTCCAGMTTTTCCTTTA 

ATA1TTAGGACTCCAATCTTTACTTACAAAATAGCTTTTATTTACGTGCACATGATCGTG 

GTTTCAAAATTTTCTAAGCACTATGCTAAATTTGTCATCAAAACATAACAGATTCCCATC 

TTACAAACATAGTTGCTAGTTGMTGAGTAAAAGAGATTTCAAATTTCAATTCAAGGAGG 

CATGTCTAAAAGACCAGACCATTCATTTGATGAAATTGTAAATGCCGATCATCCAACTTA 

ACAGGAACTGCACATTTGTTCCTTTCTAGTTAGAAAAAAATAA 

Sequence 460 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTT 

TGTTGATTGACTGATTTATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGAT 

GTCCTAATACAAAACTGATAGAGTTCAGAAATAATAAGAATCTCCTGGCCAGGCGTGGTG 

GCTCACGCCTTTAATCCCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGA 

GATTGAGACCATCCTGGCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAA 

TTAGCCCGGGTGTGATGGCGACCT 

Sequence 461 

CGCGGTGGCGGCCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATAAGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 462 

CCGCGGTGGCGGCCCGAGGTGGMTGTCTGTTTTCACAAMTTTTGTATTTTCTCCTAAT 
AGTATGAGGTNGAAGAAATCTACATCTTCTCAAGTGAGCTTATGATTAACTCGATGAGTT 
TTCTTGCTATTCTCAAATCGGAATNTCCAGACCTGGCTAGAAACTAAAGTCTAAGCCCAT 
TCATTAAAGTCTTGAATTTATTTACTTTNGCCAAGAACAGCTATATAAAATTAGATTCCT 
CCTGGTATAAAATTGGGTGTTTTCTTAGATATTNGCTATCAAMGTCATTTTTCTTGG 
ATCGGACGCGTGGGTNGAAGCTTGTACACCTGCCCGGGCGGGCGGCCGCAC I I I I I I I I I 
I I I I I I I I I I 
Sequence 463 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGAAAG 

CATACAACTGAGGGTTCACGGATTTCTAAGAAATGCATTTCCCTTGTCTATGTTCATCAG 

CCTTTAATACTTTGGCTACAAGGCATATCAGAGAAAGGGAGGTTAAATTGGGTAATGACA 

AAAGAACATATGTAACTCTGGAAATAGGAAAAATGTTCCAGAAATGGGATCAATGTGCCA 

GCMTMGCATAGTTCATTTCATTTGAAAATTCAGTTAAAGAGCCCAATAAACAGTTCCA 

AACCG G ACG CGTG G GTCG A AG ACCT 

Sequence 464 
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ACTATNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACTCAAGCTTTCAG 

ATATAGGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAAC 

TTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGCATCACAAGTTGTACCTGCCCG 

Sequence 465 

CCGCGGTGGCGGCCGAGGTAATAGCTAATGCATGCAGGGCTTAATACCGAGGTGATGGGA 

TACTTCATGCTAACTGGGTGTTCCTTAGTGTAATAATTGACTGCCTGCTCCCATGTGGCC 

TCACAGAGTCATAGCTTCTTGCCCCAAATGTTTTTCTGGTTTTCTGTGGAGCCATACAGA 

ACCATAAAG AGAAG TTGTTGAGAAACCGGTTTGACTTTCAAGTCTGATGAGGAGCCTGGT 

TTCCATGAGATTTTCAGCTMGTTAGCATATTTTTAAAAATTTCAAAATGAACAGAGGAA 

AGGTTGACGATCTTTAGAGCCTTCTGCCAGGCTGAAGTGTTCTGCATCTTCCCTAGACCC 

GGTACCTGCCCG 

Sequence 466 

CCGCGGTGGCGGCCGC CCGG NCAGGTTACAAGCTTCGACCCACGCGTCCGGGAAATTTTA 
ATTA AAMT AGGTGAACATTTTAA ATGA CCTAATACATATTTAGTCCACATTGAAACTTT 
GGCATTTTGTCATTGCCATTAAAATTTTGATGGCATTAAAATTTGATGCCATTAAAATTT 
TGAT 

Sequence 467 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTC 

CTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTT 

CACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAG 

GGGGTTGGTGTTGGAGGGTTGGAGGAATGGAAAAGGCCCTGAAATGCAGGCAGAGAAATG 

ATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTTTACAGCAGCCTCGAAGA 

CTGATCAGGTTACTATACCCTCTCTTCTGTCCACGTGCATTTC 

Sequence 468 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTT 

CCCC AAACAGTAA TATG GACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTA 

TTTTGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGC 

T GTGTA AAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTC 

TATT7TAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATG 

CGGACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 469 

GACCTCTTTAAGTGAAACTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACAACAACAAAAACCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 470 

GCTCC CCGCGGTGGCGGCCGCCCGGGCANGGTTACAAGCTTCGACCCACGCGTCCGGGAA 
ATTTTAATTAAAAATAGGTGAACATTTTAAATGACCTAATACATATTTAGTCCACATTGA 

AACTTTGGCATT7TGTCATTGCCATTAAAATTTTGATGGCATTAAAATTTGATGCCATTA 

AAATTTTGAT 

Sequence 471 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCACAAAGATGAA 

TAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACCCTCCAGGAG 

CCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTNGGAGGAATGGANAAAGGCCCTGAAA 

TGCAGGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAATATCTT 
Sequence 472 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCG 

GAGCGTTGCTTGGATTTCTAATTACTTCTAAGNGTAGTTTTATTTAATTTCAGTCCTTTA 

GAAAAAANAAAATAAAANNAAATGTGCGGGCCCGGCCTGCCCGGGCAGGTNCCACNCGTT 
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CGMAAAAGAAAGAAMAAAAAAACTTTCTCTTTGCCANTTCTTCTTCTTCTTTNTT 
Sequence 473 

CTA CTTA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I I I I 

TCCTTTTGAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCA 

ATC CTCCCGCTTT CAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGG 

CCCTTTTCTTTTTCATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAA 

GAATGTGAACTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACT 

GTTTCTTAGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGAGACTGCTTCTCCTGGTTG 

ACCTCATCAATGCCCAACC 

Sequence 474 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGT 

CATCTCAAAATTCCAAGCTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCT 

TGMGATTTACTTTGTATACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAA 

AGTGCCGAATTGAAATCCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACC 

AACCCATTCTACCTAACTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATG 

GTCCACAAAAAAACACCTCATGGTCACACTTAAAATAGTTTGCATCCCACACTACAGGCT 

TCTCTGGAGGATTTAATACTTTGGAAGTAGCATCATAAG 

Sequence 475 

CTNCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGACCNGTGCAAATATCTACCCA 

GTTAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTCTGAACTCATGAACCTC 

CTCAGACTGTTGCTGGGACTCCCAGATATCAATACTCTGAGAACCACTGATCTAATGTTT 

CTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAAGTAATATACTTCCGC 

CTCTCTCTCCCCCAACTGACTTTAGTCAATAGTACCT 

Sequence 476 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAACGCTTCGACCCACGCGTCC 

GGTTTGGGTGGMTTATAATATTTTAGATAAGATTTAAGAGGATTGCTAGATNGGAATGC 

GAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCGCTCTAGAACTAGTGGNTC 

Sequence 477 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCACAACATTCCCCCTTCCC 

CAAACAGTAATATGGACACTGATTTAACAAGACTTATAAAAAAATAAGGCACATTTATTT 

TGATATGGTAATTTTAAAATAGAAACCCCTTCTCAGAACACCTGTATTCAAATGAGCTGT 

GTAAAAAGACACCTTGTGGTACCTAAAATAGGTTTATGGTACCTATGGAATTGCTTCTAT 

TTTAGTGAAGATGGAATAAATTGCACCCATCCCACATTGTCAAGTAATGAAAATATGCGG 

ACGCGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 478 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTATACTAGAA 

GATGCTC CAAG GTTTCAGAAAGGMTTMTTACTTTCAATTTGCACAATTTAGAACAAAT 

A TCTGG CTTTTCCCTAAGCTTAATGATTTTCCATTTCACACAACTAAAATATAATAGCAT 

TATTTTATAATCAAGTTTAACTGATGGTCTATGATAGTAGAGCGATTTAGTATTTTGACA 

AAAATCTTATGAGACATGAAGTCATTCAATTTGCCGGACGCGTGGGTCGACTCAAGCTAG 

ACCTN 

Sequence 479 

T TAGG GCAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTTT 

CTTTTGTTGTTAAGGGCCCATGACCTGCAGTTTCCCTAACATTCATTTTTATACAGGGCA 

GAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCCAGGCAG 

ATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCTAAAAGG 

ACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACCTGCCCG 
Sequence 480 ' 

ACTACTA TAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC 
CGCTTTCTTTTGTTGTTMGGGCCCATGACCTGCAGTTTCCCTAACATTCATTT7TATAC 
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AG6GCAGAGGTATGTGTGCGAGCTCAGATACCTTAAATTCATATGCCTTTAATACAATCC 
AGGCAGATTTCTAAATGAGGGATGCTTCCCCACAAATGGAGAGTGAAAGTGGGCCAGCCT 
AAAAGGACCTCCATAGCACTGTGCATGGCCAGCTGTTTGTGGCTGTACC 
Sequence 481 

GACCTCTTTAAGTGAMCTTAGTATGCTAATAGTATGATACGCCCTTTTGCTGTAGCAGT 

TACCATAGTTACAGATAAGTATATATTGTTGTGATGATCTGATTTCCTGGTTCCCCCACC 

CCTGCAAAACMCAACMAMCCTTTACCAGGCTCTATAACAGGGGGACCAAACTTGTTT 

TTGCTCATCATTGCCGGACGGTGGGTCGAAGCTTGACCTGCCCG 

Sequence 482 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTGACTAAAGT 

CAGTTGGGGGAGAGAGAGGCGGAAGTATATTACTTTTATGCTTGGTTATACTAGAGAACA 

AATAGAAACTGACTAAAGAAACATTAGATCAGTGGTTCTCAGAGTATTGATATCTGGGAG 

TCCCAGCAACAGTCTGAGGAGGTTCATGAGTTCAGMTATTTTGATAATAACACTAAGAT 

GGTATTTACTCTTCTAACTGGGTAGATATTTGCACTGGT 

Sequence 483 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGA 

CCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCAC 

AAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAACC 

CTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGGGTGGAGGAATGGAAAAGGC 

CCTGAAATGCANGCAGAGAAATGATGAAACAATTCAGGGGCTGTGGTGAGGTTAAATGAA 

TATCTTTACAGCAGCCTCGAAGACTGATCAGGTTACTATACCCTCTNTTCTGTCCACGTG 

CATTTNAAAAACNTTGGCCGNTCTAGAACTAGTG 

Sequence 484 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATGGTTTTTGCAAAAATTGAAAA 

TGCATCGAT ATTACA GTTMTTTTTTCAGTGTGTATGTGGTATTAGGCTTAGAACTATAA 

CACAGGMGTTTTTAGAGTATGTCCACTCTGGTTTACTCCTTTGTAAGTATTAATACCTG 

ATAATTTACATCCTACAGCCCTGCC I I I I I I I I I I I I I CAAGTTTGTCCCAGCAAGTCTT 

GGCCCTTTGCATTTTCTTAATACATTTTAGTACCTGCCCG 

Sequence 485 

CCGCGGTGGCGGCCGCAC I I I I I I I I I I I I I I I I I I ACCTGAAAATGCTTATTCTAGCTT 
CACATTTGATTGTTTGGCTMGAAGAAMTTATTTATTA 

TTMAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCATT 
CTGATTCTTGCMCTTACAAGTAGTCTTTTTTTGTCTAGACGCTT^ 
TTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 
Sequence 486 

ACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTGGAGGCCCAAG 

TGCTGAGACTCAGGGGACGCAGGCCTGAGGACAGTTATGCAGGGTGCAGCGGCCTACGGT 

AACCCATGTAGCAACAGAAACCCTTAGCTAACTGCCGTAATTTAAGGCAATTAGGAGCCA 

TTCATCATCCAGATGGCTATTGGCTCTAATCGTTACTGGCTGAAGGAACTATATATAGCA 

GCTACCTTTCCGCTCCATTCCCAGGCCTTGTTCCTGTCTTCTGGTTGCCAGTTCTGCACT 

CACTCATTCCGGACGCGTGGGTCGAAGACCT 

Sequence 487 

CTAT NGGGC GAATTNTTNCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACGCGTCCGA 

TCACTTTTTTCATTNATACCTTATTAGATAAAACATTAGCCCCCTAGAGTGTTTTGTGAA 

GGAAATATGCCTMTAAGAGATGATAGTTTTAGCMTAAATGAGCATTAGAACTATTATT 

TATTAATGAAATGAACTGGTGGTCTGAAAGTGATGATAAACAGACAACTGTGQAAAATGA 

ATTATTMAATTCCATGGMTTCCTTTTGAAGTTTATGAAGT 

Sequence 488 

CCGGGCAGGTACAAATCMGTCATTAACATTTTCAATGTCAAAAATACAGCACGCTGTTA 

AGAGTTCTGTCAGTGCTCATTATCCCACTAGATCCCACAAAGGGCAAACTCAAAAGATGA 

AACAAAGGCAACGCCATCAATAACCACCATATTCCACAGGCTTTCTCCCCTAGGACGTAC 
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CTN 

Sequence 489 

CCGCGGTGGCGGCCGCCCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGT 

TGCTGCAGTGTAACTAAGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTT 

AGATCTAGGTGA GTTCT ACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTAAGTTA 

TAGCTGTAATTMTTTTCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGC 

ATTTAGACTTCCTTAGGCAAAGTCAAGGCAACAAGCTGATGATTCTAAAGCTATTNTTCA 

AGGGAGNTNTTTACCCATCATAAAGGNGGTTTTTAGTCATTATAGATAATATTCAATCAA 

TTANTACCGGGGGATGGCAAAAA 

Sequence 490 

CCGCGGTGGCGGCCGAGGTGTMTTTGGAGAATATTTAAAGCAAAAGAGCAAACAACAAA 

AACTAAGTTAACACTTACCCAGTGCAGTAAGGGAATTGTAAGATACAGCCTGCTTAAGGA 

GGTCTGCAGACAGATGCACCTMGATTTCAGCTGTTTTAGGTCACTTTTCTCAAAATATT 

TATTATCTGGCAA TGGG GATGGGAGTGGGGAACACCTNTCTGTGAGGCAAATGGTATCTC 

MCAMTACCGACTTTTCAAGGAAGAAAGCTCTCCACTTCTCTCATAAACTTATATACTA 

CCTTAAACAGTATGCAGTATTCGCGGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGC 

CCGCTCTAGAACTAGGTG 

Sequence 491 

CCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACCAAAGCTGATCTCTCA 

TTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAAAAGTTTGAAATTCAT 

TTTCTTCAGGAAAATACCCAGCCAGTCCCTCTCAGTGGAAACCCTGGTTATGTCGTGGGG 

CTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCCGTGTCAGCTCGTA 

GCACAGMGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCCAGATTACGTGGCCCCTTTT 

GGAAATTCCCAGGCCCAGGGACATGCTGGACTGGGTGCCCATCCACTTNATCACCCAGTC 

ATTCAACAGGGA 

Sequence 492 

CCGCGGTGGCGGCCGAGGTACTATTGACTAAAGTCAGNTGGGGGAGAGAGAGGCGGAAGT 

ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 

NATCAGTGGNTCTCAGAGTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 

TGAGTTCAGAATATTTTGATAATAACACTAANATGGTATTTACTCTTCTAACTGGGTAGA 

TATTTGCACTGGT 

Sequence 493 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGA 

ACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGA 

ACTGGATCAGAATGGTCCAAGGACTGTTMACAGAGGAAGTATTTACATTTTGAAAACTT 

GCGGACGCGTGGGTCGAAGCTTGTACACCT 

Sequence 494 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTMACAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCT 

Sequence 495 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAAA 

CTTAMGTATMTAAAAAAAAGATTATTTCCTATACTTCAAAATCMCAAGATTTGATTG 

CATTTAMTTTTTCTGTCCATTTGTTCTTCTATGTAATACTTTAAMATMTTGGCATAA 

AAATTCAATCAATTCATAAAAGTCCAAAGCAAAAAAAACAAATCTACTGACATNTCTTGA 

GGAAGAAATGATCAGGATTGACATTAATGAACCCTCTCACAGAGACCACTACACACACAC 

ACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTTAATAAGACTGAAATGATGCCA 

TACATGCTTTTAAAAAATAAAAAGTATTAAATTTTAAATTTTACTTCAATATM 

AAAAAAAA 

Sequence 496 
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CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGA 

CCCACGCGTCCGTAGTAATAGGGAATTAACTGACCCCTTTTGGATGGGGGAGAGCATCAG 

GCTGGGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGC 

CCTGACTTGAAACCATTAGCGCT^CTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTT 

GCATGAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAAT 

TGTCTCTGTGATAAAAAMAMTGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGA 

AGCATTTTCTAGTGATAGMTGTATTTGTCTTTTTTCCTGGGGGGNACCTCGGCCGCTCT 

AGAACTAGTG 

Sequence 497 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCCCAGGTC 
AGCA CTTT GAACACTTGAACATCTATGAAATCACATAGTAAAGTGATAGGAGATGGGGCT 
AAGCTTTTAATGGCCTTTAGACATAGCATTAGACATAACCTAAGCTGAAAGGCTTTGGGA 
AGTTGTTGTGTTAMTCCCCAACACACT 

AAAAAAAAAAAAAAAAGTTGCGGCCCGCTCTAGAACTAGTTGGATCCCCCGGGCTTGCAG 
GN 

Sequence 498 

CCGCGGTGGCGGCCGAGGTACCCTTTTATAAGGGTGTATCCCCTTTTGGTAACTTACTGT 
TTGTTMTTTGTAGTGTTCCCTGCCAGTAAGCTTGTAACACTCTAGTGACTCACCTTCGG 
GTGGGAGGGTAGGAAAGGGAGAGGCCTGCCTCCTAAACCTGGGAAGATGGGGAGAGAGTG 
GTAAACCTGAGAGCCCAAAAAACAAACCAAAACAAAAAAAAAAAAAAAAAAAAAGT 
Sequence 499 

AGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGT 

CCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTG 

GACTTCTCAGCTTTCCTCATTAGMGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAAAT TAAG TCTAATMATAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 

Sequence 500 

CCGCGG TGGC GGCCGAGGTACTATTGACTAAAGTCAGTTGGGGGAGAGAGAGGCGGAAGT 
ATATTACTTTTATGCTTGGTTATACTAGAGAACAAATAGAAACTGACTAAAGAAACATTA 
, GATCAGTGGTTC TCAGA GTATTGATATCTGGGAGTCCCAGCAACAGTCTGAGGAGGTTCA 
TGAGTTCAGAATATTTTGATAATAACACTAAGATGGTATTTACTCTTCTAACTGGGTAGA 
TATTTGCACTGGT 
Sequence 501 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCC 

AGAGAAGCTGAAAGCCTTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGA 

TGTGGCCTCGAGGCTGGGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACA 

GTGATGGCTCAAATCCAGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTT 

TCCCAGGTGTGAAATGGGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCA 

CACAGCGAGCCTNAAAACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGAAGTTT 

GCTGTCACTTA 

Sequence 502 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTTCTTA 

TGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGT 

GGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGATTG 

GCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAGTGATTTCTTGTGAGATC 

CGGTTGTTTAAATCCANAGGCACCTNCCCCTACCCTCTAGCTCCCATTCCTGCCATGTAA 

GACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTTTTTGAGGCCTCCCCAGA 

AGCTGATGCCAGCCCTATGCTTCCTGCACAGCCTG 

Sequence 503 

CTACTATAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTTTTGAAATGCACGTGG 
ACACGAAGAGAGGGTATAGTAACCTGATCAGTCTTCGAGGCTGCTGTAAAGATATTCATT 
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TMCCTCACCACAGCCCCTGAATTGTTTCATCATTTCTCTGCCTGCATTTCAGGGCCTTT 

TCCA TTCCTCCACCCTCCAACACCAACCCCCTACTACGCTGAGGGCTCCTGGAGGGTTCT 

TTTTCTCTCMTCAATTCATCTTTTCTGTGAACCTATTGTTATTATTCATCTTTGTGCTC 

CCAGCATGTAATACAATGCCTGGCACTTAGGAGAAGTTATCACGGACGCGTGGGTCGAAG 

CTTGACCTGCCCGGGCGGC 

Sequence 504 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTC 

CGAAATTCTTGGTTAAGGATTATTATAAAATAGAATGGTATTTCAGTAAATCCCTGAGGC 

TTAGGAGTCCAGGTACAATGTTGGTTCTCAATTAAAATATAAATCATGTCTAGGGACACT 

TAGGMCACAGAACATATATTTAGAGCTAGAAAATATACAGCTTCAGACCAGGCAAAGTG 

CTGGGATTACAGGCGTGAGCCACCACGCTCGTCCTCACATGGGGTTTTATTATTAGGATG 

GTMGAGTATTATAAGGGATTNGGTACMGGCATMTGAGTCCTTTTGCTTTTTAGGCTT 

TTGACTTNTGGTTTTMGACTTTTNTTTAGCTTTTTGTTNGTTAGAC^ 

AGGCTTNGGTTTTTTAATAAAGTTTGCTTGGGATNAMCNTGACCTTAATGGMATTGTC 

CCCTNCCCCCAAAA 

Sequence 505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTACTGATACAAATAGCATGGATGAAACTCAAA 

ATCATTATTCTMGAGCCAGATACTATAGCCTGTATTTTATGATTCACTTTCAATGAAAT 

TCTACAATAGACAGAACTATCTATCAACAGAAAGCAGATCAGTGGTTTTCTGCAGCCAGA 

GGTATGAAAGGTTTGMACATGTGGCACCAGTAGGACATATGGMACTTTTTTGGTGTGA 

TGGAAGTATTTTTTTATCTTGATTGTGTGGTGTTTGTTATACAGTGGTATACATTTGACC 

T 

Sequence 506 

GGGGCGGCCGCCCGTTCAGGTACACGTNTTTNCCMCCAATTTTATANGNATATATATAT 

TCTACTTCCAACACCCNTNTTCATCCTGGTNCAATCAAAGCCTGGTTNTGGCCAAGAANA 

AACTCGTCAGGAGATCGAAGGNTGTAGATGTCTGCACGTGGCTTCCTTGGAGGTCCAGNG 

GNGACTCCCTCTTCCAAAATCCATTCTGTACCCGCTGGCTGCTCTAACGGGCAGGACAAC 

AGCGTATGAAGCCTGACTGCAACTAGGAGAAGTACCACACTCCCGGACGCGTGGGTCGAA 

GCTTGTACACCT 

Sequence 507 

GGCCGAGGNCAAGCTTTTACCCACGCGTTTGAANCCATCTGTTTGGNACCCNGAAAGGGG 

GCAGGAAAGGCTGGGGTCCCAGNCCACCCTAAGGGNATCTGAGTGGCCCAGGGCTNCAAG 

NNNNCCCACCTGNCCAATGGGACCCTTTCTGNCCTCACCCTACAAGGGGCACAAAGGGAA 

GACACCAAACCTGGCAGGAACTTTTCACGCAATCAAGGGAAGGAAAGGCANTCCTGGCAG 

AGGGAACAGCANGCCAAGCGGGAGAAGGCTCAAAGTAAGGAGGGTAAG 

Sequence 508 

CC GCGG TGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGCTTTTCMTTTTATTGTAT 

AGTTTTTGATMTGTTAATGTCTGAGATCTTTATGGGTGAGTCTGCTGTCATTTCTGCTA 

TTTCTCGTAGTGATTTGCTTGTATGGTTTATGATTTTTTAMAACTGMTGTGTAT^ 

ATTGTGTCTGGTMTTCTTTAGGGACCCATTGTAGATGTATTTCTTCAAAGAGCATTTGT 

GGTTATTATATTTGGGTGCTTGGGGCACTGCCAGTACCTGCCCG 

Sequence 509 

CCGCGGTGGCGGCCGAGGTATTGAACCAGGTCAAAACATTGTTGAATATCAAACCCAATC 

TATTTAATCTGTAAGAAACAAGGACCCTGAGAAAGATTCTGACCAAGGGTATGTGATCGG 

AAACTTGACAGATAAATGTAGTATACTTGTAAAGCCATACTGTGAAAAACTTGGGGATTA 

TTTGAACACAAATTATCACCTGGAAAAAGACAGAAAACAAGGCAGAAGACTGTGCAAAGA 

GGTTGGAATATTCAAAACTTCAGATTAGAAG 

Sequence 510 

CCGCGGTGGCGGCCGAGGTACCCAANNGATATCACTGTGAAGGTTTGGATACACTGACTG 

AGGAAAAAAGAAGGTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGA 

ATCAGAATGGCATTGGACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATC 
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TTTAAACTCGTGAGGAAMTTAAGTCTAATAAATAATTTTCTTCTTAGCCAAACAATCAA 
ATGTGAAGCTAGAATAAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGT 
Sequence 51 1 

CCGCGGTGGCGGCCGCGACCACGCTTGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCG 

TCCGAMTTAATGAMTGTTTTACATTCTTTTAAAAACCTTTGAAATATGGTGTGTATTT 

TATGCTTTAGCAAATCTCAGTTTGGACCATTTCAGGTGGTCAGCAATTACACATGGCTAG 

MCTAAGAGCAATCAGTTTTNTTCCACAGTTTTTCTAAMTTTTCTTGTCAAAAATCTTG 

ATGGTATG MTTAC TCTTTTAAAAAGTGCACTTNACCAGCAACAGAAAANAACCCTGGAG 

GGGTATGGGTTTTAAAGCTGGTACCTNGGCCCGNTCTAGAACTAGGTG 

Sequence 512 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCA 

CAGGAATGAGCCACCACACCCAGCCAAATTGGGCACAAATTTAAAATTTGACTTTTATTA 

ATGATATGGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCA 

AATCATTTAAAAATATCTAAACTATAATTTCCTGTAGTTCACATGAATTGGATATTCTGA 

AGCGGACGCGTGGGTCGACTTGTAACCTGCCCGGGCGGCN 

Sequence 513 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAAAGGACAAATACATCAGTAGAAAAGAAGA 

CAATATMGGGCAGATTGAAATATATACGTGAACGTCACAAAGACCAATTACTGCCATTT 

CAATTCAATGAGGAAATAATGATGTATTTAATAAATAGTGCTAGAATGCTGCATTATCTG 

TCTAGGATGAAAAAAAAAAAAAAAAAAAAGT 

Sequence 514 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCACTATAGGGATCT 
AGATCACGAGCGGCCGGCCGCAC I I I I I I I I I I I I I I 1 1 I I I ATTTCTGCCACCTCTTTC 
ACTTGGGAATCTATTTTCACTGCTCTCCAAAGTTTTGAGAAGGCAATAGTCCTGGAAAAT 
GGGTCCTGAGCTCCTCTCAGCAGTCCTGCTTTCTTTCCACCTGCACTGTAAGGNGACCCT 
AACTGGGTCTAAGACAAAAAGTCGGACGCGTGGGTCGAAGCTTGTACACCTT 
Sequence 515 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTAGTTACCACTTCA 

TTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGACTTGAAGCTGGAAGCAAATAC 

AAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCGTCTCCCTAGCTCTCTATGGCTCTAC 

CTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTCTGACTGGGGATAGGACCACT 

GCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTTCCAGGAAATCCCACCCTGGG 

ATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCACCTAATGAGTGCTCTAGAGAT 

CAGCACCAAAGATGCTTGCCAGAGTCTTCTCTA 

Sequence 516 

ACACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGGAACCCTTGTTCAGGTTCTTTTTAGGCAACCCAAGCCAAGACAACAAAGTA 

AGATAGAGCCCCAAATGTGGTCGTATAAGGTTTTTCAAAGAAAGTAACACTTGAGTTAGG 

TCTTAAAGTTTNACCTAAGAAACTGCCCAGGTGGACAAGAAGAAAGGGTGTTCCAAGTAG 

AAATAATANCATGGACAAAGGCAATGTAGCAGGAAAAGTNTTCGTAAATTCAGGGAATTT 

CAAGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTTGCGGTGGCAGGGGATGTTG 

GAAATGTAAACAAGAGTGAGATACAGAAGATTTTATGTGGCATGCCAACTGGGACTTTTT 

TTTGTAAACAA 

Sequence 517 

TCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGCTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAAT 

GTTCTAGGGNGGGNGAGGAAANCTAGGGTGGNTATCTAAATCAACAAATATTCTAGATAT 

TCCAATATCTAAATTATTGTTGGAAATACTCNTCCTGAAGNGNTCATTTGAACNCTAAAG 

CAGGAGNACAGCNTTTGTTGTATCAANATGGGCAGGGGTTTTTAAAGGGTNTCCATTTTT 

TNTTANTTTCCNCATTATTAAATTCCNTTNrTAAATNNTTTTTAGGACCMAAATTTTTCC 

CNTTTCTTNGAGGTNTTTAAAGGGGGATTTAANAAAATGGGNNANNTGGGGGGTTT 
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Sequence 518 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGGTAGAGAAAGATTCAA 

AATGCTGCTGTTCTACCTGAGATGGGAAAAATGAAAGCAAAATAACATCAACAAAAACAA 

ACAAACAGCCTTGTAGTTCCATGTCACTAGCCAGGGATTTAAGACCAGCCTAGAGAACAT 

GGTGAGACCCGTTCTCTACAAAAATAAAAATAAAAATACAAAATAAGCTGGACATGGTGG 

TGTGTACCTGCCCG 

Sequence 519 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGCTTCGACCCACGCGTCCGCTCACT 

TCATCCTCCCAGCAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTG 

GGGCTTAAGGAMTTMAGAGCTGTTGTGGGACTTCCMAGCAGMGACAGTAGGCTTTC 

AGAAATTTGATAAAMTAGCACTTTGCATTT^GMTCTTGAGCTAAATG 

ACTAAACATTCTNCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAG 

AAGAAAAGGAAATGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGGTTAAGGC 

TTTTGGCACCTGCCCGGGCGGCCCGCTCTAGAACTAGTGGGAT 

Sequence 520 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGAGCTATGGACCTAAGGCAGCGAGTGGATTCATTAGTCCTCTTTCAGCTGA 

ATGCATGCTACAGTATMGAAAMAGCTGCTGCCTATATGAAGTCTTTGAGAAAGGTTTG 

TTA GCTGC TGTTAATATTTAAATCAGAGGAAACATCAGGAGTCATTCTAGAGAATGGCAA 

GAGTTTTTCTGCAGTTTATATTGTTGACTTTTTATACGATATTGGGGTACCTCGGCCGCT 

CTAGAACTAGT 

Sequence 521 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCGGC 

TAGGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAAGACTGGCCAGCATTAGAGGTGGGTTG 

GCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCG 

TAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 522 

CGACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I M i I I I I I I I I I I 

TTTTAAGTAAAGAAGGTTTTATAATATAGTGAAAACAATACGGAGATGAAAACCAGGAGA 

CCTGGGTCCCGCCTTTGTTACAAATGCCTTTCCTAAAAGCTCCAGAATGGTGCGAGGTCA 

AAACAGATGGGCAGAAAGGAAGTGGTCATCAGAGCAAGAGAAAGAGCAGGTGCCAGGCAC 

TCACGTGTGCGGTCATATCAGGTAGAGATGATGAGTAGAGATCTGCCCTAGAAGACACTG 

MTTCTGAGATTCAAAGGGGAAAAGTTGATTTTATAGCCAGTGATTTTATAGCCCACTTT 

CCTGCCCCACCCCTACTNTAAGAATTGCGGACGCGTGGGTCGAAGCTTGTACCTGCCCG 

Sequence 523 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCCCAGAATA 

ACTAAATAAATAAAAGGCTAAAGAAAACTGGAACAGTACTGCGTCTCCATCTGAGACGCA 

NTCTTCTACTTCCAGCATCGNAGAGAAGGGCTAGGGACA Ai MUM I CAAAAGATTTAT 

ATACAGGCTTGAATCCAGMATTAAGGNTAAAAGCATAAATATTGATAATTTCAACTAAA 

TTCAGAATGGNTTCAGAAAGATATGATACAACAATTTAGAATAAAACAAAGCAGAAGAGC 

ATNATATTTTGCGGACGCGTGGGTCGAAGACCT 

Sequence 524 

CCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGTCCGCTTAAAAG AI HUM 
TTTATGTAAACTGTTGMTATTTGAMTAGTCCACTTCACCTTAATGGGTCTTGTCTATC 
TTCATTAG TCTTCA AAGAAAMCCATTTGCTACCAAAGTAAATCAGTATTTTGAATGTGC 
TTCTCTTGTTTTTTGTTTATTAGCTAGTT^ 

MTCGTMGGAAGCTGTTTCTTTTAAAACACAAACCACCACCAAAAATTTAAATGTACCT 

GCCCG 

Sequence 525 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTTCAAGACCCGCCTGGC 
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CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 

TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGGAGGAGAATCACTTGAACCTGGAGGCA 

GAGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACT 

CTGTCTCAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGCC 

TTTTCTGGTCTTTGAACACATCATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACAT 

GAACTATAAACGGGTAGACTAACTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGC 

TGAATT 

Sequence 526 

CCGCGGTGGCGGCCGAGGTACCAAAACCCGGCTTTTTTCGAAATACCTGCAAAAAAAAGT 

GGATGATTCCAAATCCAACTGAAGTGTCTGCTCTCTCTCCAATTCAGAACAACCAGAAGG 

GCCTGTCTTGAATTAGGTAATGCATTAAAGAAAAGTAGGATTATTATATTCCAATTCTTT 

CCATCAGATGTAAAACATTATTGGTAGATCACATCTGTTTTAATAAATCTGTAAGAAAGA 

CGTGTAATTATAATTATGTTACCATTGTATGTAAATGGCATTTTAACAAGACATATTAAT 

ACATTTTTATAGAGTACCTGCCCG 

Sequence 527 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGT 

CCGGTTAATTTGGGAGAAGGAAGAGAGAGTGACATATTTGGCTACCTTCAGGGAACAAAA 

TCTAACAGCACAGATGGTAGTAGAGGAGATACCAATTTACATATTAAGGAGCTAGAGTTG 

ATGATGGTATGACTCAGCCCTCTGAGATTAAATTCTACTTACTAGGGCTATGAATGGAGA 

TMGTAGGTATCCCACCTTTTATTAGAAGGTTCCTt AAAATAAATATGGGACTCTGGTCA 

GAGAGTAGGGCCATTAATTTGCTCCTGGTTTTTACCTGGCATCCACCCACCAGTACCTGC 

CCGGGGGGCGGNCGGCCCGCCCCGGGCAGGTCCCGCAC I I I I I 1 1 I I I I 1 1 I I I I CCTTT 

Sequence 528 

CCGCGGTGGCGGCCGCCCGGGCAGGTATTGCCCTTTGATGTCCCCATGAGGGCCAGGCCC 
AGGCAGAACCCATCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAG 
GATTAATATGGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTC 
TCnTAGAGTTCAATGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGGAATA 
TCTAAATATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAACATTTAGT 
, GGGGAGACATTCTTTTCTCCTTTTTCTGATAACTTGGTCAGAAGTGATTGACTGTGCAAA 
TGGTATTTCTCAGCTAAAATCTCCCTTATGAACCCTTCTCGAAATCC 
Sequence 529 / 
ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAACA 
AGCAGCCAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGCT 
TGTTTTCCTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTTT 
TG CTTTTA GCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAATT 
CATTTTTTGAAAGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATTG 
CAAACATGTCATCACTCCCTTGATGTTTACCTCCTTGTTATGCATTTTTAGCAGGCTTTA 
TTGTCACCTGAGA I I I 1 1 I I I I I CTTTGACAGGCCGGAGTCTAGATGAAGGAAAATGTGT 
TAGAAGCACCTTATCCACAGATGGGG 
Sequence 530 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGTAGCAGAGCAGCTCCCTC 

GCTGCGATCTATTGAAAAGCTTGAGTCGACCCACGCGTCCGCTGCATAAAAGTTATGCAA 

AMGCATTTTATGATATACCAGCAAAAAACATGGAAAATGAAATTTTGAAAAGCAATGCC 

ACTTCAAAGATCCCTCAAGTGCCTAGAGGGAGAAAATGAGTTAATATGCTTTGAAGAACT 

GTATCC AGAAAA TAAAATTACAAAGGAGGAGAGGGATAGGATTCCAGGACAATCTCAAAA 

CTATTGCTTTTTCCTAMTTCATTGCAACCTTAAAATCCTAGCAAGTTCTTTAATGTAAA 

TTAACAAGCTAATTCTAGAATTCATATGCATATTCAAAAGTCGAATAATTGTCAAGGCTA 

TCCTGTAGAATGGGACAGAGAGGATTGAAATTT 

Sequence 531 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTTAC 
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AGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAGCC 

AAGATCTTGACCTACGCCATCTGATACCCTGAGCCCATGCTATAAAAGAGGAGCATTAGA 

MTATTTGAMGATAGAAATGAGAACTAGTCAATATTTATTTTGCTTAGCACTGTATTCA 

GTATTATGGCATCTTAAAGTAGTTAAGACTCAATATTTCATCAAAAAAGTTTAAAATCTA 

ATCAGAGAAT 

Sequence 532 

CGCTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAATAAGCCCACCC 

CACTAGGAACTATGTTAAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTAC 

TGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGG 

TTGGCCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAAC 

CCGTAAGCCAAGGCACCCAGACCAGTTTCCATACATAGAAAGTTACAGCTGCTTTTATAC 

CCCCTTGCCCCGCCAACGTAGT 

Sequence 533 

CTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCACTTTTT 

I 1 1 I I I I I IGTAGAGACAGGATCTCCTTATGTTGCCCAGGCTAGACTTGAACTCCTGGGC 

TCAAGGGATCCTC CTGCCT TGGCCTCCAAAAGTGCTGGGATTATAGGTGTAAACCAGTGT 

GCCTAGCCTACAGTTTTTTMTTTTATAAMTGTTATTTCTMTTTTTCTCCAAAAGTAA 

AAGTGGCATTCCAATGGCAATATTAATTCAGGTATCCAGAACTCTTAACCTAAATTTGGG 

TGAGATGAGGAAAAGTGTATTGTTAATTTTATGTGTCAACTT 

Sequence 534 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCC 

ACGCGTCCGTAGTAATAGGAATTAACTGACCCCTTTTGGATGGGGGAGAGCATCAGGCTG 

GGGTCAGGTAAGTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCCCAGCCCTG 

ACTTGAAACCATTAGCGCTAACTTGCTCTGTTTTGAGAAAAACTTTCCAAACTTTTGCAT 

GAGAAACTAGAAAAAGGAATGTATGCCACGTAACTGGATTACAGAAATGAGTTAATTGTC 

TCT GTGA TAAAAAAAAAAAAATGAAATATTTTCTTATTGAATTAATATTTTTGTCTTGAA 

GCATTTTCTAGTGATAGAATGTATTTGTCTTTTTTCCTGGTGGTACCT 

Sequence 535 

CCGC GGTG GCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCACGGCCCAGGCAGAACCCA 
TCCCATTTTATCCTTAAACTCAGAAGGAAATTTGTCTAAATATTAAAGGATTAATATGGG 
AATAAAA A ATG AACCTT AAA 
Sequence 536 

GAANTGGAGCTCCCCGCGGTGGCGGCCGAGGTCCAGTAGATTTGGAGAGTAATACAAATC 

CTTTCTTTCTGGTTAGAACACACTGCCAAAAGCCACCTCTTTCATCTAAGGAAAAGATTA 

AAAATGCATGTTGATATCTCCTAACTATCACACAACTTCCACTATTACAATGAAAAATCT 

G GTCCC CTTTCATTGCCTTTGAAAACCNTTTTGCCGAGGTGGNTTTCAAAAAAACNCGNG 

ANTTTTNAAMANTTG GNTTTGGTTTTACCNGGGGAMGGGGACNTTTNNCNNTTTTTTT 

I 1 1 H I I 1 1 I I I 1 1 I I INAA ANGGNGATTNGGTTNNGGTTNTNCCTGGGGCCAAAATNCC 

NTTT TGNGGAACCTTTTTTGGGGTCCNAAAANNNACAAAANAAAGGGNTTGGGACNATNT 

TTTTGNATNCNCNCNAAAAAAA I I I I I I I I I I I 

Sequence 537 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTTCGACCCACGCGTCCGCTA 

GGAACTATGTTAAAAAAAATTCAAGAAAGAATTTAAGGGAGATTACAGTGTTACTGTGAC 

ACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGGCC 

ATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGTAA 

GCCAAGGCACCCAGACCAGTTTCCATACATAGAACCTGCCCG 

Sequence 538 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCG 
ACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGGGAGCA 
CAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAATTGATTGAGAGAAAAAGAAC 
CCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAGNGGTGGGAGGGAATGGAAA 
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AGGCCCTGA 
Sequence 539 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGACAAGCTTCGACCCACGCGTCCGCAAGT 

TTTCAAAATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATCCAGTTCAC 

CAGTAGGTTGGACAGCATATMTTTGCATCA7TTTGTCCCTTGTAAATCAAGATGTTCTG 

CAGATTATTCCTTTAA 

Sequence 540 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCCATTTCCCTGAAAC 

M GCAGC CAGCAACTATCTCAGAAATGTGTCATTTTTACTGGTTATAATTCTTAAAAAGC 

TTGT TTTC CTAAGATATGAAATGCCTGCCAGTATACAAACTGTTGTAACTACTTCCCTTT 

TTG CTTTTA GCGGGGAAAAAATAGCTTAATGACAGCATAGAATCATGTAGTAAATATAAT 

TCATTTTTTGAMGGTTCAGCTATATCCTCTTCCATTTGTTTATTTTAAATGATCTAATT 

GCAAACATGTCATCACTCCCTT 

Sequence 541 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGA 

TTGTTAAAAGAGGAGTCTAGAAAAATTAATCCTGAACCCTAAAGAATAAATCTTAAGTGG 

TGGATACATGGGTTGAATAGTGTGCTCCAAAATTCACATCCACTTGAAACTTCAGAGAGT 

GGCCATATTTGTAAATAAGGTATTTGCGGGTGTAATCAGTTAAGGATCTCAAGATAAATT 

CATCCTGAATTATAAGTTGTCCTTAAATCCAATTACTGGTATCCTTACAAGAAGGTGAGA 

GGAGACAGAATAGAGCCATCTGAAAAGGGTCAGAAA 

Sequence 542 

CTAACACTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 
CCACGCGTCCGCAAA AATC AATCAAGGGTTCCTACTCAAGTAAAAAGCAACTTGTAGGAA 
AATAATAGGGGATATATTTTGCTCATTAAGGATCTTTTTATAGTGGCTCTTGGTGCACTG 
CCTGTGAGTTAGCCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAAAAAAAAAANGT 
Sequence 543 

CTACTATAGG GCGAT TGGANCCTCCCCGCGGTGGCGGCCGAGGTACTTCCTGGAAATCAA 

TTAACTGAGTCTTTTGAAACCCCTAGAGAAGATAGGAGAAAATTGGTTCAGANCGAGCAT 

TTAMTTAAGTCAGCAAAGTCAGMTTTAAMTTGGGCMTTCCTTGTCTACATTTTCTT 

TACACTCAA 

Sequence 544 

CTNACTATAGGGCGANTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCC 

T TAAC T TAGA TCGTTACTNATGTTTCTAAATCANGTGGAAGCACATTTCCTTTTCTTCTT 

CTTTTCTTTTACTGNNAATATCCTNATTCTNTATTTTACCAGTGGNGAATGTTTAGAATT 

AATTTCCATTTAGCTCMGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAA 

AGCCTACTGTCTTCTGCTTTGGAAGTCCCACAACAGCTCTTTAATTTCCTTAAGCC 

Sequence 545 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCACACAGAAGGAGG 

AGGGAGCTAATCCAGTAACAAACATTCAAAGATTAAATTGTAGATATGCACCTCTGTATT 

TGGCACTGTTGATTAATATTATAACACCTTCCTCTCAAAGACAGGCATTCTTAAGCGTTA 

GTCACMTATACCA GAATT TGCTATTCATATTAAAACCACCTTTTAAACTTTATAACAGT 

AACCAATTATTATAGTTTTAAGAAACAAAACGCAATGAGAACTGGGAATGGAATTCAAAT 

CCTCCAAATTCTTGCTATGCTCCAAGCTGCCATCCATAAAACAGGTTTAATTTGGGTAAT 

TTTTCCATTGTGGGGAAGGGTCAACAAGAAACAATTTAAAGACAATATTTTCCAATACAA 

ATAAAGACATACACTTTTTGTT 

Sequence 546 

TACTTAGGGCGATTGGNANNTCNCCGCGGNGGCGGCCGCCCGGGCAGGTACAAGCTTCGA 

CCCACGCGTCCGAAATAATMAGCTAGAAGTAATATTTTTCCTTTTGTCTATTTTCCAAA 

TTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTATTTAAGNTTAAAGGGAATATTTA 

TTGATCACCTCTATGTGCTCANTACCT 

Sequence 547 
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TACTATAGGGCNATT6GAGCTCCCCGCGGTGGCGGCCGCAC I I 1 1 1 I 1 I I 1 1 1 I I 1 I 1 1 I 

TTCCAGTGCTAAAACATCAGATMGAGCCTACCTGACATTTTGGAGAATTTGCTGNGCTG 

GGATTGATATTCCGCATTGCCTAAGAGTAAAAAATAAGACGGACGCGTGGGTCGAAGCTT 

GACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCACAGGAGGCAGAAGGAAATCCTCA 

ACCTTCCGAAGAAGGCGTAAGCCAGGAAGCAGAAGGAAACCCCAGAGGAGGGCCGAATCA 

GCCTGGCCAGGGATTTAAAGAGGACACACCCGTTAGGCATTTGGACCCTGAAGAAATGAT 

AAGAGGAGTAGATGAGCTTGA 

Sequence 548 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNAGGGTCGCGGTGGGTCGA 

CTNANGCTAGAGAATTGTAATACGACTACTATAGGGATCTAGATCACGAGC 

Sequence 549 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGG I I I I I I 1 I 1 I I I I 1 1 I I I 

TACCAGMCATCACATMGTTTATTTCAGATGTAACAGCAATGTTAAAATTGACAAGTTT 

AATTCTTAACTGCACCMGTAAACTTAGCCATTTAAGTATTTTTTTMGTTATTCCCTCC 

AAAAMCTGAGGGAGCTTTTCTTTTCCACCACCACACCATGGTTTCCCMTAGTTCTCTT 

TTTGGAGGACTTTTCMTTGATGAGTAMCTGCTTTAGATATTTCAGAACTTCATTCCCC 

AAATGMAGCTMTCTGGACAMCTATATATTGCATAGATTTCTCTACAGATTCTTTGCT 

TTAAAA 

Sequence 550 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGATGAGGCGGGGA 
GGTGGGACCCCCAAACATATATCAGCCCMCAGCCCTAAGTCTCCTTCTTTATTATTAGG 
AAAACAACAACAACAACAAACAAAAAAATGGCGTCATGAATATGAACAGCATTGTCAGAT 
GAATTAGTTGAAGTGGN I I I I I I I I G I i I I I I I I I I I I I I I 1GAACCTGCCCG 
Sequence 551 

CTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGTGCTTCGCCCCAC 

GCGTCCGNAATAATTGGAAANGGCCTATAGATTAAAAAGCTGAGAAAGTATATGGTAGGG 

AGCACACTCCCCACAAGTATGAACTCTGNGATTACGACATCTCATAAATNCATGAGCACT 

CATGTTGGCTTGCTTTGTAGCTATGAACTTACCCTGTATTATTGAAACGTCAGCATAATG 

ACTGGMGGAGAAATTGGT CCATT TTAGAGCATTACTATTATGCTATCTGTCCATTTAAA 

TTAATMTTGCATTAMTTCATTTTAGMGGNGCTATTACATTNGTAGTAAGAAAGTAAA 

TTCATATATAAATATTTGATTATCAGATGGTTTACTTACAGATACTTATTTTCCTGTAAA 

ATAGGAGAGTTTACCTGAAGAAAAATAAMCTTTTNACTTTTCTGGGAAAAAA 

Sequence 552 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCCCGGGCAGGTACCAAGTGAA 

TTTAAATAATTGGTGTGGATTGGCCAGTAGCTMGMGTGGGCTTTTAAAGAGTNTTGAA 

NATNGAANGGG I I I I I NTTTCTTTTTTAAAAAAGAAAAACAAACTATTG ATTGTCTATAA 

TGA AAAGC TAGGNNTTGCCCTNTTCATGTNTACTCTCCTTCCAAATAGTTATATCCAAAA 

CTGTTTTTCCCTCTCCCCTACCTTGTCCCCCCTATTAAMTANAAACNGGGAfTGATTAA 

TGTCCCGCTCCTGAATACATGTAAAATTTGTACCTCGGCCGNTCTAAAACTAG 

Sequence 553 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC . 

CGCTTGTCAATAAAAAACAAACAATTGCTAAATAAAACAACCTGAGAAAATCTCCAGAGA 

ACTATACTGAGTGAAGGAAGAAAAATCCCCAAAGATTACACACTGTATGTCATTTATATA 

ACATTCTTGAAATGACACAATCACAGAAATAGAGAATACTGGTCACTANTGCATTAAGGA 

AGGTGTGGAAGGATGTAGTGATGGGAGGAAATGTGTATGGCTGTAACAGGGCAACAGAGG 

CNTCATTGTGATGATGGAAGTGTTCTGTNTCTTGGGI I I I I I GAATGTCA 

Sequence 554 

CACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAG 
AACATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGNCCAACCTACTGGTG 
AACTGGATCAGAATGGTCCAAGGACTGTTAMCAGAGGAAGTNTTTACATTTTGAAAAC 
Sequence 555 
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CTACTTAGGGCGAATTGNANCTCCCCGCGGGGGCGGCCGCTAAAGGAATAATCTGCAGAA 

CATCTTGATTTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAA 

CTGGATCAGAATGGTCCMGGACTGTTAMCAGAGGAAGTNTTTACATTTTGAAAACTTG 

CGGACGCGTGGGTCGAAGCTTGTACACCTT 

Sequence 556 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I 1 1 I 1 1 I I I 1 1 1 I I G 
TTTTGGTTTGTTTTTTGGGCTCT 

GAGGCAGGCCTCTNCCTTTCCTACCCTCCCACCCGAAGGTGAGTCACTAGAGTGTTACAA 
GCTTACTGGCAGGGAACACTACAAATTAACAAACAGTAAGTTACCAAAAGGGGATACACC 
CTTATAAAAGGGTACCTN 
Sequence 557 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCACGCGT 

CGGCTTTCAATAGATCGCAGCGAGGGAGCTGCTCTGCTACGTCACAATCTTTCAAAAAAA 

TGMCATGTAAGAAAMGCAGTTTTCATTGTGCTAATTATTGCAGGCCTTCATGCACGTA 

MCCTCAACAAAATGTGTGCCAACAATATACAAATTTCCATATAAACAAAGTCATTGATC 

ACTAACAAAATATAAACATGGNTTCTTTTATATTAG A I I I I I II I AAAAAA AAG CT ATTT 

ACCAGCAAGAAAAAACAAGTACCTGCCCG 

Sequence 558 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGC 

ATTTATTMGGCTTGTATATGTTCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATT 

GATCTTAACCACAAGGCTGAGAAGTTATGTGCAGGGCTTATGATGCTACTTCCAAAGTAT 

TAAATCCTCCAGAGAAGCCTGTAGTGTGGGATGCMACTATTTTAAGTGTGACCATGAGG 

TGTTTTTTTGTGGACCATTTTAAANCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGT 

TAGG 

Sequence 559 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGGCTTCGACCCACGCGT 

CCGCTCCAGGAGACTTCTGCTTACCTCTCAGTGATCAAAAACCGTTTCACCACAGTTACT 

TACCACGTCCTACCGATCCGCATTCTCGCAAGTGTCCTTCACTCCATTTACTCTACTGCA 

TTTTTCACTGTATTTCTCATGCCAAAACTTGGGCTTCTCCACCAGTCTGCACACGTTCAT 

GCTCTCAATTCTCACAGCCATCTAT7TCATTCTCCACTAAACTG7TAGAGGGATTTCTGN 

AGAAATTAAAGAAATTCCTATCACTCCTAAAAAAAAAAAAAAAAAGTGCGGCCGCTCTAG 

AACTAG 

Sequence 560 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACG 

CGTCCGCAAGTTTTCAAAATGTAMTACTTCCTCTGTTTAACAGTCCTTGGACCATTCTG 

ATCCAGTTCACCAGTAGGTTGGACAGCATATMTTTGCATCATTTTGTCCCTTGTAAATC 

AAGATGTTCTGCAGATTATTCCTTTAA 

Sequence 561 

CTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGCTTTATTTATTTCTT 

TTAGGAATTGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTA 

GGCAGGAACATTAGACTCCAAMGCAGAGAMTGCTTAATTTTTCTTCTACCTGTTTCAC 

CACATTCATGTANAACTGTAGTAAAAAAGATGGNGAATCAGGCTGAATCAATCTAAATAA 

CAACTTAAGGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTT 

TAMTGGNGCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTG 

TCATTATGACATTCCATGAAAATGAAGAAAATCTTTCANGAAAAATTTAGAAAATAAAAA 

TGTTTACT 

Sequence 562 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTTAACT 
TAGATTGTTACTTATGTTTCTAAATCTGNGGMGCACATTTCCTTTTCTTCTTCTTT^ 
TTTACTGNTAATATCCTTATTCTCTATTTTACCAGTGGAGMTGTTTAGTATTAAT7TCC 
ATTTAGCTCAAGATTCAAGAAATGCAAAGTGCTATTTTTATCAAATTTCTGAAAGCCTAC 
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TGTCTTCTGCTTTGGMGTCCCACAACAGCTCTTTMTTTCCTTMGCCCCACTTTCCTC 
ATCAGCAAGTTGGTGTGGCAATGGATCATAATAGGTTGCTGGGAGGATGAAGTGAGCGGA 
CGCGTGGGTCGAAGCTTGTACCT 
Sequence 563 

CTACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGTAATCACATA 
AATTCTG C AATC ATCTGTTT ATTTAG CTT AACTGTTTTTTTTTTT ATTTGTTG AAGTTGT 
TGTTGTTATTNCAGTCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAA 
GTTCCATAAT 
Sequence 564 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTCTTCAGATTCAAAT 
GCCMCACTMTTTGMCTTCTTTGGGTCTATGACAGTTTGCAAGCCATACAAACCCAAA 
GAGCTMTCTGTGATTTCTTAACTTGAGAAAATAATAATNATAACCACCACTGGAACCTA 
CATAGGTTTGTNGNTTATTTAACATGACTTMCCT^ 

AAAAAAA AAAGT GACCTGCCCGGGGCGGCCGGACGGCCGGGCAGGTCGCNCTCAAATTNT 
TNMTTTNTTTTGGAAAGACANGNATTTTTTATTTTTGCCAANGCTM 
GGCCTTTAAAGGGGATTCCNTNCTGGCCTTTGGGCCNCCAAAAAGTGCTTGGGATTNTTN 
GGGTNNAAACCCCGGNGGGGCCCTAGCCTACCAG 1 I I I 1 I I AAA, 
Sequence 565 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCC 

GTGCAMTTTGACTTTGTAAATGGCCCTTGGGCTCTGGGAGGAAAGCAACTGTTGGGCCA 

TGTGGTTGTATCTTTAGTTTTGTAMGAATTGCCAAACTGTTTTATAATGTGGGTATATC 

TTTCCACACTTCCAGCACAATGTATGAGTGATCCAGTTTCTTAGCACCATAGTCAGAATT 

TACTGTTGCTACTATT7TTTAGCTATCCTGATAGATGTGTAGTGATATTTTATTCTGGTT 

TTGMGCAGTGTCATTGTCTGGGGTAMTCCTTGAGGTTTGTTGTCTCAGTCAAGGGGAA 

TCAAGGGACATGGACACACAAGTAGTGAATTTAAGAGTGGAAGTTTAATAGGTGA 

Sequence 566 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGAC 

CCACGCGTCCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCA 

AATTCTCCAAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAA 

AAAAAAAGT 

Sequence 567 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGCAGGTCAAGCTTCGAC 

CCACGCGTCCGCATATTTTCATTACTTGACAATGTGGGATGGGTGCMTTTATTCCATCT 

T CACTA AAATAGAAGCAATTCCATAGGTACCATAAACCTATTTTAGGTACCACAAGGTGT 

CTTTTTACACAGCTCATTTGAATACAGGTGTTCTGAGAAGGGGTTTCTATTTTMMTTA 

CCATATCAAAATAAATGTGCCTTATTTTTTTATAAGNCTTGTTAAATCAGTGTCCATATT 

ACTGTTTGGGGAAGG 

Sequence 568 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCAGGTACAAATAAGCCCACCCCACT 

AGGMCTATGTTAAAAAMAATTCMGAAAGAATTTMGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAMCTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAACCCGT 

MGCCMGGCACCCAGACCAGTTTCCATACATAGAMGTTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACC AGTN N AG AG AAAAAAAG G CC ATCTAT ACCAATTCTAAG 

Sequence 569 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGAGCGGCCGNC 

CGGGCAGGTACACAAACCAGATGTATGCANTGATGCCAAAAGTCATCTNAAAATTCCAAG 

CTGACCTAGTGCAACACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGTA 

TACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGNGCCGAATTGAAATC 

CAGAATACAATAGCTNTGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAAC 
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CTCATGGGCACACTTAAMTAGTTTGCATCCCACACTACAGGCTTCTCTGGAGGGATTTA 
ATACTTTGGG 
Sequence 570 

CTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTTATTTTCCCTCAGTAAC 

ATGTMTTGCTACATTTTTTATAAGAAGGTATGGTTAGAAAAAAATGTGAAAGATCACTT 

AAACCAAAGCCAGTTACAAGGAGTAATCTCTCCTGTTGGTTTACCTTCACCTCANAACTA 

CMGMTATTACMTACATAGTGAATAGTTGTCTGTAACATTTCTACCAGTTGTTTCANT 

AGCATATTGGTCTTGGCATTTCTTGGCACTGTGGTTCTGCTGTATTATTTGTGATGTCTT 

ATTGTTTGTGAGCTTTTG I I I I I I I I I AAAGAAAAAACAAAAACTAAGTG 

Sequence 571 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGTAATCACATA 

AATTCTGCAATCATCTGTTTATTTAGCTTAACTGNTTTTTTTTATTTGN^ 

TGTGTTATTTCAGNCTTTTTCTTATTGGGTTGACCAGACTTGGTAAAATCTGTAAGAAAG 

NTCCATMTTATGGGGAAGATTTCCTCTGAATTGGCTAAATTCCTGTAGCTGAAAAAAAA 

AAAAAAAAAAAAGT 

Sequence 572 

TTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT 

CCGCMGTTTTCAMATGTAAATACTTCCTCTGTTTAACAGTCCTTGGACCATTCTGATC 

CAGTTCACCAGTAGGTTGGACAGCATATAATTTGCATCATTTTGTCCCTTGTAAATCAAG 

ATGTTCTGCAGATTATTCCTTTAA 

Sequence 573 

TACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTACGGC 
TTCTG ACAG G CCTC AG AAG AAC ATTCCCTACCCCAAA ATTATAAA AAT AATCTTGTATAT 
ATTCTTCTCMAACTTTATACTTTTTTTAAGG 

TTTTTAAATACTGAGTGAGTCACTTTTCTCCCGGACGCGTGGGTCGAAGACCT 
Sequence 574 

GAATTGGAGCTCCCCGNGGTGGGGCCGCTCGAGGCCGCTCTGACCTCTTTAAGGNANACT 

TATTATGCTMTAGTTGATGCGCCCTTTTGCTGNANCAGTTACCATAGGTTACATGATAA 

NTATATATTGTTGNGATGATCTGATTNCCTGNNTNCCCCACCCNTGCAAAACAACAACAA 

AAACCTTTACCAGGCTNTATAACANGGGGACCAAACTTGNTTTTGCTCATCATTGCCGGA 

CG 

Sequence 575 

AGGTA CTTACCCCAGACAACGACGCCGCTTCACCATGATGATGGACAACAGGCAACTTTT 

TTTTTGGAGTTTCA GCTTG CTTCCAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTC 

TAAGACGATAGAAGTTTTCAGCCTAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAAGT 

TTTGTCTTGGAAATTCTCCTTAATGACTAAAAGTTAAAGAATTAAACAACCTAGCTGGGC 

TTAAATTTCTTNCTTACCCATTAGAAGTACCCTTGCCC 

Sequence 576 

TAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGNGGTCTAAGAGACAGGGTCTCACTACA 

TTGCCTAGATTGGNCTCACACTCCTGGGCTCAAGCAATCTTCCTCTCTTGGCCTCCCAAA 

GTGTTGGGATTGCAGGTGTGCGCCACTAGGCCCAGCTTGAAAAATTTTTAATGCATGTGG 

TAATCCACAGGAGATCACATTTAGTATATGACCAAGTTAATTAAGAAGNCAAAAAACACG 

TTMATTTAAGCAGAATMGGCTGGGTTCGGTGGCTCATGTTAGTTTTTATCCTTAAAAT 

TGTCTGAGTTCTTAGAACACAGAAAAMCAAATTTGAATGCATTTCTAACAGCTTAATAA 

TTTATATGTCCCATTATG 

Sequence 577 

GGGG NGGCGGCCGCCCGGGCAGGTCAATGCTTCGACCCACGCGTCCGATTTCAGGTTGAC 
TTTTCTCACCTTTAACCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCAT 
CTGGGTTATCTGTAGCTTGAGTTGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGACCT 
Sequence 578 
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AGGTGTCGACTCMGCTTTCAGATATAGGCATTCCAGAATCTTCTCTTTACGAGTTCACC 
TGCTAGTATAATCTCCACMCTTGAATGGCATTGGTTGTTCTGTAATTCCTGCCAAAAGC 
ATCACAAGTTGTACCTGCCCG 
Sequence 579 

CCGGGCAGGTTACAAGTCGACCCACGCGTCCGCTTCAGAATATCCAATTCATGTGAACTA 
CAGGAAATTATAGTTTAGATATTTTTAAATGATTTGCCTGTCACCGTATAACACAAGGGT 
GTCATGACCMGCTAGATCTCTTTACCATATCATTAATAAMGTCAMTTTTAAATTTGT 
GCCCAATTTGGCTGGGTGTGGTGGCTCATTCCTGTGATTCCAGCACTTTGGGAGACCT 
Sequence 580 

GTGAACCTGCCCGGGCGGCCGGCCGCAC I I I I I I I I I I I I I I II I I I I CCTTTTGAGACA 

CAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCCCGCTTT 

CAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTTCTTTTT 

CATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTGAACTCA 

AGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTTAGCCTT 

GTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTC 

Sequence 581 

CCGGGCAGGTACCCTAAMCTTAAAGTATMTAAAAAAAAGATTATTTCCTATACTTCAA 
MTCAACAAGATTTGATTGCATTTAAATTTTTCTGT 

TTAAAAATAATTGGCATAAAAATTCAATCAATTCATAAMGTCCAAAGCAAAAAAAACAA 
ATCTACTGACATCTCTTGAGGAAGAAATGATCAGGiATTGACATTAATGAACCCTCTCACA 
GAGACCACTACACACACACACACAAAAAGAAGGATGGGTGAATGGATGCAGAGAGAATTT 
MTAAGACTGAAATGATGCCATACATGCTTTTAMAMTAMMGTATTAMTTTTAA 
Sequence 582 

CCGGGCAGGTACTTATACTAGMGATGCTCCAAGGTTTCAGAAAGGAATTMTTACTTrC 

MTTTGCACMTTTAGMCAMTATCTGGCTTTTCCCTMGCTTAATGATm 

ACACAACTMAATATAATAGCATTATTTTATAATCAAGTTTAACTGATGGTCTATGATAG 

TAGAGCGATTTAGTATTTTGACAAAAATCTTATGAGACATGAAGTCATTCAATTTGCCGG 

ACGCGTGGGTCGACTCAAGCTAGACCT 

Sequence 583 

AGGTCGCCATCACACCCGGCTAAI I I I I I I I GTATTTTTAGTAGAGACAGGGTTTCACCA 

TGCTAGCCAGGATGGTCTCAATCTCCTGATCTCGTGATCCGCCCACCTCAGCCTTCCAAA 

GTGCTGGGA TTAAA GGCGTGAGCCACCACGCCTGGCCAGGAGATTCTTAATTATTTCTGA 

ACTCTATCAGTTTTGTATTAGGACATCTTATTTMTATTATCAAAAGATAGTTCCTCTTA 

GAGGCATAAATCAGTCAATCAACAAACAATAGGCAATCACGGACGCGTGGGTCGAAGACC 

TGCCCG 

Sequence 584 

AGGTGTACAAGCTTCGACCCACGCGTCCGCATTTTTCTGGTGTTCCCTCTTACGTGCACA 

CCCCTTGCTCCCCTTTGGGTTGACTTATAATCTGACTTTTGTGACAGATGTTAGGAGGTG 

GAGCAAAGGAATTTCAGACCAATCAGTTAAGAGACTGCTGTGGGGTAAGAAAAAAAATTA 

GCCTCTTAAAATTACTCTTATCAAAGGAAAAAAAGTTGGAAGCACATGATAGTATAACCA 

GAAACATGACACAGAAGAATTAAGGGAAGAACCTGCCCG 

Sequence 585 

CCGGGCAGGTGGAMGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTMCTGTTGTTTGTTTGTMGTTATAGCTGTMTTMTTT 

TCCCTGTATTCAAAGCCCCCMACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAGTCATATAGATAATATCAATCAATMT 

Sequence 586 

CCGGGCAGGTCTTCGACCCACGCGTCCGTGATTGCCTATTGTTTGTTGATTGACTGATTT 
ATGCCTCTAAGAGGAACTATCTTTTGATAATATTAAATAAGATGTCCTAATACAAAACTG 
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ATAGAGTTCAGAAATAATTAAGAATCTCCTGGCCAGGCGTGGTGGCTCACGCCTTTAATC 
CCAGCACTTTGGAAGGCTGAGGTGGGCGGATCACGAGATCAGGAGATTGAGACCATCCTG 
GCTAGCATGGTGAAACCCTGTCTCTACTAAAAATACAAAAAAAATTAGCCGGGTGTGATG 
GCGACCT 

Sequence 587 

AGG TACAT TGTTAGACAAGTGTTTATCACTAATCTGGAATACATCATCTTCAATAAGGCT 

CTTGTTTTCCTCCAAGCTGCACTGCTCACACTGCTCAGTTTTCTGTTAAGCAACCTGCTC 

ATTATAGTAGAGCACCMGGTGATCTGTTCTTCTGTTCTTCAGAAGTTCACTATTTCTTG 

TTGCMCAGGGCTACATGATTTTAAGATTCCTCAAAGTCAATACGAATTAACATTATTTT 

CCATTTCCATTCTGTATATCTTCACATTCCATAAATATAATACTCATGTATACGTTAAAT 

TTCCTTATAAGTTCAACACATTGAAAGCTAAAATAAAGACTTCCTACTAG 

Sequence 588 

CCGGGCAGGTACACAAACCAGATGTATGCAATGATGCCAAAAGTCATCTCAAAATTCCAA 

GCTGACCTAGTGCMCACATTTACACTTGGATAAACTATCACCTTGAAGATTTACTTTGT 

AfACTTCAAAAGATCTGGTCATGAAATTTTTAGCTAATACATAAAGTGCCGAATTGAAAT 

CCAGAATACAATAGCTATGAAGGGCCGCTAGAGTGCAGATAACCAACCCATTCTACCTAA 

CTCAGGTTGAGATTGCTTTAGAACCTATCATTGGCTTTAAAATGGTCCACAAAAAAACAC 

CTCATGGTCACACTTAAAATAGTTTGCATCCCACACTA 

Sequence 589 

CCGGGCAGGTGACTTGGCTGTGAAAAGTGCTAAAACAGATAAAAGACTATACTGACAGGC 

AAATGGAGCCTGTTATGACACTGACATTGAAGGTGAAAGGAGAATCCAGTTCACATTAGC 

CAGGGTCTCAGGGACCAGGTTTTGAGGCAGTATTTCTGCCTCTTGAGGACAGGGCAGAGC 

AGGTGGGTAAAAAGCAAAGAGACCAGGGAAGGGGGACTAAAAGTAAGGGAAACAGCATCT 

GAGGAAAGGCTCCTCTGACTGGATTTTCACAAACATTATTTATTAACCTCACTAAACAAG 

GATAATGGGACAAAACAGGGGCAAGCTGGAAAACCAGCAGGGGTATTTGGCAG 
Sequence 590 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTTGGTCACTGNCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGTTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGAC 
Sequence 591 

CCGGGCAGGTGGAAAGGTGGGTGGGGAGAGGGAGGCTTATTTGTTGCTGCAGTGTAACTA 

AGTGAAACCTAATTCATATGACTCAAACTAAGGTATATTTGGTTAGATCTAGGTGAGTTC 

TACTTTAGAGGAMTCCTGGTAACTGTTGTTTGTTTGTAAGTTATAGCTGTAATTAATTT 

TCCCTGTATTCAAAGCCCCCAAACCCTGCATTCAGATACTATGCATTTAGACTTCCTTAG 

GCAAAGTCAAGGCAACAAGCTGATGATTCTAAGCTATTATTCAAGGAGTATCTACCATCA 

TAAAGGTGGTTTAG 

Sequence 592 

AGGTCAAGCTTCGACCCACGCGTCCGCAGCCTGGGTGATAGTGAGATCCTGTCTTAAAAT 

GMGAAAGAAAGAAAAAAAGAATGAGAAGGAAGGATA7TAATTGAAGTAAGAGCACATTT 

GATTACAAAATAGAAGAGGAGTAAGTGAGAACTAAACGGGGAATACAGATAGCAGAGATT 

AAATAGGCTATAAGAAAAAAAAGGGATGATAATAAGACCATGGTAGTACCTGCCCG 
Sequence 593 

AGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGTTTA 

CAATATGGTGACAACAGATAACTGTAAAAACTTCTTTTTCAAATAGAACCAGCAGGAGCA 

TGCATGGAACACATATACCAAACATCTTTCTGATAACATTAMCATTTTTAAAAGATGTT 

AAATGTTCTTTTCATTGNGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTGTGTGGAAC 

CTGTGTGCTAACTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAATTTTCAAAC 

TAAATGATGGGGGAAGGAACTAACATTTTTGCAGNCCCTGGAAATGTAAAATGTTGTACC 
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Sequence 594 - 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATAAATAGC 

ATTCCCACGGTGACCACAAGTCTTGGAATCAGTTCCAGGTGTCGTCGTGGCCGTTGACAC 

CGCTGCCTTCTGACGGTAMTGTATTGTAGAATTCATGTTGTTATCAGGCTTCAGTTTCC 

TCATCTCTAMATGAGAGGATTGGATAAGTTAGTAGTTTCTAATTT7TACTTTTAATCAG 

TGGCATCTCCCATTTATTTTTCATTTGAAATAAAACTTTTGAATTT^ 

ATMCATATTTTGTTTTATGTTTCAAGATGAAGCTCACACTGAGTTGGAAAAAAGGAAAA 

AGCAAAGGATCAAAGCTG 

Sequence 595 

CTATAGGGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCAAAGCCTT 
AACTTAGATTGTTACTTATGTTTCT^ 

TTCTTTTACTGTTMTATCCTTATTCTCTATTTTACCAGTGGAGAATGTTTAGTAT^ 
TTCCATTTAGCTCAAGATTCAAGAAATGCANAGTGCTATTTTTATCAMTTTCTGAAAGC 
CTACTGTCTTCTGCTTTGGAAGTCCCACMCAGCTCTTTAATTTCCTTMGCCCCACTTT 
CCTCATCAGCAAGTTGGTGTGGCANTGGATCATANTAGGTTGCTGGGAGGATGAAGTGA 
Sequence 596 

ATGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 
TTATTCTCTCCATTTAGGCTATAAATCTTTCAGTGTAGGGTGTTTCTAATGTCATATTCT 
TCCAAAAAAAAAAAAAAAAAAAGT 
Sequence 597 

CCGGGCAGGTCTTCGACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

MAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTGGGATAATATGG 

CTCAC TTTTTACTTCATTTACAATATTTAATAAGTGCGATTTTAGACTTGAGAAGAGAAT 

ATTTTCTGCTAAAATTATCCCCACTAGAGATAATCACCAGTGAATTAATACACTGCAGCA 

ACGGAACCAGTCAGCTTTTTTGGTAATCATTCCCTTCCT 

Sequence 598 

CCGGGCAGGTCTTCNACCCACGCGTCCGGCTCTCCAGTTTATACATGAAGAAACTTTCCG 

MAGTCTTGCAGCTTGTGGAGAGCAGAGCTGGAGAGCAGGCTAGTCTGATTTTAGAAGGG 

AGTTAACCATTACATAACCTGCAGGTGGCTTCTCCCCATACCTGCCGTAGGGATAATATG 

GCTCACTTTTTACTNCATTTACAATATTCMTMAGTGCGATTTTAGACTTGAGANGAGA 

ATATTTTCTNCTNAAATTTATCCCCNCTAGAGATAATNNACCAGTGAATTNATACACTGC 

ACCNNACGGAAACCAGTCA 

Sequence 599 

AGGTGCTTCGACCCACGCGTCCGGTATTTCTCTTAAAGTTMTT7TGATAGATATTTATC 
TAGATGCTTTCTTTTTTCCCTTGCCATAATAGCTGGCTTGTAGAGAGAGTTATGTTTGAA 
AAGGCTTGCC I I I I I ICCGTCGCTCTG 
Sequence 600 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

TGTGGTGCTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACC 

TTTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTTGAGTATCTGGAGGAGGATGCCCA 

GAAGTCCNCACAGGAGGGGGTGCTTGGGACCACACACTGATGCTCTTGNCATTCAGACTC 

TGAGAAACATGCCGCGTGATGAANAAACCATCCCAATTANANGAAGCTAGCTGNNTTNCA 

TTGNAGCAGCTTTACCCCAATTT 

Sequence 601 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGTGATAACTTCTCCTAAGTGCCAGGCATT 

GTATTACATGCTGGGAGCACAAAGATGAATAATAACAATAGGTTCACAGAAAAGATGAAT 

TGATTGAGAGAAAAAGAACCCTCCAGGAGCCCTCAGCGTAGTAGGGGGTTGGTGTTGGAG 

GGTGGAGGAATGGAAAAGGNCCTGAAATGCANGCAGAGAAATGATGAAACAATTCCNGGG 

GCTGCGGNGAGGTTANATGAATATCTTTACAGCAGCCTNGAAGACTGATCANGTTACTAT 



# 
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ACCCTCTCTTCTGTCCACQTGCATTTNA 
Sequence 602 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCATTTATTAAGGCTTGTATATGT 

TCAAGATCCAGTGAAGACTGTCTTGGGCGTGTATAATTGATCTTAACCACAAGGCTGAGA 

AGTTATGTGCAGGGCTTATGATGCTAGTTCCAAAGTATTAAATCCTCCAGAGAAGCCTGT 

AGTGTGGGATGCAAACTATTTTAAGTGTGACCATGAGGTG I I U 11 I GTGGACCATTTTA 

AAGCCAATGATAGGTTCTAAAGCAATCTCAACCTGAGTTAGGTAGAATGGGTTGGTTATC 

TGCACTCTAGCGGCCCTTCATAGCTATTGTATTCTGGATTTCAATTCGGCACTTTATGTA 

T 

Sequence 603 

ATTGGAGCCTCCCCGCGGTGGCGGCCGAGGTAGCTTGAGTCGACCCACGCGTCCGTTCAG 

ATCCGTTTCAGAAACGTGAGTCTCTAGCTCAGGAGATTTCCACAACTGTCCTTAGTAACC 

TGATQTTATTCTCATGTTTAACCTTGGCAGTGGGAAGTTCTTCCTGGTATCCTGCCTAAT 

TTACTGGAGTTGGCATTAATGCCATTTCCCCCTAAGGCGTGGCTCTTGGACCAGTATCAC 

CTGAGAATTTGATAGACATAGACCCAGAGTTACTGAGGGCAGGTGCTCTGTTTTGGGGAC 

CAGCMTCGGTGCTTTAGCAAGTNCTTTGGGTGATAGGGGTTNTGGAAACTACTGCTCTA 

AAGCATNATCTGTTTTTGAC 

Sequence 604 

GGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAGGTC 

CTGAMGCGTNTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCATTGG 

ACTTCTCAGCTTTCCTCATTAGAAGTTTAAGATCTGAAGCAATCTTTAAACTCGTGAGGA 

MATTMGTCTAATAMTANTTTTCTTCTTAGCCCAAACAATCAAATGTGAAGCTAGAAT 

AAG C ATTTT C AG GT A AA AAA A AA AA A A A AA AAGTG CG G C 

Sequence 605 

CCGGGCAGGTACANNTTGTGATGATTTTGGCAGCAATTACAGAACAACCAAGGCCATTCA 

AGTTGTGGAGATTATACTAGCAGGTGAACTCGTAAAGAGAAGATTCTGGAATGCCTATAT 

CTGAAATCAGAATCCTAGTAGTTTGTAGTTTGCCTCTTCCTAGAAGTTCAAGAGACTCAA 

GTCATAGGCTACAGATGTACCTN 

Sequence 606 

AGGTCTTCGACCCACGCGTCCGCAACTGTTGATCTMCTTTTCCACTTGAATGTCTAATT 
GGCAAATCAAACCTAACATGTTCCAAACGAGTTCTGAAGCACCCCCTCTGCCAAATCTAC 
GTCTCCCACAGCCTTCCCTATTTCTCTACCTGGTACCTGCCCGGGCGGCCGCTCGACCTG 
CCCG 

Sequence 607 

AGGTCTTGAGTCGACCCACGCGTCCGGAGATGTATACGCCACTATAGGAACTATAAGAAA 

AAGTCAAATGGAAATCTTATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGA 

TAAGGTGTCAGTGAACTCAAAAATCAATCAATAGAAACTACTCAAACTAAAACTCAAAGA 

GAAMMAAAGATGGGAGATMTTATTTTTTAAGAATTGGTCATCAAAATGTAGCAACAA 

GTT TGCCT TATCCTATATCATTTGAATTTTCMMAATAAGCTCATTATACAATCTTTAA 

MTATTTTGAATAGAACTGTTTCATGTGTTATTTGT 

Sequence 608 

AGGTCAAGCTTCGACCCACGCGTCCGGGGAACCCTTGTTCAGGTTC 1 I I I 1 AGGCAACCC 

AAGCCAAGACMCAMGTAAGATAGAGCCCCMATGTGGTCGTATAAGGTTTTTCAAAGA 

AAGTMCACTTGAGTTAGGTCTTAAAGTTTCACCTAAGAAACTGCCCAGGTGGACAAGAA 

GAAAGGGTGTTCCAAGTAGAAATAATAGCATGGACAAAGGCAATGTAGCAGGAAAAGTCT 

TCGTAMTTCAGGGAATTTCMGTGTTTCACGATGGAAGGAGCAATAGAGTCATTTACTT 

GCGGTGGCAGGGGATGTTGGAAATGTAAACAAGAGTGAGATAC 

Sequence 609 

AGGTCAAGCTTCGACCCACGCGTCCGTGATGCTGGCTTCCCGGTCAAAGCTGAGGAGTTT 

GTGGTACTTTCTCAGGAACCTTCTGTCACGGAAACCATTGCACCCAAAATTGCAAGACCT 

TTCATAGAGACTTTCCTCAGGCCCTCAAGAGTATTGAGTATCTGGAGGAGGATGCCCAGA 
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AGTCCGCACAGGAGGGGGTGCTGGGACCACACACTGATGCTCTGTCATCAGACTCTGAGA 
ACATGCCGTGTGATGAAGAACCATCCCAATTAGAGGAGCTAGCTGACTTCATGGAGCAGC 
TTACACCAATTGAA 
Sequence 61 0 

AC I I I I I I I I I I I I I I I AGCTTGAGTCGACCCACGCGTCCGGGGATCTAGATCACGAGCG 

GCCGGCCGCC CGGGCAGGTACGGAAGCCATGCACTTGCCTCTCCTTCAGAGCTGGGATTT 

TTTTTCATTTTGCTGGCTGTGAGCACACACACGCCCACAGGTGCCTAAGCCTCTTGTATG 

TGTGTTTTGAACTGTGTCCTCTGAGTTCTGTGTCTGGGTGCATGCTCTCCTCTTAGCGTG 

GGTCTGCTTCCCCTGTGTAGCACTTCACAATGTTAGGCATTTGTCTGTGATAGCAGCTGT 

TCAGTAATTTCCTACTT 

Sequence 61 1 

AGGlTrCGACCCACGCGTCCGGAATTTATCTGGCCAGGCATTGGTAGTTTACAGAAGTCT 

ACCAGATGATTCTM TGTGT GGTCAAGACTGAGAACTATGTGTTTAATTGGGTTCATTTC 

AAGAATAC TGTAAA AA TTTTAT CTAAATACTAAATATCCATAAAAGAAACCTCGGTAATC 

AGGCCAGG I I I I IGAGTTTTTCCAGATTAGCCCAACTACAGGGGAAAGAGACTTTCGCAC 

TATATCCCAGAGTCTCTGCTCCTGCTTCCAGCCTCAATGCACTGGGCCTTTCTGCTGCCT 

TGGAGCACTTAGAGGGATTACAGGAGGAGTGATCTGTGGAGTT 

Sequence 612 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCAGCGCCCGGGCAGGTACATACAGCCTGAAG 

TTAACCTTTCTATGTTAAATGAAAAAACTTTGTCATTCATCAGGTCACAGAAACCAAAAA 

CTAAACAATGCAAAAAAAAAAAAATCTAAAAATAAAAGAATTTTATATTTGAAGTTATTC 

T GGATA TTCGCACCATTTTAGCTTCTGAAAAAAATGCAACTATGAAATGAAGACCTCATA 

TATT7TCATTTATCMTATMTGTTAAAAGTTTCATTCCACCGGGTGTGGNGGCTCACAC 

TTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGATCGAGA 

GTATCCTGGCTAACATGGTGAAACCCCGTCTCTCTAAAAAAATTCNNNAAAANAAAAAAA 

AAGGAA 

Sequence 613 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTGTAGTTTTCTTAGCCATTTTTC 

MTAGT CTACACAGTGTTTATGTTTCCTTTTATTTGTGTATAGTGGAGTAGAGGGGAGGT 

I I I I I I ATTCAAATAGAAGAAGCTAAAACTCAAATGCAATGTCAGATCTCANAATAAACT 

GACCCAATTTCTGAMCCCMTAAACACATTTCATTTGTAATATTCTTTATTATATAGCT 

CTATGAAAAGTAATTTGTGACTTTCGATCTTAAMGAGAGTTTTAAAAATACACAGTAAA 

TTGAA AGAA AAACTACTACATTTAMACAGTATTTTCCTGAAAACATAGAATGAAAATGC 

MGTATTTTGTGCATGGCAGCTGTTTTTMGGMCCAATGTTATATATGGNGAATTTTGT 

GGMGACTATGTCTCTTAAMTATTTCTTATMAATANCATGGCTTTTTAATAGCTGGGA 

ATCTGANGNNGGATTTCCCATGAAGACCTTAAATGGCTNNGCAGGAATTATAAAAAAG 

Sequence 614 

ACTTAGGGCGAATTGGAGCTGCCCGCGGTGGCGGCCGAGGTACATGCCTGTAATCCCAGC 

TGTTGGGGAAGCTGAGATGGGAGGATTGCTTGAGCCTGGGATACTGATATTGAGGCTGCA 

GCGGGCCGAGAATACCACTGCACTCCAGCCTGGATGACAGAGTGAGACACTGGCTCAAAA 

MAMAAGAAAAAGGAAGAAAAAAGTTTAAATCAATGAATGTTCTCATTTCTAATGAAAT 

MTGAM CATTATT GGGAGAGTTATAGTCATAATCATCTTACTGCACTATCAATTAAAAA 

ATACATCATT TTTTAG AGCACAATATATACCATAAAGAATT ATTCAAATAGTCTAAATAT 

TACGATCAAATTTTTAATAGACTTTGTTACTTAMCTAMACTGTATTAGTCTGTATTAG 

TCAGCTCAAGTTGGGATTCACACCTGTAATCCCAACACCTAGGGGGGC 

Sequence 61 5 

CCGCGGTGGCGGCCGAGGTACACTGTGTAAGTGGTCAMGATAGACATGGTTTTATTTAC 

AAGGAAATTTGCTGAAGTGTAAATTATAACACGAAGAGATGGGAGGGAGGGGTAAACACC 

TAAATGTCTAACAACAGAGAATGGTTCTCTGTTGATACAAAATTATGATACATCAAAAAA 

GAACAACAATCAAATTCTCTGAGAATTCCCATTACAGTTAAAAGGAGCTCCCAGCCAGGT 

GCGGTGGCTCACGCCTGTAATCACAGCACTTTGGGAGGCCGAGTCGGGTGGATCACGAAG 
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TCAAGAGATTGAGACTACCCTGGCCAACACTGTAAAACCCCGTNTCTACTAAAATACAAA 

AATTAGCTGGGCCGTGG 

Sequence 616 

TTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGAGGTACTGAATAACTGCCAATGCCA 

TCTGCCTGTGGCCTTCTCAAGTTTGTCTGCACCTGTGGTTATCCTGACTTCAAACCCGGG 

GAGACAGAGGCTAGAAGAGGCAGACAGCTCTTGTGTATTCTCCTGTCCAGTGCAAAGAAC 

ATCT GGAACTCTGAGCCCTAACCTTAAATGCAAGACCTNATCTGCAGGTGTTCCTNATCC 

TTTTAGCCCCTCAGTGATGTAAGCAACAAACGTCACCCANCTCCTGGGGCACACTTNACT 

CCCAGATGAGCTTGTCCTGGATTTGCAGGGAGCCTGGCTCCC 

Sequence 617 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGAT.GGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACMGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGA 

GGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGC 

Sequence 618 

GCCGGGCAGGTACAGATGGGGTTNCACCGTGTTAGCCAGGATGGTCTCGANTTCCTGACC 
TCATNANGCATCCANCTCGGCCTCCCAAANTGCTGGAAATTACAAGGGCGTTGAGCCCAC 
CCGCACCTGGG CC AG AATCTTAC ATATTTCTTAAACATCATTA A AT ATATATTGT ATTTT 
TTACi I t l I I l I I GAATAGGGGTCTTGCTATGTTGCCCAGGCTTGGTTTTGAACTCCTGG 
CCTTNANGAGATCCTCCCGCTCTCAAACCTCTCAAAAGCAATGGGTA 
Sequence 619 

AGGTACCCCATTTTATGCCATAAGTCAGGTTTCTCCCTCAATAGCCCTTTGGAACTCTCA 

AGGTCCAGAGTGGCATCAAACCAACTGACACATGAGTTGATACATCATGTGCTGCCAACA 

GAGMATTAGTCTGTGCCAAACTCAGCACAATCCTGCAGTTCAAACCAGAATTTCAAAAA 

Sequence 620 

ACCAAGATT TGAA TCATGCTTTCAAMGCTAATGTGAAGTTAGACATATTTGGTTTCATA 

ATCACAGAATTTTAAAAACACCAGGTCTGCAATATTCAGAAATCACCATTAACGCTCTCT 

TGACACATACAATCMTTTCACTTTAGATCGCTGATTTTCTTMCAACTGATTTAGTTAT 

TTCTGAATACTGCTAGAAAATTCAAAATCTACAATTAAT 

Sequence 621 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCAAGGCATTGTTCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTNTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATNAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAAGCCCAGCC 

AGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

Sequence 622 

NCCGGGCAGGTACTGGATGACAGCAAGTGCACACATCAAGAGAAAGTTACCATTCAGAGG 

TGCAGTGAGTTCCCTTGTCCACAGNGGAAATCTGGAGACTGGTCAGAGGTAAGATGGGAG 

GGCTGTTATTTCCCCTAGGTCATCTCTTACATTCTAGTTCTGGTGCTCTCTATCTGTTTA 

AGACAMCCCTTGNGCACCTTTCTCCCACCCCTCCCTTTCTCCCTTGTCTCCCTTGAGAA 

AACAACTNCAGTTCTCTGCCTGCACCATGACTGTCGATACGCGGGGGCAGTTCGGCGGTC 

CCGCGGGTCTGTCTCTTGCTTCA 

Sequence 623 

AGGTACAAGCTGTGCACTGCAAGGTAACCACGTGGCCAGAGGCACATCCCTCCCTCACAT 
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ATACTGAGTGGTGTAATGCAGTCACCTTGTCATCTGGCAAGAGGTGATCGATGGACACAA 

ACTCCTCCCGGAACTGCCCCTCCAGCGAGCTCACTCTGAGGTTATCTGAACTCACATAGC 

TTGGGAAACCCAGCTGGGCACGGGCAACATTTGCGTAGTGACCCTTCCAGTCATCGGAGC 

ACATGGTCTTCCACGAAGCAGCTGTGAACACCTGGAGCACGGCATTCTGACCACTCACCC 

GGACACAGCGGTACCTGCCCGG 

Sequence 624 

CGGGCAGGTACTTTGCAAGACACGCCTGGCTACGAACAACATGGGACAATGGGCAGCCTC 

GCTGCACTGNACAGAGGAAAGGAAAGAGGCCTTGCAGCCACTGCCTGGGAAGGAGCAGCA 

CATTCTGCATTAACCAGGCATGCCTCACTCACTGCAATCCCCAAACAAGCCCAACTCTCC 

GTGTTGATTATTCTTACCATACTCCACCAGAAAGCAGCATGATTTTCTGTCCTCAAATAC 

TTCAGATTCCAAGAGAACTGCACCTTCTAGAGTCTCTACTGATAACCTCAGNCACTTACC 

CACTTGAAGCATNAGCACACACTTAAAAAGGAAA 

Sequence 625 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATTCTTCCAACCTTTCT 

TCTGTGCATMTCATCTAGGTGTGGTGCTTACATTTTCTTTTGGCAGTGTTATCTTAGTA 

TCTTCCAGCATGGTTTTCTCACCTGATACTGTAACCATACTTCCATATCCTCAAATGTGT 

TGTTTTCTAAAATAAC I I I I I I I I ICI I I I I I I AGAGACGGAGTCTCACTTGGCCAGGTG 

CGGTGGTTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGT 

CAGGAGTTCGATGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGGC 

GCACGCCTGTGATCCCAGCTACTCGGGAGGCTGACGCAGGAGAATCTCTTGAACTTGGGA 

GGTGGAGGTCGCAGTGAGCCGTGATCGCGCCGCTGCACTGCAGCCTGGGTGACAGAGTGA 

GACTCTGTCTCAAAAAAAAAAAAAA 

Sequence 626 

AGGTACGCGGGACATACTCCCTAGGTGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATG 

CCGGGAGGCTTCCTGGAGCAGGTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATA 

AAGCATACTGGGAGCTGAGCTGCTGTTTAGTAATTAMAATCCTTTCCATTGTTTAGAGC 

TCAGCACCTTTGTGCATTCATATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCAC 

T TCTGCTAC TGCAATGTATGT CTACAG CTGACAAGTCTTCCTTGGGAGCCCTACGTAGCT 

CI I I I I I I I CTTTTCTTTCI I I I I I I I I I I GAGACGGAATCTTGCTCTGTCACCCAGGCT 

GGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTC 

TCCTGCCTCAGCCTCTTGAGTAGCTGGGACTACAGGGTGCCCACCACCACGCCTGGNTAC 

I I I I I I I I GGTAT 

Sequence 627 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTCTGCCATGAAGGT 

TCTGGGGTGGAGAGGGAAGCAATGTATATCCTACCCATGGTGATTGGTCCGATGGAAGTC 

ACATCCTGATGGGAAAAAAGGACTGAGCCAGAGTGGACTGTCTAAACCAAAATGGGATAA 

ACAAGCATGGCATGGAGCCAAAACAAATGGCTAAGTCAGAGGTCCTAATGCAGAAGGCTG 

GACAACTAGGATGGTGGGGAAAGACATGAGCTTGAAGGACTTCCCAAGATAAAGCAGAAC 

TMCCAGAAGAGCCTGTTATAGATTCATATTGGGGGAGTTTGGGGGGTTTGTGCAGGGTG 

CATCAAAAAGCACTCGCATGGAATAAACATATCTTGCACAGGAACATATGACAGGTAATT 

GAATAGTTTGATTTGAACTATGTAAAGACATGATCCTGATGGTAGAAGGATGGTACCTGC 

CCG 

Sequence 628 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I 1 I I 1 I I 1 1 I 1 I 1 I I I 1 

TTGGGACCGAGTCTCACTCTGTCGCCAGGCTAGAGTGCCATGGCGCAATCTTGGTTCACT 

GCAACCTCCACCTGCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCCCTNACCACCACGCCCAGCTMTTTTTGTATTTAGNAAAGATGGGGTTTCAC 

TGTGTTGGCCAGGCTGGTCTCGATCTTTTGACCTTGAMCTTTNACATAMCTTTCACAT 

TTCCATGACAAAGTTTTAGCAGTMCTTCCAAATTGGTCTTATTCAACTCCAACATTAAA 

CTTTGTATGTACCTGCCCG 

Sequence 629 
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CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGATCC 

AGGATGTGATGGGATCTTAGGGCTTGGCTGGAAGGTTTCTCCAGTCAGCCATCTAGCAGA 

GCTGCAGATCTGGGCTGGGCTGTTGGCTAAAGTGCTCTTCACAGACACCTCATTCGGCTC 

TTCCTTCAGCTTCTTCACTTATTTCTTACTCAGTCACTACTCAGCTCCTTGTCCATGTGT 

CCTTGAAGCCATCCTAGGTCTTATTCTGATTCTGAATTCTTCAGTCACCCATAAGCTTCT 

CCTTACCCCGGGAGTCAGTGGGTGTGTGTTCCCAGGTGGACTTAACCATTCTTCTCCTTT 

ATGATCCnTTCCCTTGGGTGGACAAGTGTGATTTGGTTGTAAGGCCATTTTTCAAGTTGC 

CTATACATTGATAAAAGAAATCCCACTAACGGAAGTAGACTGCATGCCAAATTTCAGTGT 

CTTTCTCCAGGGGCCAAGGTTGGACCCANAAGTGCATGGG 

Sequence 630 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTATTGCTTCCTGGGAGAGCTGACCATGA 

GTCAATTGGCCCACAATAANTTATNAAATGAAAACCGGCCATCATCTGCATCTTATGAGT 

GCACGTCATCAGAGATGTCCACTCCAGTTACAAGAAAGTCCTGAGGGCTTTCTTGGAGCC 

TGANGGGCGCTGGAGGTGAGACCTGGAGGTGAGCAGGAGTTAACTAGGATGAGGGACNGG 

CGCAGCATACAGGAAAAGCTGCCTGGGGGAGAAAGGACCAACAGCAAAGACTGAGAAAAA 

AATGCTGTTGTGACCAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCAT 

GGAGGACTGAGGGTTCAGAGCGGGCATGGAGGACTGAGGGTTCAGAGCGGGCATGGAGG 
Sequence 631 

CCGGGCAGGGTACTAAGGACAAAAAGACATTTATTCTCTTTGACCCTTGCTGCCAGNACA 

GAAAATGACTTCACCCAAGGACACAGCACTTGCGGGTGGCCTTCTCCACCTCCAGCTATT 

GCTTGGTTTCAGGTGACCACTCCCTTTCTCTTCTCAGGCCTATGGGTGGTAACAAGCTCC 

CAT CCACT GCTAGTCTTAGACATCTTTACTTTCCTTGATTGATNCCCTTGACTCTGCCCA 

CATCTTTTAAAATATCCCATATTAMCTTTTTACACCCCTTTGAATGTGTCCTGCTTCCT 

GCTGGGACCATGACTAGTCTCTTCTAGTNGGAATCCATATCACCTTCTGTGATGTAGTCT 

CCAAGTCAGGCAG NCTCATTTCAACTACAG NCTTTCTTTATGCTTCTCTTTTTCCTTTCT 

GGACTCTTACCTTTCTTTTATTTCTTACTCAGCAACAGTGTCTGCCCATTAAAATGCACC 

TTTGCGGNGGNGGTTNGGGTATCTTTATCTCTCTTATTTCTTCCTTNCTGGTGCATTTTC 

ACTGGCATTGCATGGGAATTTTGGTT 

Sequence 632 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACATACTCCC 

TAGGfGTCTGTGAGGATGGTGGAGGGGATTTTCTCCATGCCGGGAGGCTTCCTGGAGCAG 

GTGCTGCCTCTCGTGACTCTTGAAAGATGCTTGTGAATAAAGCATACTGGGAGCTGAGCT 

GCTGTTTAGTAATTAAAAATCCTTTCCATTGTTTAGAGCTCAGCACCTTTGTGCATTCAT 

ATTACGCATTCATTTTCGTATCATTGTTGAATTTCTCACTTCTGCTACTGCAATGTATGT 

CTACAGCT GACAAGTCTTCCTTGGGAGCCCTACGTAGCTC I I I I I I 1 1 CTTTTCTTTCTT 

I 1 1 1 I 1 1 I IGAGACGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTC 

GGCTCACTGTAAGCTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGT 

AGCTGGGACTACAGGTGCCCACCACCACCCCTGGCTACTTTTTT 

Sequence 633 

GCCGAGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGNGCTAATAT 

NCAANGCAATGATCCCAATGACGATGATTGGAAAAAACTTCAATGGCAGCAGTGACAGGA 

TCTGTGCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGC 

CAAAAAGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGG 

TGACTATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGA 

GTCCCAAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCC 

CATAAACACAGCCCTTTCCTGGCTCACACGGGCATGACCTAATTAAGAACCCCCGCGTAC 

CTGCCCGGGCGGC 

Sequence 634 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAGTGA 

AATCCCTAAGTCAAACTGTGGCTTATAAGCAGAAATCCTGGTTAGTATTTCAAAGTTCTC 

TTAGCGTTTTCTCCTGCGACTTAAAAGACTTAAAAACGTGAAAAGACATGGACGTAAGAC 
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TCCAMCAAAAATACATTTCTTTGAAACTAAATAGCTCTTAAGTAAGAAAAATTTCTATA 

GATCTTCAAATCATCCCCTAAGCAAMTATTCTCTAATTAAGTATTTCTGTATTTCCATC 

TATGTTCTTCCCAGGCTTGGGGCTGTTGATCAGACCTATTTTTAGGGGTAAGTTTCTAGG 

GGTCATAGAAGATACAGATTTTGACCTGCTTAATGTCAAGAGGTTGCACGGTTGATTTGT 

CCAGTTGTGMTTCTATGAATGMGCTTTTTGCTTAAATAAAACGATATTCCCCTCTGGC 

TGCTGTGAGCACCGGGAGACTTGTTTCGGCAGTGCCTGGGTGCTGGGGCAGGGCCCG 

Sequence 635 

TG GGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGAGTAAAATAAAAGGGTCG 

GATTTTGTAGGATTCTAAGGAAGAGGCAGTGTGCTCTGTCCACAGGCTGCAAGGTGAGAA 

CCTAAAAGAATGAGATATATTCCCATTTTGGAATGGCAATCAAAAAGAGGATCTCTCTGT 

CAAGTCTTTACATTAATGCTGAGTAACAATCTCAAAAGCCTGCCCATTCCCCTTTAGACA 

CATGTGGCAAAAGCAGAACTGAAGGAATGGCCAAGGGGCTTGAACAAGTAGAGAGACCGA 

CAGTCTTTCAAATTTCAGGGAACCCCAGATACATTTTGGGGGAGCCACTGTTTTCCCATT 

TTCCTGAAAAGTTCTTGCAGGTATAAGAAATAGGAATAGAAATTGAATAGGTTCTGGAGC 

CAGGGCTACAAAGGCCCCAGCTCTGATCTGTTAGACTGAAAACCACACATCAGATGAAAT 

TATATNCACAAAAAGGAGAGTCCTTAAAAACAGCCATTTCGGTCCCT 

Sequence 636 

AAGAGCACGTATAGCATGGGGGAAAGAACCTAAATGTCTCTCTGTCCTGTGAGCTGGTGA 

AAAACCCAGCATGAGAACGCAGTGTCAGGTGTGGGACTCCTTCTGCCCCTGCAGTGGGTG 

TTACGGGCGGTGTGCCCTGGCGAGCAAGCTTTGATTCTTGG7TCTTTGAGCTCGT7TCAG 

AGGCTGAGTCCCCACATCAGCTTTAGTTCTTGGACTTCCCTGTATTAAGCAAGAATTAGG 

AGAATGGCTG TCCC TGCAGGCGCCTCCCGTAAATCCTGAGCTCTCTGGCGCAATCTGAAA 

CTTCTCTTCTGTTTTCTTTGGCTGTATCAGCCGAACCAGGAGAGGCC 

Sequence 637 

CCGGGCAGGTACCAGGAGAGATCTGAGACANGGTATGAAGTAAAAGATTTAAGA7TGGAA 

GTGGAGAGTGTCATGGACCAGTGCCTTTCGGATGGGTGACTTCTGGAATTCTTGTTAGGC 

ACAGCGGAGGTTGGTCCTGTGGGAAAGGAAGAATATTTCCGGGGTGAGGAGACTTCGGGG 

TGTGGGCCGGGTGCCTTTTTAAATTTGGAATGGTGTATACAATAGGGAAAGGATGTTAAC 

TTTGCAGCAGCGGGGATGGTGAATATAACCTGATAGGGACCCTTCCATTTTGTTGGAAAG 

GGGAGGAGGGGTGTGCTACCCAGACCCAGTCTCCTGGNTGTAAGGGTAAGAAAGTGAATT 

GGGAAGAATCCTCAGG 

Sequence 638 

CCGGGCAGGTACCTGGACTCCTAAGCCTCAGGGATTTACTGAAACACCATTCTATTTTAT 

AATAATCCTTAACCAAGAATTTTAAGGATCTTAAATTT7TCTGTGG7TCTATTG7TATCT 

GATATATAGATGATCTGCTGCCATATCCTAAAGAGCAGATGAGGCCGGGTGTAGTGGCTC 

ACGCCTGTAATCCCAGCACTTTGGGAGGCAGACGAAGGTGGATCACCTAAGGTCAGGAGT 

TTGAGACCAGCCTGGCCAACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGC 

TGGGTGTGGTGGTGGGCACCTGTAATCCCAGCTACTAGGAAGGATGAGGCAGGAGAATCA 

CTTGAACCCAGGAGGCGGAGGTTGCAGTGAGCTGA 

Sequence 639 

AGGTACCACTTAACAAGGGTTCTCAGCTGTGNGGNCACTGGACCACTGGGATATGCTGAG 

CTATTGCTTAAACACTGACTTAAATAAAACAAATATTTTAAATAATGAGAATGCTACTGT 

AATTAGAAGGCAATCATTTCAAAGTCTANATGGAGGCCAGGGGCGGTGGCTCATGCCTGT 

AATCCCAGCACTTTGGGAGGCCGAGGTGGGTGGATCACATGAGGTCAGGAGTTTGAGACC 

AGCCTGGCCAGTATGGTGAAACTCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTG 

GTGGTGTGCACCTGTAATCCACTGAGGCAGGAGAATCACTTGAACCCGGGAAGTGGAGGT 

TACAGTTGAGCTGAGATAGCACCACT 

Sequence 640 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCAAGGCCTTTGGAAf AATGAACTCCAGT 
TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 
AGATCATCCAAGGGCCTTTTGCTTGGTAAGCCATTTTACTTTAATCTTGAGTGCCATAGG 
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GATTCATTGACGGATTGATACAGGGAAATGAAATGA I I I I I I I 1 I I I I I I GGTTGGGGGA 
GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 
AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 
ACAGGTGCACACCACCACACCCAGCT A I H I I I A 
Sequence 641 

AGGTACAAAGGTTCAGTGGTGAGAAGAGGGAGCANGGCCTTTGGAATAATGAACTCCAGT 
TGTTCCTCATAGGTGCAGCAGAAATAGCGAGAGGTCAGGATTATGGAGATTGGTAAGGCG 
AGATCATCCAAGGGCCTTTTGCTTGGTMGCCATTTTACTTTAATCTTGAGTGCCATAGG 
GATTCATTGACGGATTGATACAGGGAAATGAAATGA I I I I I I I I I I I I I I GGTTGGGGGA 
GACAAGAGTCTTGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACAACGTCGGTTCACTGC 
AGTGTCTGCCTCCCAGGTTCAAGCAATTCTCATGCCTCAGCCTACCTTGTAGCTGGGATT 
ACAGGTGCACACCACCACACCCAGCT A I I I I I I AT AT 
Sequence 642 

AGGTACCTCGTTTCTGAGGATCAANACCTNAGNGACCGNGTGTGTGTGTGTGTATTTGTG 

TGTGTGTGAGTCCTATTTGGGCCCCGCCTTTCAGCCCTGTCTTGCAGC 

Sequence 643 

AGGTACTTTCAATTTCTGTGGGATAAACTCCAGCTCCAGTTTCAGAACCCACTCTAATTG 

GTTTAAGCCAGGAAAGGGAGAGGGACATGTTGCTGGGAGGCCCCCATCTGGGGCCTGAGC 

TTGGAATCAAATCAGAGGAAGGCAACACATGTAAAGTGCTGAGAGTGGAAGGATGAAGAG 

AGCTAGGGCTTTGTGCCATCACTCGTGCTCTGGACATAAGTGGAGCTGGGATTCAGCATT 

ACCTGCCCCTGTACCTGCCC 

Sequence 644 

CGGGCAGGTACTAGTCCAGGTGTGAGATGAAGGGGGCCTGGATGAAGCAGAGGGTGAGAG 

ACAAGGAAGATTCTGAGGACCTTGTGGCTAGATGTGGGGGTTAAGTCAGGTTCAACTCCT 

AGGCTGGATGAATTGGCAGATGGCACATGAACTACAAGAGAATGGAAGGCAGAACCTATT 

TTGTGGGCAAAAAATAAATTACATTTTGCAATACTGAATTGAGGGGCTTCTTGGAAGTCC 

AGGTGTAGATGTCTTACAAAAATAGAATATTCTGGGCTGGGTGCAGTGGCTCACCCCTGT 

AATCCCAGCACTTTGGGAGGCCAAGGTAGGGGGATCACCTGAGGTCAGGAGTTCGAGACC 
AGCCTG 
. Sequence 645 

GGNCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCAC 

CMCTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGA 

CCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCT 

GGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACC 

Sequence 646 

CCGGGCAGGTACAGGGGCTTGGGGGCTTGGCCAGGCTCTTCTCCATCCATGCCACGGGGC 

TGACAGCCACAGATCTGGAAGCTCAGGCCTAGGAGTGCAGGCTCCGTTAAGCCCTGTGTC 

CAACATCCTGACTCCTAGGGGTGCCCAAGATTTGAGTGGCCACTTTCACCTCTGGAGGAA 

GTAATACCTAAGGCGCTGATAGAAATAGAACTTCCGCTGCCAGGCAAGGTGGCTCACACC 

TGTAATCCTAGCACTTTGGGCAGCCTCAACGCAGGTGGATCACTTGAGGTCAGGAGTTCG 

AGACTAGCCTGGCCCAACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTA 

Sequence 647 

AGGTACNTGTTTCAGTCACTGGGCTGANNTGGNNCACAGCACAACCTTCATAGCCACTGT 

ATGMGAAGTANAAGACCCAGACTCTTGCTTTATGTTGGTATCAAAAGTCATTTCAGAGT 

CAGGCTGATCACTCCCAAGTAACCCACTGACTTCTTTACTCCAGCTCTCTGTCTGCTGNT 

GACTCANA ANGT NACACTTNATTTTCTCCATTGCTGATATAATCATATCTGCAACATAAA 

AGTGGGCATTTTCCTTTTCTACATCAACAGGCAGCACAAATACCTCTGGTGAGAAGGAAT 

TCNAAGAAATGGTTNTTCTACTGACTTGAACAGCACCTTCATCAGCAGCAGATGTCAGAT 

GGGAAGGC 
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Sequence 648 

CCGGGCAGGTACCACTAGATTGCCTCCCTGTGCCTGGGCMTTTCAGAAAATGGTGGTTT 
TCCTTTCGTTrCCATCTTTTTTMGACTTAAAAAGTATCT 

GCCTCCATGCCTTGACTCAAAAAATGCTGTCTTAGTTGACAGCCTTGAAATGAGTATGAC 

CCTAGCTCTAGTTGGGTGGAAATCACCTCGCATAGAMTAGACCTGGAGGGCCGGGCACG 

GTGGCTCACTCCTATAATCCCAGCACTTTGGGAGGCCCAGGTGGGTGGATCCCGAGGTCA 

GGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCCGTCTCTACTAAAAAAAAAAAAA 
AAA 

Sequence 649 

AGGTACTAGTATGAAGGAAATAATATCCACACACTGATACTGGTCCAGCNGAAACCAAGA 

CCGCTCCTGGTGCATTAACTTTTAACAGAGCANGGACTCANTTCTCTGAAAATAGTGCCA 

TAAAACATGTGCTCCCAGAAGAATAAATATTTGGCTTGCTAGMTTTCTGCNGCTTTTNT 

GTAAAAGTTGATTATTCGGTATTAAAGAGGAGTATCAAATATGNGTNNATGNANNAAAAA 

CTTG GAAAN AGTANNGGACCNNGGCTTATCTCNTCATTTTCATTCTGCACACTNCAANTC 

ANTCNTTTTCCCATCTTNNTTCCCNTCTCTGNAATTTATCACCCTCCCCCTCT 

Sequence 650 

GGCGATGGACTCCACCGCGGTGGCGGCCGAGGTACTTACCACAGAGAAAAGCCAATAATC 

ACAATATATGTTGTCTTACTGACCCATAACCCATTTTCTGAGGGTGGCAGGCATTGTGCC 

CCCACTGTGAGGTGGGACTATATACATATACAAAGGAGGTGTTTAACTGGGTGGCATGTC 

TCAGGGAGATGTAANGACTTACCTGCATATCCTGGCAGTNTTGAAATGATAGTGAACTGT 

TCNTANGGCNTCCNCTTGATGGCATAGNCNAAACACCAGNATTTTCTTGGNAGAATGATT 

CGGNAAATGCTACATAGAAGAAATGGNTGGTGAGCTNTTACTGTGACTGTGCCCATAGTA 

AGTCATCCTGGACCCTCTGAATCTTATCCCAC 

Sequence 651 

GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGI I U I I I I ll I I I I I I I I I I 

TTTTTATGATATTCATCTATGAACCAATTTMTTTTAATTTAAAAATGACTTCTGATCT 

GCAGATGATTTGGGTTCTAGAAAGAAATTGTGCCAGGCATGGNGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAGGNGGGAGAAACCTGTNAGCTCAGGAATTGGAGACCAACC 

CTGGNAACGTAACAAGACTTNTTTTTTACAATATTAATAATAAAAAAAGGCCAGGTGCCG 

GNGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCCACCAGACAGGTGGATCATNAGGT 

CAGGAGTTCGAGACCAGCCTGGCCAACGTGGGGAAACCCTGCCTNTACTAAAAACACAAA 

AATTATCTTTGCTTTGNTGGCGGGAGGCTNTAATCCAGCTTCTAGGGAGGTTGAGGCAGG 

AAMTCNCTTGAATCTTGAAAGCAAAANTTNCAATNAGCCCNGGTCACACCATTTGCCTT 

CCMNCTGGGCAACMGAGCCAAATTTTNTTNAAAAAAAAAAAAAAAGGGCCGGCCTTGG 

GGGTNNTCCCNGGAACCCCACNCNTTNTGAGGCCCCACCGGTGGTNNTAGGGNCNGAGTT 

CAAMCNNCCTTCCACTTTTNGMCCCGTTTTTTAAATTCCMAAATTNCTGGCTGGATA 

AATCCGNAGCCCCTTCTTGNGGGTGNGGNNGGANTNTTTCCCNCAANNGGGGGTGGGTNA 

NNNCCGAMNCCCTTATTTCCCCTNTGNAMGGGNCTTTTCNAAAAAAAAAAAANA 

Sequence 652 

CCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGGCAGAAGGAAGGAAGGGCAAAGA 

AACA ATTCCAGGGCCCTGGTTTCTGGGATGACAGGCTTCCCMCACTCATGCCAGGACTA 

TTTTCCACCTCGGTTCACTATGGG I I I I I I I I CI I I I I I I I I AATATAATGAATTTTTAA 

AATGTGTGTTTGTTGCCCAGATTATTCCANAAAGAGTTGAAGGGAGGAAAGGNGTGCNTG 

GGGTGCNTGGGANTTTTANCCCTCTNTCCACCCNGATTTCTAAGTTGGGGGGGGCATCCA 

MCAGCTTCACCCANGTGCCCAGGCTNTTTTTTGNTNTCCAAAGCCAACCCTTCCAGGGC 

ANGGANGGGTGAAGNTTAGGAGGGCAAAGGTTAGCCTGGAGGCTGCAATTAACAAGAATC 

AMNTGGGGTTTMGGATTCTCACACCCAGTTTGCTMTTTCAGCTGGTCCTTGTAGAGG 

TGACACCTAGTAGGACMCATGGNTTTTNGGGCAGGGCTGGGGTGCTGGTCTCTGCTTTC 

TAGGGTAGA MGGAA TCATACATTGAAAATGCTTAAATCGATGGAATGATTTATGTTCCT 

NATCTTTC ATCTT TTTCTGNGTGGCTGGTTTTCTGCCANCCTTACTTGGACAAGCACCAT 

TCTANACCTTTTCTNTAGGCATNTCCNAGAANGNGAAGTNGAAAGGAAGAAAAAAAACTT 
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ATTTTTTNNGNCCAAAAATTNGGCAAAAAAAAA 
Sequence 653 

TCCACCGC GGTG GCGGCCCGAGGTACGCGGGGGCCCTTCTATCTCAGGATGTTTGCACTT 

GCTATTTC CTTTT CTTAAAGGCTCATCCCTAGATATTTGCATGACTGGCTTCCTAATTTN 

NTGTAAGCTTTTGCTGAGAAGTTACTTTACCAACTGTCATTGAGGTTTTCCCTGAACATC 

TTAGGTAAGATAACAAGCTCCCCTCCTTTTCTTTCCTCACTTCTTGGTATTCCTTATCTC 

GTAACI I I I 1 1 I I INGGGGGGATNGANACTTNCGTNTTGNTTTTGTTGGCCAAGCTTGGA 

GTGCANNGGGTGCANTCTTGGCTTNACTGAAACCTCCACCTCCCGGGTTTAAAGCGATTC 

TTCTGC CT TNAGC CTNCCGAGTAGCTGGGACTACGGGCAAGTGCCACCACACCCAGCTAA 

1 1 i 1 1 iG TATTTTTAAGTAGAGGTGGGGTTCACTGTGTTAGGATGGTCTCTATCTCCTGA 

CCTTTTGGGCCACCCACCTCGGCCTCCAAAGTGCTGGGATTATAGGTGTGAGCCAGTGCN 

CCCGGCCTCTCATAATTTTCTTAAATT 

Sequence 654 

CNAATTGGAGCTCCCCGCGGTGGCGGTCAGTTNGTCTTAGAGATACCCATGAGGTCACCT 

ACTCAAAATGGGGCTCAGAGTAGCCTTGTCCCATTCTTGTCCAGTGGGCGCAGCTACAGT 

CTNNCTGGNNNGGAGTGACTGGAGGCTGTCCCCACGTCCCACTTCAGTGAGGCATTCATG 

TGCACCCAGCACACTTTCTAGCTTTATTTGCCTGGAGGGGAAGATTCTCCAGAACCTTGT 

TAAGATGCACAGNGNGGGCCCTTGGACTGGCAAGTGTGGCCTTNGGCAGTCCCTNGGAGC 

TTGTTAGG AATGCAAAATNTTAAGCTTCTTCCTACTG NATCTAAAGGTTG ANTTTAAAC A 

AGATCCAGCTTGTTTCGTTTCACATGAAAGTTGAGGCACACTGCTCTAGAAAGTTCTTTT 

ATCTTTACTGGCCCACCAAAGTAATCAAACTTTGNGAAGTACCCTCGGNCCGCTCTAGAA 

CTAGTG 

Sequence 655 

GCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGATATGAAGTGAGGTTAAGTCAGA 

TGGMTGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTTGTGN 

NCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAATAGGT 

GCTCGAAAAACATCTGTTGAATGAAATGAATTCTTTTGTTTGCAGTAGGGCAAAGAAGGG 

TAGAGAGGAACNACTTTGCCAAGCTGATNTGTAAATGTTGCAAAAGGGTTTNGGCCAGAA 

AATTCCNANAACCGATTNGAGAGGCAATACATGTTAAGGGACCTNTAAGATGTTTCACAA 

CCTTGGAAATMTTAMAAGAACTTTCTACTGNTTTACTTATTTCCCACTCCTGGCTGCC 

CCTCTTGGGTGGACTGCCTNCTGTTGGAGGGAATACTGNGTGAGACACATCTTTTAGTAA 

AACAGAAATGTGAAACCAACTTGCAGAAATCACAAGCACACTGTTACCAAATAGGTCTTG 

ACTGG CTCCC TTNCTGGGGGACAAATGTTTTGATAATGTCTGTCAGTAGATTCAGTTCCC 

TATT TCI I I I I AAGACTTGATATTTAAGAATACTGGTTCTTTTTTGGCCAGCATCGCAAN 

GAAG I I I I I CTTTAACTTTTG GG CCAAAA AAAAAAAAAAA 

Sequence 656 

cgaattggagctccccgcggtggcggcccgaggtacaggatgttttcctaaattttaagt" 

camttcttccttgacacatacctatttttattttgttttggttctcatctctgtgaaca 

gagcaaagcatgcaaccattgtaacactttcatttgttrttataaactcaagttctagag 

ttggatttcatgatttgcataactcggcatagtgtaagtgcttgtagttttaaacagaaa 

aagagggaagaaatgacnattccanaaaaaaagatcaaatcttatgactgtaatttatta 

aggnatccaatggaattctttccctttttctttc i i i i i 1 1 i i i i i i i a aag agacaagc 

tcaagttccataagctgggaatgcagtatcatgatccatagttcacagcagccttcaact 

ccctggggttcaaggngatcctaagaacttgngggcctcaagcagtcctcctgcctcagc 

ctgccaaagtgctggggattacaaagcatgagccactgctcctaattcttaagagata 

Sequence 657 

GAATTGGAGCTCCCCGCGGTGGCGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGAAC 

TTTGGGAAGCCGAGGCGGGCGGATCACGAGGTCAGGAGATCAGGACCATCCTGGCTAACA 

CGGTGAAGCCCCGTCTCTACTGAAAATGGAAAAAATTGGCCGGACCGTGGTGGCGGGCGC 

CTGTGGTCCCAGGTGGCTGGATACACGGGTGTGCACCACCATACTGGCTGATTCTTGTAT 

TTTTGGTAGAGATGGGGNTTTGGCCNNGTTGGTCCAGCTGAtCTTGAACTCCTGCACCTG 
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CCTNGGCCTTCCAAAGTGTTGGGATTACCGGTGTGAGACACTGGCCCCTGGCTATATTTT 

ACTATTTGGAAATCACAATGCATCTTAAAATTGATGGCTTCTTGCAACCACTTTCAACCA 

GGTGCCTGTCATGATTTAAGTGCTAGCATCAAGGCAGGTTAGTTATGAAGAAATAGAGTG 

TGTGTTTATATACTCACACAGTTAGAMTCGACCCTTTTAAAMTTATTTCTTTTTGAAA 

A 

Sequence 658 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCNGGTACCGCGGGATATGAAGTGAGGTTAAGT 

CAGATGGAATGGCAGTGGACTACTGTTTTTGGTTAATAAATCGAGATACCCTTAAGAGTT 

GTGTTCTGAACATACTGTCTTTCTTTCCCCAGTTCCATGTCACAGCACCTGCCTAATAAT 

AGGTGCTCGAAAAACATCTGTTGAATGAAATGAATTCTTTTGTTTGCAGTAGGGCAAAGA 

AGGGTAGAGAGAAANCAACCTNGCACAAGCTGNTTGTTNAATGTTGCAAAAGGTTTAGGC 

CAAGAAAANTTCNAAAACCCATTNGAAAAGCATACATGTTTAGTGGAACCTTGAAAATGT 

TTTCACAACCTTGGGAMTAATTTAMAGTMCTTCTACTGGTTTTACTTATTTCCCACT 

CCTGG CTGCC CCTCTTGGGGTGGGACTGCCTCCTGTTGGGAGGGGAATACTGTGTGAGGA 

CACATCTTTTAGTAAAACAGAAATGTGAAACCNACTTTGCAGAAATCACAAGCACACTGT 

TNCCAATTAGCTTGACTGGC7TCCTTNCTGGGGGGACAAATGTTTNGATAATGTCTGTCA 

GTAGATTCAGTTCCCCTA TTTCT TTTTAAGACTGATATTTMNAATACTGTTTCTTTTTT 

GCCACCTCGCANTGGAAGTTTTNTTTACTTTTGGCCAAAAAAAAAA 

Sequence 659 

CCGCGGTGGCGGCCGAGGTACTGGTAAAGGGATAGTCACATAGATCAATGAAAAAGAACA 

GAGAATCTGTGAACAGACCATGCAAATATGCCTGCCTGGTTTTTCACAACAGTGCAAAAG 

CAACTCAGCCAACAAAAGACAGCTTTTGGCCAGGCCGCAGTGGCTCACTCCTGTAATCCC 

AGCACTTTGGGAGGCCCGAGGCGGGTGGATCAACGAGGTCAGGAGATCAAAGACCATCCT 

GGCTAATATGATAAAACCCCCGTCTCTACTAAAAAAACACACACCCCAAATTGCCCGGTG 

TGGTGGCAGGTGCCTTTNGTCCCAACTNCTTTGGGANGGTTAAGCAAGGGANAATGGCNT 

TGAACCCGGAANGGAAANCTTTGCCNTGGGGCCCANATTTNNNCCCNTTNNNCTTCANCT 

TTGGGTGNANAAAAANCNGGACTTGGTTTCAAAAAAAAAAAAA 

Sequence 660 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGAAGGTGAGCTTTGAA 
GATGCAACATGAATTTGACAGTANAGATGTAGGGAGGAAGGAAGGCAGGACAGGTCAGAC 
AGAAGTGCAGGAACAGCCCAGGCCTTTGCAGCCTTCCCACACCCCCTACAAGACCTGCC 
Sequence 661 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGGACT 

TGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTTTCTCCCCGCTTGCCGGGGTGGTCC 

TCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTGTGGAGAGGGACTGACGTGTTTGGA 

GGGAAATGGGAATGTACCT 

Sequence 662 

AGGTACTCCAAGCTCTGAGACCACCTCTTCTGCAAAGCCTTCCTGATTCTGCAAAGAACA 

GGTAGGCATTTCATCCTTGGGACCTCACAGCAATTCAGGACACATTTGTGTCCCAGCCCT 

GCTTGGCTTGGCTGTCTCCATGAATATACACTTTGTAACTTCTGCACCAGGCATCATACC 

MGCACACAGTAGGCACTCCTGTGTTTTTTGAATAAGTGACTATATCATCACCACATTTC 

AAATGCGGAATATATGAGCTACTAGAAAAGACATAAGGGTAGATTTTACATCTTTATTGT 

ATCCTAGATATACAAGTCTATTACTGCCTTTTCCCATGTTCTGTCAACATAGCATAAAGA 

ATGTGGATTTACCTGTTAGAAATTGAATAAGCGGCCGCTCTAGAACTAGTG 

Sequence 663 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGTTACACGGCAATTT 

TNTTAATAACTCCCTTTACTATGTCAGACAAGCTATGTCAAGCGTCTTCTGTATCCTNTA 

CGGGGAAAAAA AAAAA GTTAACCAGAGCCAAATGCTTGCTTTCAAAGATAACTTGCCATC 

CTGAAMTATAATTTTTACMTTCAATAACACCTTTTTGAAAAATAAAATAATNTGGCAA 

AAATG CCCA TGCATTAACAAACCA I I I I I I CAGTTTAATCTCTTTATATGTTCAACTTTG 

ATGTATTTTAAATAAACAAAGCAAAATTCAACTAAAAATACAATCTGGATTCCATAGCCA 
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ANGGTTTTATTTACMTTCCTANTAGGAAGGCm 
ACCTATMGGGMAmAMCCGTTTNCNTTG^ 

AGGANTTCCCAAAATGGGGAMGGGGMMAMGGGGGNAANNCCNTTTTGGCCTTTTNN 
GGNANTTTTAMAAAAMNTTTNCCCCCCCGNGNCCCCCAAMAAAMANNMNNTTTTT 
TTNAAAAAAAAAAAA 
Sequence 664 

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTGCTGGTTTTCTGGT 

GTTCCTACAAGCTGCCTAGGTCTCTTTTGTTCTCAGCAGTTCCCAGGCATGCAAAAGTTG 

CCAGTTCTGTCAGCATTCCAAGTCAGGTAAGACAGAAAGCCATCTCTTAGGCAGTCCCCA 

GAAMGCTG AAAG GTTGGATATACTTTCTACTCTTCTCTTTTCTTCATGAGAGAAAGGCC 

ATGTGGGCATTTTCTCCCAATAACACTGAGTTCTGTTGTCTTCTGTCGGCTGTGCTGCAG 

GTTCTCAGGTGCTGCAGTTAGCTGCT 

Sequence 665 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGTAAGATTTCCTGAAGGGATCCATAGCC . 

AAAACATGTTTGAAGGCCACTGGGCTCGCTAACTTCTAAAAGCACCCCAGTTCTAGCAGA 

CATCCTAAGGAACATTCCCAGGAAAATTCCAGCCTAGAACCTCCTGGGGTCTGACAACCT 

TAGAGMCAGTGCTGGCTTTGAATGGGCTTGGGGGCAGCCTCGAAACCCTCTTTCCAGTC 

TCCATGCAGGCAGGGGAGCTCCTTAAGCAACACATAGGACATTTCTGGGAGAAATGGGAT 

CCCCMCACAATGAACACTATAGATTTTAATGGTCTATATGGTTAAATACACAAGGCCCC 

TCATTTCCAACCCCGCCTGTTCATCTGATTCATCTGTACCTCGGC 

Sequence 666 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGACATTCCCCCAGTGGCT 

GAAGTGGCATATGAATTATGAAGTTGGATCATTTGGAATGAATGTAAGAGAATTGCCAAG 

GGCTCCTCCTACTCCAGAGAGGAAACCTCATCCAGGGCCATGAAGCCACTTCCTCACCAT 

CTGTGTGCTGCTTAAGCTAATGCTGCGGGAACCATGGTTCCTTGGGAGGAATCAAGCTGA 

CTCTTGGCATGAGATTCCTGCCTTCCTAGGGTTGAGAGCGGCACTGCCATGGCTTCTCTG 

GACGACCCAGGGGAAGTGAGGGAGGGCTTCCTCTGCCCTCTGTGCCTGAAGGATCTGCAG 

TCTTTCTATCAGCTTCACTCACATTACGAGGAAGAACACTCAGGGGAAGGACCGTGATGT 

CAAAGGGCAAATTAAAAGTAAGAGGCGAGGACCTTGCCTACCCCCTGCTTGTCGTTGAGA 

GCTTTMCTCACNGGATAGTTCTTATCACTTTTGGTGGTGGCACAGGNATATGATAATTA 

GTAGTAGCCAACAGATGACTAGTNGTTGTCATGTGCCAAGCGTTTTAAAAGTTNCCTGTT 

ATTAATTTCATTA 

Sequence 667 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGAAGAG 

AAAGCGTGAGGGCTGGGCCTGCGGCGGGCTTTAGGGAGTGGTCCCTGGCTGTGGATAGAT 

CTGCTGATGAGTCCAGGCCCCGGTCCATTCTCCTCGCGCTGCAAGGATGCTCCTGGGATT 

TCGGAGAGGCCGCAGGAGTCATTTCAAACACATCATCCATGGCCTTTTACCTGCAGCCAG 

CGTTGCTCCGAAGGCAGCTGTGCCACGCACACCTCCTCCCCGCAGCCCCAACCCATCTCC 

AGAGAGACCAAGATCTGCTCTGGCAGCAGCCATTCTGGCGACAACATTGAC 

Sequence 668 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGATAGGAGAGGGAGGAGGG 

CTGGCAGCTGAGTAGCCAGAATCAGGCAGACGGTGGTAGCAGAAGTCAGAGGCCGAGGGA 

ATCAGGGAAAGGAGTTCAACCATGGAGGACCTTGCTGGCCAGGTTAGAGACTGTGGACTT 

TTGTCTGGGTGAGACAGGAAGTCACTGGAGGGCTGTGACAGAGCTCTGAGGCTGTGAGGC 

ACTGCTCTGTCAGTGCCATGAGTGGGGAAAACAGGAGCTTGCTGCACTGGTAGAGACCAC 

AGATAATGATGACTTGGACAGAGCAGCTGGGAGAGAACTAGTTCAATAACCCTAACACGC 

CTCTCCATTCTGCATTTTCCCTAAAAATGTACCTGCCCG 

Sequence 669 • 

CCGCGGTGGCGGCCGAGGTACCTGCCCTATCTTGCTGAATGTTTTATAATCTAATAAAAC 
TCAGATAAAGACCCAGATGTCACACCTGAACAGGAAAAGCTGAAAGGAAAAGATAATTAA 
MTATAAATCAACAGMTCAAGATTTTGAAAAGACCTAGAAAACTTGAAGGATTACTGAA 
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GCCAA GCAAGAGGAAATGATAGGATTAAAAGTAAAATCTTGTGTTTAGA I I I I I 11 I I I I 

TTTTTCAAACGGAGTCTCGCTCTGTCACCAGGCTGGAGTGCAATGGCGCAATCTTGGCTC 

ACTGCAATCTCCATCTCCGGGCTCAAGCAATCCTCCCACCTCAGCCTCCCTAGTAGCTGG 

GACCACAGGCATGCGCCACACGCCTGGATMTTTAAAATATATACATATTTTTGTAGAGA 

CAGGGTGCTGCTTTATTGCCCAGGCTAGTCTCAAACTCCTGGCTTCAAGGCATCCTCCTG 

CCCCAGCTTTCCAAAGTGCTGGGATTACTGGTGTGAGCCACTGTGCCGGGACATAAATAG 

TTATGCTGTATTGGTTAAGGAATAATGACA 

Sequence 670 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAA 

GGG CTGTG TTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGTGGTGCCATC 

TACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGG 

TCCTGAAA TAGTC ACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATT 

CCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCAGATGC 

TGTTGCTGCACAGATCCTGTCACTGCTGCCATTGAAGTTTTTTCCAATCATCGTCAT 

Sequence 671 

CCGCGGTGGCGGCCGAGGTACAAGGAAGGCC7TAAAGACTGCCCCACTCTCCTTGTTTCC 

CATCCCCTGTCCCTTCTACTTCTCACATTCACCACTATGTGCCCTAGGACAAAATCAAAT 

GTGGAAACATTTGGTCATGTCTACTTTGTCCAAGGGTGGGAGTTCTTGAGGAATTCAAGT 

GGGAACTAGAACMCTTTCTACCCTTTCCTTCCCTTCCTTCCTCCCCACCTCTACCTAGA 

AGCCCATCAATCACTTTGAACTTCTTGGAGAAAAAAGGAAACAAAAGAAAAAGAAAAGGA 

GAGGCTGGGTGCGGTGGCTCATGCCTATAATCCCAGCACATTGGGAGGCCAAGGTGGGTG 

GATCACTTGAGGTCAGGAGTCGAGACCAGCCTGGCCAAAATGGTGAAAACATGTCTCTAC 

TAAAAATACAAAAATTCGCTGGGTGTGGTGGTGGGTG 

Sequence 672 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACACTGGTGGGG 

GAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCGCAGGGCCTCTACGGACCTTACTAGA 

AAAATGAAACCTGATGAAACTCCTATGTTTGACCCAAGTCTACTCAAAGAAGTGGACTGG 

A GTCA GMTACAGCTACATTTTCTCCAGCCATTTCCCCAACACATCCTGGAGAAGGCTTG 

GTTTTGAGGCTTCATGCCAGAAAGGGGMTGGGGAATGGCTGCTTAACGGCATGTNTTTT 

TT 

Sequence 673 

CGCGGTGGCGGCCGGCCGGGCAGGTACACGATGAAACGGGGGTAAGGAAGGAGAAGAAAA 

ACATTGAAAGGCATTTGACAGGGTAAGGTTGTATTCCCCAGACAACCCTGTCAAGCAGCT 

CTGAAGGG ATGATG AGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGAC 

CATGTCCTATTTTTTGGAGGCGTTGGGGGGAATTGTCTTACGCAGCACCCAAGCACACTG 

CTATGCAATGGACCACAGATAGGAAGCAAGCACTGCATTTGGCTCCCCQGCGTACCT 

Sequence 674 • 

CCGCGGTGGCGGCCGAGGTACTAAATCATTAA7TCATCCTGAGCTAGTGGCTTTATTAAT 

GAGTATCTCACAAATACCACAAAAATTCAACCTGGCCATGTGGAGCAATATAAAATTATG 

GCATTTCTTGGTATGTTTTTCTCTTTGGCGAGGAGACAACTTGATCTTGTGTTTC^ 

G CATGTTA ATTTGCCCTGCTTG CAG AATCTCTCTG GCTTG AAAG G AG ATTATATTCATG G 

CAGTCTGTGAATTTTCATTTTATTTCATTTATTTATTTGAAGACAAGAGTCTCACTCCAG 

CCTGGGTGACAAGAGCAAGACTCCCGTCTCAAAT 

Sequence 675 

ATANGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGGAGGTACCATGAGGGGAGA 
ACCGGCAAGGGGTGCCATTCTAGCATCTGGGTGGGAGAGAGGAGGCTGAATGCCAGGGGA 
MCTTCTTGGAAAAAGTGATGCTGAGTTAGGACAATTTAGTCAATGAGAAGGGATCTGGC 
TGTTCCTGGCAGTGGAGACAACATNTTTAAAGGCATGGGAGAATATCTAAAATTTACCCT 



Sequence 676 

AGATAATAACATCTGATATCCACATGGGGTCTGGAGGNGCAAGCCACCTTCCTTTCATCC 
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CACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCTCTCTGTTGGAGGCTAAGGGCCAGTGT 
TGGAAGGAGCTCGGGTGGAAAGTGTGGTCTGCATGAGGGGCTCCCGTGAATAGAGGAGAG 
GGGTGGCNGGTACCTGCCCG 
Sequence 677 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACCTGCCTC 

TGCCAGATACCCCTGAGGGAAGAGGATGTTCTATAACCAGGCCGACAGGTTAGCATTTGT 

GMCACAGTTCTGACGTTGTTGGGAGGGTTTGTTTGCCAGAAACATCCCCATGCGCTACT 

CTTTCMCCAGAGGTCMGAAGTCCTTTACTTTTGTGTC I I I I I I GTTTGTTTGTTTGAG 

ACGGAGTTTCACTCTTGTTGCCCAGGCTGGAGTGCAATGGCGCAATCTCGGCTTACCACA 

ACCTCTGCCTCCCAGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATCA 

CAGGTGCCCACCACCACGCCTGCTAATTTTCATACCCGGGTACCTCGGCC 

Sequence 678 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTGTGGTATTTCA 

CATAATATTAACCAGC TGTTA GCAATGACTGATATATACTTCCCATTGAAAATGATGTAA 

G GTCTGM AGGATTCATTTTGACMTTTTATATCACATATTTATATTTACCTTAGGTGGT 

TCTTTTTMTGTTTTMTTTGGGACCACACTMTTTCTMCTTGGTAACTCATCTCTTAC 

CMAATTMTACCAAGCCAAGAAAAATGGTTTCATGAATAGAATCTACTAGTCTTTTATA 

TCTT ATAAT GGTAGATCACTGATGAGGTAGAACTCCATAAGAGCTTCNCTCTCACAGTNA 

MGGTTTTGGTTGT GCATG GATTACACCTGGTGAAAGTTGGTTAGTATTTGTCTAAGTGG 

CTTAAGACAAATTTATTTTGATTTGTATTGTGAATGACTT^ 

TNCCGCTTCGTGTGTNGTGTGTGTGTGTGTT 

Sequence 679 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCCACATAATAGCTC 

AGT GCATGC AATTTACAAAGTAATAAGTGAAATGCTCCCCATAGTTGACTATAACATTTC 

CTCA 1 1 I I I CTCTGAATTTGCTTTTTAAAAAACTCTTCCCCTTGCCATTCCCTTCCCCAT 

TCCAGATTGTAACTGCTTCTTTCCAGCTGCATCAGAAGAAGGGGACTTTCCATGTAGGTG 

TTATTCTCAGAAAAGGCCAGAAAAGACCAGGTCATGGTGGGGATGATTTGCTCCAAGCAT 

AAAAGAGAATTGTGATGGTTCAGGAAGACTGGAAAATAACGAGACTGGAAAGAAATGAGA 

AGGGCTTCAGAGGAATGGCACATTGAAATAAAAGGGAAGTGGTAAGAACAGGAACCCAAG 

NGGAATGAANGGGCNCACAGTGGCAGGGATGATTGGATAGACTGTGGAATAAAAATAATT 

TG 

Sequence 680 

AGGTACAAACTGGCTTCTTCTCTTTGTCACCAGCACCTGCTTCATAGTCTCTCTGGAGTG 

CCAGGMCGGGTCATTTAGATTAAATCTCCCATACCGTTCCTGGATAAATACCTCCTTCC 

TGCGAGCCCGCAGGGCCTCGATGACAAGGTCTCTGGCCTCCAGCTCCCCTTCCATCACGC 

TGAGGAGCATCCGCAGCTCGGATTTACTGAGAGTATCCACATCMACTCTTTTTTCAGTT 

TTACMGTGGAMTTAAGCAGTCCTCCTCCCCGTTTCTCCTTCCATTGCCAGGCTCAGCT 

CCTCTCACCCCAAGTACCTGCCCG 

Sequence 681 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGTGGAAAGTGAGGTGGTTT 

TATTTCTAGTTCACCATTGTCCTTAGGCTGTATAGACCTCTGGAATCCCAGCTTATGTGG 

AGAAGGTATCCTGTTAGACTTCCCTCCTTTGGTCAGCACTGGGCCTTAACTTCTGGCCCC 

TCAAAGCTGCTAAAACTGAAGGCCAGGCTTGCCTGGCTTGGCAAAGGACGTCGGGCAGAA 

GCAGCTTCTCCTCTCCTCTTGTTTCTCTGTTTCCCCTCACCATAGGCTTTGGCCTGGGAG 

TTTTCCTACA 

Sequence 682 

CCGCCCGGGCAGGTACCTTCTTGGTTGCTGTGACTGTCTGCTAGCACTAAGACTGTCTTA 

AGCAGATAGAGGGCAATGGTCTTTGAAGGCAAATGACAAAGCGTGGCCCTGAGCTCCCTG 

ACTGAGTTCATTTGGACTCTCAAGGGATGCCCTGGAGCTAGACTCGATCTGAGTGGTTGG 

ACTAACTCCTCTTTGTTTTTGTATTGMGAGCCAGCTTACCCCGCCATTTNTAAACCTCA 

GGCCAGGAAAACCAAAAAACAAAAAAACCCAACCCAAACAAAAAACAAACCCCACCTTCT 
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TNAGAANTNAGTAANCTTAAGGCTTNAAGAATCAACAGNGCCCCTTTGGGNATTAAGGGC 
CATT 

Sequence 683 

CCG CGGTGG CGGCCCGAGGTACTATAAAATACTATCCTMCTTTTTATGTGTTTTTTTAA 

CTTGTTTTTTAGMGTTTTGTAGCGGTTTTTAAM 

TGCTMTATCTGTATCTTTTACTCTATMCCTAATTTTTACATm 

TACAACAATGTAAAAAATACATGGCCCGGGTGCGGTGGCTCACGCCTGTAATCCCAGCAC 

TTTGGGAGGCCGAGGCGGGTGGATCACCTGAGGTAAGGAGTTAGAGACCAGCCTGGCCAA 

CATGGTGAAACCCCGTCTCTACTAAAAGTATAAAAATTAGCTGGGCATGGTGGCAGGCGC 

CTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCAGGAGGCAAA 

GGTTGCAGTGAGCCCAAGATCCGCGCCATTGCACTTCTAGCCAGGGAGAGAAGAGCCGAG 

ACTTCATCTTAAAAAAAAAAAAAAAAAAAAAAGGTC 

Sequence 684 

CGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACACTCTGGCCCTGGCTTTATTT 

TTAGATTTTCCTTTCCCGGTTGATATCGGAAGGCACAGAGGCAGGAGGTGGGGTGGATAG 

TAATGTGTGCCCCCTTGGGGGTNANAGTGAGGTGGAGGGGGATGTTAATNACCATGAGAG 

GCAGAGGGTCAGNCNANTTTCCANNGCTTCNNGCTTCCTTTAAATGANGGAAAACACGTG 

CANGTNTTAGGAGACAAAGGAAGGGAANTGACTGTTTCCTGGCCTGGTNTGTGGGCCCAG 

TNGNCTGNTNCNTTCAGTGNCTNCGTGCANTTNGACTNTACACNTANGNNGGCAGGCATA 

GGTGTNCGGTTNTGAAAGACNGNNNTCTTTNCACATTCTCTNCTGCTCTAGGGACTGAC 

Sequence 685 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTAGTAGAACTCTA 

GGAACTGACCAAACCCTTTTAACAACCAGGGGAGTGCTTGATGAAGGAAGAGGCTACCGA 

TCTTTCATAAGTATGMTAAGCAGCATGCATAAACCAATTACCTTCCCCTATTCCTCACA 

ACCACCACCACCACACCCACCACCTCTTGTGGCAGTGGGGATAGCAGCCCATGTTCCTGG 

AGTTGCTAACCGGTGCCAGGAGGGACAGTAGGGATCATGTCCTTCAAAATTTAGGGTTGT 

ACCT 

Sequence 686 

CCGCGGTGGCATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGG 

AAATGAGTGTGGAAAGAAGAATGCANAAATCTGAGGACTAGAGCCTGGAGATGGGGAGCT 

TCGAGCTCAGAGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCTTCTGAGTATCT 

GGGACTGCAGGTTATGTGCCACCACACTCGGCTAATTAMAAAAATTTTCTTAGAGACAG 

GGTCTCTCTACGTTGCCCAGGCTGGTCTCAAACTCCTGGGCTCAAGTGATCCTCCTGCCT 

CAGCCTTCCAATGCCTTGGGCTT 

Sequence 687 

ATGCATCAAGGTGACAGGTGACGGCATGGTTATGGATTAACTCACCAAGGAAATGAGTGT 

GGAAAGAAGAATGCAAAANTCTGAGGACTAGAGCCTGGAGATGGGGAGCTTCGAGCTCAG 

AGGAAGAAGAGGATCTTCATCACGGGGAGACATCAGCCCTTCTGAGTATCTGGGACTGCA 

GGTTATGTGCCACCACACTCGGCTMTNAAAMAAATTTTCTTAGAGACCAGGGTCTCTC 

TACCGTTGCCCAGGCTGGTCTCAAACTCCCTGGGGCTTCAAAGTGAATCCCTCCCTNGCC 

CTCAGCCCTTCCAAATGCCCTTGGGGGCTTACAGGCCATTGGAGCCCCACCATGTGCAAA 

NGAAAAGAANAGCMTTTTTGGACATCCTGCCCMAACAAACAAAGGTTTGGGCAATGGG 

TCCTGGTCAAGCAAAAACAAGTGGGGTTTGGNGGAATAAACCAACCCTTGGGTTAAAAAT 

AA 

Sequence 688 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACT 

TGAAAGGTCACGTAGCTGANACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAA 

AGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCT 

TGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGA 

GCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGG 

Sequence 689 
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CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATACTCATTAGAGTTGCTCGGTGG 
AGATGGAATGATGGTGGGGTGCAGTTAAACATGGCTGAGTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCAGGTCATTCATATTTGGGGGAAGGAACAAAGAGGGTACCT 
Sequence 690 

CCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGCTATGAATCTCACA 

GATGTAATAATGAGTTAAAGAAGCTAGGCACAAAAGAATATTACTGTATGATTCCAATCA 

TATAAAGTTCAAACCAGATCAAACTAATCAATGAACGAGGAGTCAGGATTCTGGTTATAT 

TCAGGGATAGTGATGGAAGAGGGCTATAAGGAGGGTGTCTGGGTGCAGGTCATGTTCTAG 

ATCTTGATCTGAGTGGGGGTTACATAGGTGTATTCACTTCATGAGAATTCAGAGGGCTGC 

ACACTAATGATCTGT ATAAT GCTCCTCTATAGTATGTCACACTTCAAAAAAGTTTACAGA 

MCAGTTCCTTCCTAATTTTCACAGGGCCTAAGAGCTAAAAACGCAGCCCCAG 

Sequence 691 

NCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGG 

AAGACTTGAAAGGTCGCGTAGCTGAGACTTCAGGAGAGACCATTCAAGGCTTCTGGCTCT 

TGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGAATTCTACTGGTCACAGACAAGA 

CTCTCTTGATCTGCAAATACGACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTC 

CTCTGAGCGCTGTCTATCGCATCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGG 

ACAAGAGACAAGGAGAAGGCCTTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCTCTTC 

TGTCCCGCTGGAACCCATGGTCCACTGAAAGTTCCTTATGCTACTTTCACTGAGCATCCT 

ATGAAATACACCAAGTGAGAAATTCCTTGAAATTTGCAAGGT 

Sequence 692 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGCAC I I I I I I I I I I I I I II I I I I 

TTTTTTTNCCATANAGATGGGGCTTTGCCATGTTGCCCAGGCTGGTCTCAAACTTNTGAG 

CTCGAGCAATCTGCCCACCTCGGCCTCCCAAGGNGCTGGGATTACAAGCATGACCTGCCG 

NGCTGGCTAAAGTTTCTTATTTATACTTACTCATTCCTCTAATATCTGGATTTCCTTAGT 

CATCTGTCACTTCTCCCTGCATATTTCCTGTGATGTCTTTAGGTCCCTCCCACTNTTGTA 

GTAGCACTCCCTGGGGACCMTTTCGMGGATGCTGAGTCATATGGTTTTTGGTTTTGAG 

AGGGTTGAAAATGGAGACTCAACTCAATTTAGGAGCTATCCCATCATAACTAGTAGCAAA 

ACACGTCACTACTTGAGTCTCAACAAAAGACAAAAAAGGTTTNAAAGTTGGGGAACAAAT 

AGCTGCCAAGGGTTNTTNTTNTNTGACAAAAACATTGNGTTGGGGATTTAAAATCNATGT 

GAATCCTTAATCCCTAACTCATCCATGTTGGGG I M i ll 

Sequence 693 

CCNCGAATTGGAGCTCCCCGCGGTGGCCGCCCGGGCANGGTACCTGCTTNCCAGAAGTGT 

TATCATGATTAAATGACAGACCAGTGGCAGTAGCATCTCCTGAGGGAGGGTTAGAAATGC 

ATATTCTCAGGCACCACTGCAGTCTTGCTGAATCTGAAGCTTTGGGGATGGGACCCGGTA 

GTCTTTTTGGATMCTCTGCCMGNGGTTCCMTGTGCTCMGTTTGAGAGTTGCTGAAT 

TAAAGCGCTGGGTCTTGCCAGGCATACCTGTAATTCCAGCTCTTTGGGAGGCTGAGGTGG 

AAGGATTGCTTGAGCCCAGGAGTTCGAGACCAGCCTGGGTAACATAGCAAGATCCTATCT 

CTACCAAAAAAAAAAAAAAAAGTACCT 

Sequence 694 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGGTCTCTGTCTTTAC 

AGCTGAAGCATCAGAGGATGGAGTGACCAGGCTGGTTCCAATGACAGTTATACGGCCATG 

GGGAGTANACATGGAGTCTAATTCAGTGCTTGAGGCTAAGAATGAAGTTGTATGCATTGT 

GGAMTTGTTCCAGGAGATCTTGCAACTTTCAAGTTTGAAGTCATGTCTGTGACAGTCCA 

GGAATNTGATGCAGCTGTGGAAGACCAGGTGGAAGGGTGTTCTGTAGAAGTTGTGCGCCT 

CTCTGTGGCCGGGGTGCTGTCCATGGTACCT 

Sequence 695 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACATGTATCANG 
GAAMGAAAACGTTATTTGTCCCACAGATGCTGCTAGGAGCAGCTACCCCAAGACAGGCC 
TTGCACCTTGGGTCATGACAATGCGTGGCTACTGAGAGCTGTTGACAGAGTGGACAGGGC 
CCAGACCAGGACAGTCTCTCTAGAGGTCTTCACCTCCTCAACCGTAACTTAATCAGCCCC 
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ATGCCGGGCTAGCCCCATGCCACAAAGGCTCAGAAATGCCCTGCAACATGTGGGACACCT 

GGTAGTATCTACATAGGGGCCAGCATCCATCCCAGCTGCTGGGGGTGGCTCAAGAGCTGT 

GAGGGACACCCTTTCCTGCCTGATACCGTGGACCAGTTTGCAAAGAGCTGACTGTCCTGC 

TAGGCCCA 

Sequence 696 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAC I 1 I 1 I I I 1 I 1 I II I 

TTTTTGTATATTTAGTAGAGATGGGGTTTTACCATGTTGGTCAGTCTGGTCTCGAACGGN 

TGACCTCAAGTGATCCGCCCACCTTGGCCTCCCAAAGNGCTAGGATTACAGGCATCAGCC 

ACTGTGCCCAGCCAGCCCTATGCTTTTAAGAGTTCGATGGTTGAAAGAGACTGAGCGGGG 

MGGTAGAGCGGGGCAGGGGAGGGACTACTTGGAGTCAAGTCAAAGTTTTAGGGAAAGAC 

CTGAATCTGAAAAAGATTATTTAACCTTTATGTGTCTGAAATACTATATTGTGCGAATTG 

TACCT 

Sequence 697 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACAAACACGACAGAAGCCCACGGAGCAAGCCC 

TGTGCTGGCCCCTTCACATGACTTTAGGCCCTCTAGCMGGTGATGTTTATTTACAGGGT 

TGCATAMCAAGGCCTCACCATTCAAAAAACCTTGTATTCTATTACATGTTTCACATTAA 

CMAGACTGGAMTTCTNTAGGAAAGGGATCTTTTTTATCTACATGAAAAGCACAGGCTA 

GTAAAGACTTGTTGAAAAAGTTGAAAGAACATAAATGTATATGGTATATGCCACATAGCA 

TAATGGAGGAAGATAGCAAATAGGAAACATATTGGTGAGGAAGACTGGAGTTTGATGATC 

TAGTCAGGAAMCATCAAGTTAAATCCTTTACTTTACACCTAAACCATAAACTGGTGAAT 

AAAACAAGTATGTGAAAGCACAANANGAGAGAGGACAGGCCGGGCGCAGTGGCTCACGCC 

TGTAATCCTACACTTTGGGAG 

Sequence 698 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATCCTATGCAGNCATNC 

TGTNAGNACCCATTCCATTNTNCTATCCCTGGNTNGCTGGTGTCAATACTNTNAAGCGAN 

TTACTGCNGGNGCTCTNNTTTTTCCCTCANAGATACCNNGTTGATTTCTTTGATTCTCTC 

CATCTCTACAGGCATAATAACTCCTAATATTTAAAAACNCTGTAGAGGGATGNANNGAAG 

CTGNGGNGAGAGCCCNTGGGCTTTTCNCNTGGGTNAAGATGCACATTCCTGAAAATTNTG 

GGCCTTGGCTTAAGCTGNACTAGNGCCGGCCACTCAGCTGATCTCACTAGCGTCACCTGT 

CGCAATGGTGCTGAAGCGCACTNCCNAGAGGCCATAAGGCAAAGCGAGAGTNCNTGGCTA 

TNGACTGGANCCCATTTAAGCAAAAAAACATGCCTCNCGNANGACAAATTCNATCAACAA 

AGGGNGGGCAATACAGGATCTGTACCTGCCCGGGCGGNNCGGGCANGAACCI I 11 1 I I I I 

TTTT 

Sequence 699 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCAGCCCGCCACCCGGCTT 

GTGTGTCATCCTGGGCCCAGGCAGGTGATGATGCCAAACACAAGGCCCAGCTATGTAACC 

MGTAMMCTTTCATCAGAATGCCCATCTTTGTGACCCACAGCCCATTGTCAAGAGCCT 

T CCCT GTGCCAGGAGTTCAGCAGGTTCACCTCCGCCTCCACTAGTCACTAAGACACGGAT 

ATTTTAAGAATTAAAAGCCTCCACAAGCCAGGCACAATGGCTTACACCTATAATCCCACA 

ACTTTGGGAGGCCAAGGTGGGAGGATCACTTGAGCCAACGAGTTCGAGACCAGCCTGGGC 

MCATAGCGAGACCTTGTCTCTACAAAAAAATTTAAAAGTTAGCCAAGCATGGTGGGGCA 

TGTCTATAGTCCTAACTACTTGGG 

Sequence 700 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAGTAGAGGCCAGGTTTCACC 

ACGCTGGCCAGGCTGGTCCTGATCCCCCGGCCCCAGGTGATCCGTCCACCTCAGCCTCCC 

AAAGT GCTGG GATTACAGGCGTGAGCCACCGCACCCGGCTC I 1 I I I I I I I I I I I I I AAA A 

TCATGAT TTTM CAGMGCCTCCATTCAAGGCGAGACATGCCTTTTATTTCTCTAATTGC 

GAGACACTTTTCTGMTCCTCTTGTCAGTTGCACCTTTTAATACAATTGAGGTGACACTG 

TTCTTCATG GTGA CACTGGTCTTTCCCMGAGGTTTCAGCTMTTCAGTCTATCAGATTT 

TACATCAGATTTTAAAATTTGCTTCAAACTTGGGTGCTTGTATTCAAATTCATGCTTCAT 

AGAAAAATGCATATCAAGTTCAACAGTTGACTAACTGCAGCCACGTTCACAGTACCTGCC 
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CGGCGGCCGCTCTAGAACTA 
Sequence 701 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAAGGCTGATACACAC 

TGACAGATTTTGTMCMGGGACATTTAMACTGAGCTGGTMTAGACTTGATTTCTGGT 

G TTGCC ACTCAATAGGCATGACTAMTAGTGTACCTCACTGTTCTACTTTTTATAATTAA 

MTTTTAGAGGAAGCT GAGTTCTTGTATTTAACTACAAGTTAGAGACTCAGCCCACAAGC 

I I I I I I I I I I I I I I I I I AATATGGTTTC I I I I I I I I I I I I I GAGACGGAGCCTTGCTNTG 

TCACCCAGGCTGGAGTGTAGTGGCGCGTCTNTGCTCACTGCAATCTCTGCCTTCCCGGTC 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACCGGCGTGCACCACCACG 

CCCAACTMTTTTAGTATTTTTAGTAGAGACGGGGTTTCCCCATGTTGGTCAGGCTGGTC 

TTGAACTCCTGACCTTGTGAACTGCCCACCTTGGCCTCCCAAAAACGCTGGGGTTACAGG 

CGTGAGCAACCATGCCCAGCC I I I I I I M I I II I I IATTT 

Sequence 702 

AGGTACGCGGGATATATNTAAATTTAAGAAANCATCCCCGGTAATATGGCTCTTCATAAT 

TCTAAGACTAAGGCTGGNGTAGAAACCTAACCACCTACCTTACAAGNGAAGGGGGCTATA 

CCATGGGGTAAGCCMGTTTGAMTTTATGGGGAATCNTACCCAACTTGGNTTAAGGGGG 

CCCTNGGATTTGGCCTNGGGGGCCAAGNNTTTTCTGTAI I I I I I ATAAAAGGTGATCTTN 

CATNGGTATTCCCTTGGTTTACCTTGGATAAGGGGGGATTACCAATGCCTTCTTAAGGAA 

AAAAATTCACCTTATTTGGGCCTTGGGGGGAAGGTAGGGTNGGGCTTCAATAGCCC7TGG 

TAAATTCTCCCAAGCCACTTTNGGGGMGGAAGGGCCTGGANNGTTTTGCCGCCCCACTT 

ACCACTTCCCAAGCCCTTGGGGGGTGGAACCAAGAAGTGGGAAGGAACCTCTTGGCCCCT 

CAATATNNAAAAAAAAATNAGAAAGGNAANATTNCACCTATTCTTACCANAACCCCTAAG 

NACCTMTTTTTAAAAAAATACCAAAAAGAATTGGCCCTNGTTTNTCAAAAACCACCTAA 

TTTGGGAMTAAAANAANGGGGTGGAAGAATTATTTTCTTTAACCCNNATNGGAATAAAA 

ATNNATNNNATTNGGGGNTCCCTTGGCCCCNGGGCCGGGCCCGCTTCTAAAAACNTAAGN 

GGGGATCC 

Sequence 703 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGGGGCAGGTACTACTGACTTACCTGCC 

ATGGGCTTTCTCAAGACAAGTCCTGCAGGAGAGGCACACTCACTTCTAGCGTCAACTATT 

GAACCAGCCACTGCCTTCACTCCTCATCTCTCAGCAGCAGTGGTCACTGGATCCAGTGCT 

ACATCAGAAGCCAGTCTTCTCACTACGAGTGAAAGCAAAGCCATTCATTCTTNACCACAG 

ACCCCAACTACACCCACCTCTGGAGCAAACTGGGAAACTTCAGCTACTCCTGAGAGCCTT 

TTGGTAGTCACTGAGACTTCAGACACAACACTTACCTNAAAGATTTTGGTCACAGATACC 

ATCTTGTTTTCAACTGTGTCCACNCCACCTTCTAAATTTCCAAGTACCT 

Sequence 704 

CCGCGGTGGCGGCCGAGGTACTGTGAAAGAACTAGCACTTTGAGCAGAGAACAAATGCCT 

TACTTGAGTTTCCCCTGGACTCTATCCCTATTCAAAGATGCTTGGTTATACCTCAAGAGG 

GAAGCAATCCAGACCAACTCCTATGACATGACAGGCACTCAAGCCTGATGAGGCAGAAAC 

CTGGCAGCTGTAGATGTTGGAAAGGATAATTTATGTGTTCAAGTGACTAGGATTCTAAGG 

GCTAGATGCTAGCTTCMGCACGGCTGGATCTAGGMGCCCTTTTGCTCTCCCTTTTCCT 

TGGTCTACTTTTCTCTGTAGGCMGTTCATTCATTCTTAGGCAAGTTCTCTGCATGTGGC 

AGCAATGATGGACACTGGAATCTCTGGGTATTCAGAGTTCTTTCAGTAGCAG 

Sequence 705 

CGGGCGGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTA 

GGTGCTCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGC 

CCGTTCTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGA 

GGGGCTGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGA 

ACCCCAAANGGTTGGGACAAACACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCA 

TTTGTGGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCGTACCT 

Sequence 706 

NGGTTAANTGCCGCCNCTTGGCCGTAATCATTGGGNCATTAAGCCTGGTTTTCCCTNGTG 
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TGAAAAMTTTGTTATCCCGCTCCACAATTTCNCACACCAAACATTACCGAAACCCGGGA 

AGNCAATAAAAAANTGGTAAAAAGCCCTTGGGGGGGTGGCCCTTAAATGAAGGTGGAAGC 

CTAMCCTCAACMTTTAMTNTTGGCGGTTGCGGCTCAACTTGGCCCCCGCCTTTTTNC 

CAAGANGCGGGGGAAAAA 

Sequence 707 

GGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAACATCCATGGCAGACGCTATT 

CTTTCCCTCTTAGAGATGCAGACACTGAGGCTCAGAGAAGTTGTCCCGCACCCAGTATGT 

GATGGAGAGGTAGAGGGTAAAAACATCAACTGAAGGATTTAGCATTTGGGGAAGAAGGAA 

GAAGCCCAAAATGGAGTAGATCAAAGGCTCCCCCGTGAACAAATTTAAAATTAAGGAGAA 

AGAAGCAGAATTCAGTCTTCTCCACACCCATAACCAAACAGCTCCTATGAAGGCACCAAG 

CCTGACGCTCATCCCAATAAAAAGGAACGATCTGGAGAGAGGGGCAGCCGCTGGTGACAA 

GAGAACCCCCCAGGCAGCCTCGTCATCTGGCCAG 

Sequence 708 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTGGGGGACAGATATCATAT 

AGGGGATTCATGTCAGTGACCAAACGAAGTGACCATTACAGCCCTTTNGAAACCTGAGGT 

GTAATTTTTAMAATGAACTCATGACTTTAATAGTCATAGACTCAAACCTGAGTTGATTA 

TTATGAATTAGTTTATGGGAGTCTCMTATGTGAATATGATGGAGACAAGTTTTGGAATA 

CAGATAAATCAAGTCACTGTATTCACTCTCTCTCTCTCTCTTTGAATAGCCTTATCTTTG 

CCTATACACACAAACAGTGCAGCCATCMAATTTTCAATTTACAAAATGTTCACAGTCAT 

GCTTCTTCCTTGACTAAACACTGGGGTTGCTGCCAGTGGTAATTGGCTTGAAACCAGCTA 

A7TTTTATATATCTATTTAGTCTGGATATTCTAGATGAGTGGGCACTATAGT 

Sequence 709 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCCCACTCCTT 

GACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCACTACTTGGG 

AGGGCTAMGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAATTCAAGTGT 

TGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGAACATGTGTTTAT 

CTGTGGGAGATMACAAGATGGAGTCCCAAAGACTTTAACAAAATATTTTTTTAAAAATC 

CACTAGAATAGAAMTACATTATTTAGATATACTTTATGCTGAGAGTGAGTATATATGCT 

TGTCCTATTTAMCTTGTGAGMAAAGTGGTATCCCTTGATACATTTAGAAATATGGGGG 

CTATCTTGGTTCAJTGNGGGGGGTGGGGGCAGAAGGAGAATAAATGCAGGATGCCCTTGT 

TGAAAGGAATCTTAGCATGGCCCACAGGGGACGTTTCCAGTCGATTACCAAGGAATGCCA 

GCCT 

Sequence 71 0 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACCCACGTTTTGCTCC 

ACACTCCTTGACCGCAGGGGCTCGGACACAAACCCCTGTCACCAGGAGAGTCAGTCAGCA 

CTACTTGGGAGGGCTAAAGGGAMTTTGGAAATAAAATTCCAAAGTTTGGAGTAAAAAAA 

TTCMGTGTTGATTTTATATTCTTTCCCTTTCTGACACAGCCTAAAGCGTAGGGGGAACA 

TGTGTTTATCTGTGGGAGATMACMGATGGAGTCCCMAGACTTTMCAAMTATTTn" 

TTAAAAATCCACTAGAATAGAAAATACATTATTTAGATATACTTTATGCTGAGAGTGAGT 

ATATATGCTTGTCCTATTTAMCTTGTGAGAAAAAGTGGTATCCCTTGATACATTTAGAA 

ATATGGGGGCTATCTTGTTTCATTGTGGGGGTGGGGCAGAAGG 

Sequence 71 1 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGACGGTGAGCCGGAG 

GAGTCATGTCAGAGGGGCGAGCAGGAGCGATTCCGTCGCCAAACAGGTTATGAGTGCCAG 

TGAGCCGCCTTAGATAGAAGCATCGTCAGCACTTTATTAATGATGGATAGNGAGAATAAA 

CCCGAAAATGACGAGGATGAAAAGATAAACAAAGAAGCACAAGACTTGACAAAGCTTTCA 

TCCCATAATGAAGACGGTGGGCCTGTATCTGATGTGATAGCAAGTTTCCCTGAGAATTCT 

ATGGGCMAAGAGGTTTTTCAGAATCATCGAACTCTGATAGTGTTGTTATAGGAGAAGA 

Sequence 71 2 

NCCCGCNGTGGCGGCCGAGGTACTCTTATGAGAGGAACATTAAAATTTGCAATTATAATG 
CAAAGAAACAGGAGACGATCGTGAGAATAAGCAATGTCACACACATTTCTCTCCAAACTA 
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TATGTATCTTGTCCTTTAAATTCTTTGACGTGTGTGTGGGGCCTGGGTGGGGGGTGTGGG 
TGTGTGGTGTTCTMGGGCCTTTTCTACTAATTCCAATAACTGGGTTATCTGTGAGGCTG 
CTTATTCTCACTGTATT7TACTGCTTCTTTGCCTTTCTTG I I I 1 1 I 1 1 I I I AACCATACT 
CAGGTATGGTTAAAATGTAAATGAAAGAA 
Sequence 713 

CCGCGGTGGCGGCCGCCCGAGCAGGTACTGGGGCTGCACAGGCTGTGGCAACACTGGCTA 

GTCAMGCCTGGAMGACCTGTGAGCTTGMCTCTTCTGGGTTGCCATTGTTCTGTTGGT 

TTCAGCAGTGAGACTGAGAGAGCCTGTTCTGTTTAGAAMGCCACAGTGGTTTTCTAGGT 

AAAGTCTGCAGGAGATGTCACTTGGTGCCTmCMTACGAGTmCCACCTGCATT^ 

GGAACCATTATGGGCCTTTTAAAAATTTATTAATAAGTCTCTTAAATATTTTATAATCTA 

GCTTCTGAGACAAGATGATTTTAAAACAGTTATATGCTCTAAATTAAAAATTTA 

Sequence 714 

AGGTACCGTGTGAGCAGGTGGCGTTCACCAGGGGTGAGACTTTATTGACAGTAAGTTGCC 

T CTGCCA AAAAACGCCCTCATATGTCTGCTGATGTTTGAATTNCNNCCNNATGGCAGGAG 

GTTTTTTGGTCCTCCCCAGNTTTAAAMAAMTTGGTTAAMATMCCTGGGTTTGGTTN 

TTTCTTTTGNTATGGGGAAGGCCCTTCCAAGNAAGNGGAAAATAAANAAAAAAGTAATTA 

NANCCTTMCCTTCCTTAAGGGGGAATTMTTGGTMTTTCAAAATTTTTTGGAATGGCC 

TTANCTTTTNTTAA I I I I I I I I AMTTTTTTAAATTTTGGGANGGAAACCAAGGGGGGGG 

TTTCCTTNTGGGCCTTTCTTGGTTTGGCCCCCAGGGGGCNTGGGGNANGGTTGGCCAAGG 

NCCAATNTTTNNGCCAAMTTTCCAMCCMGGGCCCTTTCNAACCTTTGGGCCAAGNTC 

CCCCTTTCAAAMCCCCCTTTCNNCCTTGNtTTGGGCCCCCCAAAA 

Sequence 715 

CCGCGGTGGCGGCCGAGGTACCGTTTTATGATGATAACATAACTTTAATGCTCCAACCTG 

AGAAAGATAAAATAGACTAAGATGACCATTGAATGCAAACAGAAAGTTCTAAATGAACAA 

TCAAGNCAGGACCTGGAAATTTCAGGTCCCTGGTGGTTGGAAAANTAAATTAAATTAAAA 

ACCAANTTTCTTGGTTTTTCCAAGGAMAANTGGNTAAAAAAMTTMGGTNTTTAAAAT 

MNCCCCAGGGAMAATTTCCAAATTCAMATTATTAAAAAGGCCTTAAATTAAATTAAT 

ATTTTrTGGCATTTCNMGGCCCCAMCCTTAAAATTC 

GGGTAATTAAACCAGGGCCAAATTTNATTAAAAAAATTCC 

Sequence 716 

AGGTACACGATTATTTCACCATCCAGGTATTAAGCCTAGCACCCAAGAG I ITTTTTTTT G 

CTTCTCTCCTTCCTCCCTCCACCCTCAAGTAAATCCCAGTGTCTGTTGCCNCCCNTNCTT 

CGGTANNAACCAAGGTGG I I I I I I I I I AAATTTTTCCACCAAAAAAAAATCTTCNCATNC 

MCCTTCCTTCTTTTCAATTTTTTGGCCTTNNCCTTAAAACCGNAACCTGGAAAAACCCT 

TNCTGGGTTNTGGCTTCCCCCAGTAAAAGNACCAMGGAAANTTTTAAANTTTCAATTTT 

TTTAATTCCCCGGGTTTTTTAAAMAGGGTTNAAAAGGGAMGGNTTTCCTTAACCCCCA 

A GTGCCC MGGGGAMGNTTGGGGTTGGGGGGCCTTCCAACCNAATTNCCTTAANTTAAA 

MTTTTTGCCCCNMGGGCCAAACCCTTTNNTTTGGGAAGNGAAAGGGGGGCCCCCCGGA 

MGGGGGCCCCGNMGGGTTTGGGGGAAANTTCCAAACCNMTTTGNTAMGGGGGTTTT 

TGGG GGG AAAAAG 

Sequence 717 

CCGCGGTGGCGGCCGAGGTACTACAATAAGGACAAATATTCAAAACATTCTGTTAAGTAA 

AATAAGACAGTCAAAAAGGAAAGCTGTATAATTACACTCATGTAAAAATATTTAGTCCAA 

CNCTCACAGGANAACCAAAGGTGGTCAATAGGTTCCTCAAGCCAGGTGGCCACCCCAAAG . 

GAATGGTTAMCCMGGTTCTTCCTTCNGTTAAGGTTCCTGGAAGGAATTAAAACCAATT 

CCCCAAGGAGGTTTNCTTTTTGGTTTTCCTMCCCTTCTTAMGGGGAGGMTTTTAM 

GGGAGGTGGTTAAAAANCAACCAAAAMGGGTTTGNAMGGGNTTTTGGGGGAAGGNTTT 

GGGAAAAAAANGNTTTTTAAAGGNA 

Sequence 718 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTT 
GGATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGTGATGA 
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CTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATT 

GAAATATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGGTGTGGTGTAAACACC 

TACATTTTAATGTAAACATTTCAGTGTMTCMTGAGMCTATCATTCTGCTTTAATCAC 

CATGAGTTCTGAAATMCAAAGGATTrGTCTGACATTCATTCTMGAMTTCATTCTTAC 

CTGACTMGAMCTTTTTTAACCCGGCACAATAATAAAGAAATGACCTGTNAGTACCTGC 

CCGGGCGGCCGCTCTAGAACTAG 

Sequence 719 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGACTGTCTCAAATTCTTGG 

ATAGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGATCCTAACAGGNGATGACT 

CAGACCGACNCTGCATTGGTAGGAATTCCACAAATAGGTGCCTCAATGTGCCTAGATTGA 

AA TATC AGCCTTTCCCANACTGACCTGATGGGNTGACTTCAGGTGTGGNGTAAACACCTA 

CATTTTMTGTAAACATTTCAGNGNMTCAATGAGMCTATCATTCTGCTTTAATCACCA 

TGAGTTCTGA MTMCA AANGATTTGTCTGACATTCATTCTMGAAATTCATTCTTACCT 

GACTAAGAAACTTTTTTAACCCGGNACAATAATAANAAATGACCTGTAAGTACCTGCCCG 

Sequence 720 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGA 

Sequence 721 

CCGGGCAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCTGTGCTCCAGGTGTTCACA 

GNTGCTTNGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTGCGCAAATGTTGCC 

TGTGCCCAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAAAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACTGCATTACACCACTCAGTATATGTGAGGGAGGGGATGTGCCTCTGGCCACGTGGTTAC 

CTTGCAGTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAA 

CATGTCCTTGCTCTCCAATGGCCCTGGCAGGCCAACCTTTAGTTTCAGGGCTACCACCTG 

TGCGGGGGCTTNTGTCATTACCCCCTGTGGATATTAATGCTGCACACTGNGGTTATGACT 

TGTACCTTCGGCCGTT 

Sequence 722 

GGAGAGGAAATGTGTAGGGGTGAGGGATGATACAAGAAAGCCAAATCCTCATCTTCTATA 

GTAGAGAGTCAGCGGATAAAACCTAAAAACAATACATCAAGAAATACTTACACTTATGGA 

AGGAAATACCAGAAGTTAAAAGGGGTTACTTCTGGGACATCAGACACCAGACTGCAGGGA 

AGGGC TGCC TCTTGTATTAACAAGCTTCCAGTATAATTTGC I I I I I AAAAATAGGTCCAT 

GCATTATTTTAATAAAAATTANGCTGGGCGTGGTGGCTCAGGCCTGTAATCCCANCACTT 

TGGGAG 

Sequence 723 

ATTGGAGqTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTAAGCAAGCTGTGTGACCTA 
GAGCACAGTGCTTTGAGTTTTTGAGTCTTAGCTTCTGTGTCTATAAMTGGGGTTCACAC 
AACTCACCTTACAGGGCTGTAAGATTAGATTACACAGAAAATATAI I I I I I GGCTGTGGG 
GGCTGGAAGTGTTGCTGATTAGCATTTGAAATCCCATCCTGTGGGTGAGAAAACCCCACC 
TTATGACTTGGTGGGAAACAAAGCCAACCTCCCACTGATGAAGCTGAAAGTAGCAGAACC 
TTGCTTCTACTGCCTCCCTTGCAGCTAGAGGCAGGCACAGGACTAGCCTGTCAATTGGAT 
GCAAATGCTCCAGGCCTTGAATCACAACCTGGTGACTTGCACCCAAGTCTATTA 
Sequence 724 

GGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCTGAGACTTCTGGAGAG 

ACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAACAATGAGAAGGAGAGA 

ATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACCGACTTNATCATGCTGAG 
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TTGTGTGCAGCTGCAGCGGATTCCTCTGAGACGCTTGTCTATCGCATCTGCCNGGGCAAG 
TNTCACCTTCCCTGGGATGTNCCCTTGGACAAGANACAAGGGAGGAANGGCCTTAANGAT 
CCTANCTGGGGGGGGA 
Sequence 725 

TAGGGNGAATTGGAGCTCCCCGCGGTGGCGGCGCCCGGGCCGGTACCCATAAAAATTAAA 

AACTATTTTAAAAAATAAATTCCATTTGAGCCACTCCTTCAAACCACCCAGAGTGGGTAG 

ACGTCTTTCGTGCCTCTAAGAAGCCCCATCTCTA7TCTGCGTCTCACCTTGCAGGGCTGC 

TCATCTGAATCCTGAAGATGGTGGACACCCATCTGCTAGGACTGAAATGAATAGGACAGA 

GGGAGGTGCAGAGTGAATGGACCATACTACCTGTCATCTTGGCAACGTGTGATTGAATAA 

AACAACTTCTTTAGMGTTTGATAGAGTGATTTGATAATGTAATTTACAAGTGATCATTT 

CI I 1 1 IA 

Sequence 726 

GGAGCTCCCCGCGGTGGCTTTTTGAGTCTGGACAGGNCTCTGI 1 1 1 1 GCTTTAAAGTTAA 

GAGAGCTAAATAAATGATGGTAAAAAGATAATAAAATAGAACATGAAGGGCTGTCAGTCA 

GTGTAGGTATTTCCATCCCCTCACTTTTCAAGTGAGGTCACGGAGGCTCAGAGCGATAAG 

GAGACTTGTCCAAGGCCACACACCGGCTGGTGCTCAAGCCGGGACTTGAACCCACGCAGT 

CTGACTCTAGAGCCCAAGCTCCTAACTATGACATCCTATTTGATACACTGTTTTACTGGA 

GAAACAGATCATTTGACAGACATTCTTTCTGTTAGCAATTTGACAACTCTTTCCCCAGTT 

GTCTGTACCTGCCCG 

Sequence 727 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCTA 

TTCTCTTACAGCTAGGACCCAGCGTTCTGGGCAGAATAGCAGGCAGCGCAGTAGCTACTG 

AGATTATAAGTGGTAGGTTTATGGAGCTGTGACCACAACTCTCCACAAGCCAGTGCTGTC 

TCATGCAAGCACTCTCAGTGTCCAAGTGCTGAGTGTGTGAGTGGTCTGGGCTTTGCAGGG 

TCGGCCAAGCTCTTGGAAGAGCAGGCTTTGTTAGCTGGGGAGTCATCGCTCCATGCAGGC 

CCTGAGAATGGAGCATCCTGAGTGGACTGGTAGAGATGGGGCATGGGTCACTCTGAGGGT 

TTGAGCTACTTCTGCTATTTTTGAAATTCTGGTTTGAACTGCAGGATCGTGCTGAGTTTG 

GCACAGACTAATTTCTCTGTTGGCAGCACATGATGTATCAACTCATGTGTCAGTTGGTTT 

G 

Sequence 728 

CCGGGCAGGTACTACCTTCTCTGCTACAAGTCGAGCGAGGAGCCCCGCATGAGCCCTGAC 

ACCTGTGCCACCATTTTGGAAAAAGCTGGTCTCGATAACTGGGCTCTTGGAAAAACAAAA 

GTGTTCCTTAAGTATTATCACGTGGAGCAGTTAAATCTAATGCGAAAGGAAGCTATTGAC 

MGCTTATTTTGATTCAAGCTTGTGTCAANAGCATTCTTGTGTTCAAGGAAGGATACCAA 

AAAATACAGGGAGGAAAAGGGAAAGGAAAGCCGCTTATAATAATACCAGTCAGCTTGCAA 

GGAGGGACCACCCTTGTCAGGGAAAACAANAAGAAAAGGAAAATTTGGTTTGGACCATTG 

AAAAAACCCCAGCANTTMCCAACCCAATTTCAAAAACTTTCTTGATTCAGGGAAATTTC 

GACTTACCAAGAAAAAACCTTTGGNAAAAATACCCANGGGGGGNTCCTGGGNAAAGGGNA 

GGANGGGAGCCCAAAAAAANAATTTGGANAACCCCCCNANGACCGACCCCCCNGGAAACC 

CCCA 

Sequence 729 

CCGCCCGGGCAGGTACTTTC I I I I I I I 1 1 1 I I I II I 1 1 1 I I I ACGGAGTCTTGCTCTGTC 

ACCCAGGCTGGAGTGGAATGGTGTGATCTCGGCTCACTGTAACCTTCGCCTCCCAGGTTC 

ACGTGATTCTCCTGCCTCAGCCTCCGGAGTAGCTGGGATTACAGGTGCACACCACCATGC 

CTGGCTAA 1 1 1 I I IGTATTTTTAGTAGAGACGGGGTTTCACGATGTTGGCCAGGCTGGTC 

TTGAACTCCTGACCTCAAGTGATCTACCCACCTTGGCCTCCCAAAGTGCTGGGATTATAG 

GCATGAGCCACCACGCCAGGCCCACTCTNTAAATnTGACCACCCTGCCTTGAGTGGTCT 

TCTAGCACCCTAACCTCTGTCTAACCTTCGAGAGCTTTGCACTAGCNATTCCTGGGGACC 

AGCTATGGTTGGTATCTTCTCMCTTTCTAATTTTTTTAAAAATATTATTATTATTATTA 

TTATTTTAAAA 

Sequence 730 
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TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGCAGGTACCTGTGTGGAA 

MGAATGCTTGCAMGCTTGTCACCCTCACGAGAATTCCTCTGACAGACATTTGCCTTTG 

ACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGAAACCGAAGAGCATCAGAGGGGACGA 

CTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGAGTTGGTTACCAGAGAAACTGGCGAT 

GGCGATCCCGTGAGCAACATCTCTCAGACCCATTTTAAATGCCGGGGGATACTTAATCAT 

GCTGAAAAACAGCAGAGCCCTGAGGTTTTGGACTACATGTTGCAGAAAGAA 

Sequence 731 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACTACACCTGGGACAAA 

TACTTTTTTGTGMGNCAGGTAAAGCCTTTGCGTGCAATATAGCATCTCTATGCAATGCA 

NCAACTCCTCGTCTATCGCTACAGTAAGAAAACAGCCACGGGTCAGGTGTTGNGGCTCAC 

ACCTGTMTCCCAGCACTTTGGGAGGTCNAGGTGGGTGGATCACTTGAGGCTAGGATTTT 

GAGACCAGCCTGACCAACATGGAGAAACCCCATCTNTACTGAAAATACAAAATTCCCGGG 

TGTGGTGGCNGCATGCCTGTAATCTCAGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTT 

GAATCTGGNAGGCGGNCGTTTGNNGGTGAGCCAAAAATCGTGCCATTGCACTCCAGCCTG 

GGCAACAAGAGCGAACTTTCGTTTCAAAAAA 

Sequence 732 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTAGTTATTT 

TAAATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGT 

GGAGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAAT 

GACTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGC 

TCATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGA 

GTTGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAA 

AATACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAG 

AATCACCCTTGAACCCAGGAGGCAGAGGTTTGCAGTGACCCGATATTATGTCACTGCAAG 

TNCAGCCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAAAAAGAAAA 

Sequence 733 

CGACCACTATAGGGCGAATNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGC 
TGGTTCAAGCCATTAMTTACATCACMAGGTTTGGTTTCTCTGTATATATTTCTCTGGG 
GCACTTTTGCTANGTTGGCTCTATCCTGAGGCAGNCTCTCTCCTCGTGGNAACCAGGTGG 
CTCTAGCAGCCTCAGCTTTATATCTCTCAAGAGTAAGTCCACCGTCACAGAGC 
Sequence 734 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACCACCACA 

CCTGGCTAATTAATTTAAAAAAAA I I I I I I I I AGAGATGGGATATCTNTATGTTGCCCAG 

ACTGGTCTCAAACTCCTGGCCTCCAGCAATCCTCCCACCTCACCTCCCAAAGCCCTGGGA 

CTGCMGCATGAGCCACCATGCCCAGCTATATTTTCTGTAAATTGCTAATGANAATGAAA 

CATGTATGCTGTGGACAGAAGCCTTGTTGGACCTAGAGCCCATGCTGGGTCCTTTGCCTT 

AATMMCATMCTCTGGCATTACATATATMTTMCAGCCTCAMGANCATGTTTCTTT 

ATTAMCTCTGACTGTTTCAGCATTATTTT 

Sequence 735 

GCGAATTGGAGCTCCCCGCGGGGGCGGCCGCCCGGGCAGGTACTACTGTGTCCTTTAGAT 

CACTCTGCCTTGATCACTCTGTCCCGTCACTCTGCTATTTCACCTGNCAGNGAAATACCT 

GGTATCGTCCTGCCMCGTGMGCATTGAATGCTTNATACGTCTCCATCCTGATTGTTTA 

GGCTTTGAATGCTGAGAAGTATCTGCACTTTGTTGGTCA 

Sequence 736 

CCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAAGTACCCTTAAGCCCTGCTCCT 

TTGTAMGTCTTTTTGGATTGTCATCATCAAGAGTCAGTNGATCTCCANCTTCTCAGAAC 

TCACAGGGCACTCTGTCTAGGCATTGCTGACCGTCTGCAGTGTCAGATGGTGACTTCTGT 

ATGTGTTGTGTTTCCCGTTAGACTCTAAGGTTTTTAAAGGCGAGACTCACTCCTGCAGAA 

GCACATAACACMTGCCAMCTCTTATTTACGGAGGTCCTGGCGCATTGTCAGCTTTTGG 

TAAATGCTTTTCTTTTGTTGMTACTTATCTTCTGTGTGCCAAGATTTGTGTTAAGTGCT 

AGAAAAATGTGGGAGGTCACCGCAGACCCTGTTCTCATGGAACTGATGGTGTGTAGTGGG 
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GTGNGGATTAACATAAATAAAATGATGCGCAAATGAACACAAAATTCAAATTGATGATGT 

GTACCTGCCC 

Sequence 737 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTGAGCAGTCGAAG 

TGGATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAANGATCCCAATGACGATGATT 

GGAAAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGT 

GCMCAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCT 

TCMCAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGACCTCTGACATCCGGC 

TCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCCAAAAATGTAGATGGCACCACGGAA 

GAGATAGTAGGCCACAGTGTTACTGGCTTCCCATAAACACAGCCCTTTCCTGGCTCACAC 

GGGGCATGACCTCCCGCGTACCT 

Sequence 738 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTAGTTGGATGTCGAGGTTNGATTAGATTCT 
GGGGTTGGTTTGCTTGTTTTGGTGGATNGTTTOTGAGT^ 

TCCACCAGAAGGCACATGTGCTTGCCTGTGTC 1 WW 1 GTTATTGTTTTGAGGCAGAGCC 
TCNCTCTGTCTTCCAGGCTGGAATGTAGTGGCACAATCTTGGCTCACTGCAACCTCCACC 
TCCCAGGTTCAAGTGATTCTCCTGCCTCAGCCTNCCAAGTAGCTGGGATTACAGGTGTGT 
GCCACCATGCCCAGCTAATTTTTGTATTTTCAGTANANATNGGGI I I I I IGCC 
Sequence 739 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACTTCCACGA 

GMGAATTMTATTGTAGTGTTAGGAMACfAGCAATTTAACTAAACAGCATCAAGTTAC 

MACCAGGAAAGTGATTTAAAACTAAATGCTGGCTTATCTTTCTGAAACAAAGCATCTAA 

ATTTGACAGTCCAAAATGGCACTTATTGAGTGTCCGTGACAATACATGCTGACAAGCAGC 

ACACCTCI I I I I IGI I 1 1 I I I AAGACGGCATCTTGTGCTGTCACCCANGCTG 

Sequence 740 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATATAAAAATTGATTTAGCTTCTACACTCA 

AGTAATTATAAACAGGTTTNTCTTTTGGGACATTTGACAGTTATGTGAAAGGTGAGTCTT 

CGTTGTGTAGTATTGTCTGTTACACTGCAGGTGTCTAGAATTGCTGATAGTGTTCTCCCT 

CTAAAGTAATGTCACCCAACCACTTGTAAATTGACGATAATAAGACAGGAAATCAAGAAC 

CAATATAAATAAGCAMCATTTGAAAATMGAGCTAAAAATCAAAMTAATCTCTCTTTT 

TGCTGATAATACTTTATACCTAAATAACCTAAG A 1 I 11 I I 1 1 11 I I 1 1 1 1 IGAGACAGAG 

TCTTGGCTCTGTCACCCAGGCTGGAGTGCAGTGGTGCAATCCCGGCTCACTGGAACCTCC 

GCCTCCTGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCTGAGAAGCTGGGATTACAGGCA 

TGCCCACCGTGCCTGGCTATT 

Sequence 741 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACAGTATAAATCATGCT 

CCTGCTGTCTAGAGCTTACCACCCAACGAGGGCTTCAGATAAGATCAGCAACTGCCCTAG 

AGTGTGGAACTCCTATGACAAGGTGAGCCTGGGGTGTGATGGAAACACGGCGTGCATGGT 

TACCAAGCCACGCTTCCAGGGAAAGGGGTCCGTGCGGGAAAAACTTCAGAGAGGAAATGA 

CATGTCAGTCAATAACCTGAAAGAACTGGNTGAGAGTTAAGCANCAGGGAACAAGGGCAC 

AGTNNTCCACACAGCTTTTTGGAAAGATCATGTTGNTTATAGTGCAAAAAAATACTGAAT 

ATGGGAMCAATTTGTTATTATTTTTTAGGAGTNTTGC7TTGTCCCCCAGGCTGGAGTGC 

A 

Sequence 742 

ACTACTATTGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGGGGGAAAA 

AGAGTTACAATTTGCCCATAAAGAATGGAGAGAACAGAAATGTANCTTTTATGCTGAAAA 

ACAAAATGCAAGGGCAATCCAGTTTCTAATTCCTGTGCCAAAGCTGCTGTTCTTGATGAC 

CTCGGTCAAATCATTTAAATTCTCTCAATTTGTTCATATAAAAGTGCTATTAACCTGCAG 

TTCCTTCAMTACTATCCMTCAATGTTGGCTACTTGATTTTCA 

Sequence 743 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCACAGGGTATGCATAAA 
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CCAMTATACAAGMTTTAATGACAGCATTATTTGCAAACAGTGAAAGATTCTGAGCAAT 

CAAAAGGTTCATTACTACAGGCATAGGTCTATAAATTACTACTGTATGGAATACTATGCA 

GCCATTAAAATAATGAGGAAGAGGGAGAGTGCCCTGTATGCACTGACATGGAAAGGTTTC 

AGTATATGGTAAAAAGCAGCTCATCTTAGAACAGATTTTATAGTATGACCCCACTTGTGT 

GGAMTATT7TGTGATGTGCCACTCAGTGTATACGTATTCATGAGTGCATATACAAGTGT 

GTGAGMGCMTGTAAGTAACTGTTTCACAAGGACCCCCCCTTTAAGAAGGCAGAGGGGA 

TCGGGGGATATAGAGTGMGGGATGATTTTTGCTTTTTCCTCTA 

Sequence 744 

CCGCGGTGGCGGCCGAGGTACGCGGGAGGAAAGGGCTGTGTTTATGGGAAGCCAGTAACA 
CTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACA I I I I I GGGACTCGGGAATTATG AG 
GTAGAGGTGGAGGCGGAGCCGGATGTNAGAGGTCCTGAAATAGTCACCATGGGGGAAAAT 
GATCCGCCTGCTGTTGMGCCCCCTTCTCATTCCCGATCGCTTTTTGGCCTTGATGATTT 
GAAA 

Sequence 745 

CCGGGCAGGTACACAGTAAGTGAAGGGCCAAGACTGACGGCTGATAGGACAGGGGTGACC 

AGNGGTGGGGAGGGTAGTGGGAGCAGTCCATCCTGGAATCTGGCATTCAAGGGGCGCATT 

GTCTGTGGGAGGATTTAAAAATMTAAAACCAACTAAAGGCAGTCTGCTTTTTATGGTCA 

CCAGGCCGCCAGCAATTCTAAATTTCAGTGATAAAATATTCCTCCTCACTGGACACGAGA 

AGCTGGCTTTCTCCTTATTCCCCAGTACCTTNGGCCCGCTTCTAGAAACTAGGTGGGATC 

CCCCCCGGGGCTGCAGGGAATTTCCGATATTCAAAGCCTTATCCGAATACCCGTCGACCC 

TTTNGANGGGGGGG 

Sequence 746 

CGGTAATACTNGTTATCCACAGCAATCAGGGGGGATTAATCGCAGCNAAAGAACATTGTT 

NAGCAAAAAGGGCCAGTCAACAAGGGCCAGGAACTCGTAAATAAAGGCCCGCGGTTGCTN 

GGNCGTTTTNTCCCATAAGGGCTTCCGCCCCCCCTTGGACGAGGCNTCACCAAAAAATTC 

GACCGCTCAAAGTCANGAAGGTGGCGGAAACNCCGACAGGGACCTATNAAAGGATACCAA 

GGCCGTTTTCCCCCTTGGGAAGGCTCCCCTNNTTGCCGCTCTCCTGTTCCNGACCCTTGC 

TCGCTTACCGGATACCTGTCCCGCCTTTTCTCCCTTTCGG 

Sequence 747 

CCGCGGNGGCGGCCGAGGTACATCTTTGGTGACTTTTCATTCACATTTCATGGATMTTT 

GGGGAGGTGGCCTGCCANCCCTGAAGCCCTACATCCCCATACACACTCTGTGCACATCCA 

GTGCCCTGCTCCACCATGGCAGTGCCCGCAAGGGGGTCCCAGATGAGAAGAAGCTGGCTA 

AAGGGCCCTTGTCCCCTCTCAGACTCCTTCAGCGGGCTGGAGTCCTCCCTCGCTCGATTT 

CGCCCGAGAGCGTTAGGGGTTTCTAMTGCAGGCGCCTTTGTGTTGTMCGMACTTTTA 

GTTTAAGGGAAAATCTCTTTTAAGCCACTGATTGTTCTGACTTGCTGAGTTTACTCAGCA 

GCCTTATGCTGGCTCTGCCACTGCACAATAAAACCAAAGCANGACAGTTGCAGNTNAAGC 

AAGGGGGAACATGTTTTGCATTT 

Sequence 748 

GCCCGGGCATGGTACCTGTGTGGAAAAGAATGCTTGCAAAGCTTGTCACCCTCACGAGAA 

TTCCTGTGACAGACATTTGCCTTTGACAGTGAAAACAGATATTAAAGTGAAAGGAGAAGA 

AACCGAAGAGCATCAGAGGGGACGACTGGGTTACTTAACTGTTGGGGAGCAATCTGAGGA 

GTTGGTTACCAGAGAAACTGGCGATGGCGATCCCGTGAGCAACATCTCTCAGACCCATTT 

TMMTGCCGGGGGATACTTAATCATGCTGAAAMCAGCAGAGCCCTTGAGGTTTTTGGA 

CTACATGTTGCAGAAAGAAAGAAGNAATTTNTACCTTNNCCNAAAAATAAAAATATNNNA 

NNNNGGTACCTCGGGCCCGGTTTTNAACTAGTGGGATTNCCCCCGGGCTTGAAGGAATTC 

GNTNTTCAAAGCCTTNTTCGATCCCCGTCCNANCCTCNANGGGGGGGGGC 

Sequence 749 

AGGTACI I I M I I I 1 1 M I I M I I I I IGGNCTMCTGNNNGGAGTATTTCTtTTACCCAA 
GATAAGTAAAAGCTACAACTCTTAGTATAAATATGNGTCCAAGTGCCTNATAACTGCTAA 
CCACAGGGATCCTGAGCTCTNATAGCTTAAACACACAGNGTNNATTTTACTGGTCTACTT 
CTCCTGNAGACCTAAAAGGGCCTATAGCCTCAGTAGTTGACAAAACAACATATTAAAATT 
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CCTCACTGATCACTAACATAACCTAAAATCCCTGCTTTTGACATTAGCATGGNANACATC 
* CTTAGCAGGCCTAAATAGAATGGCCTTATAAGTGGATCCAAAGGGC 
Sequence 750 

AGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGT 

CAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTG 

GTCACTGTCMGGCATTGTTCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTT 

CTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGAC 

ATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAG 

CACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCA 

GGCACCACCAATTA 

Sequence 751 

GTGTCCGGAATCCTACCCGGTGTGNNGACAGTGCCTGATAGTTTCTTCTGCCTTTCTATC 
CCAAAACGATTGGTCAGTTTACCCMGTTTGCAATGCAGTTTANAATCTCCCAGGAACAT 
CTCTTCTAGTAGTTGCCTTAGCCATCTTGTAGTTGATTTGA Ci I I M 1 1 I I I I I IGCTNN 
CAGAAAGCTCTATGCTTCATATGGACTTGCATACCAA I I I I 1 1 GTTCATCTGTTGGTCAT 
GATGGTTAGCAGAGCCTGACCTCCTGTTACAATAGAATGATCGGTTCCTGGGCTACAGAC 
TTGAGTCTG 1 1 I 1 1 I I IG I I I I I I AAACCTTCCCATGNGGCAATTTGCCATATGCAAAAC 
T 

Sequence 752 

CCGGGCAGGTACGCGGGTGAAAATGGMTAGTTTTCTAATTACAGAAAGAAAAGAAGTTG 

AAGTGGGTTTCGCCATGTTGAGCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCT 

CGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACGCCTGGCCAAAAAT 

CTTATAMTMTCCCCTTCTMTTTCGGCCAGCTTMTCACACACCAAATTCCTTTCATG 

AGATTMTCTTCCACAACTTCTACACTTCCTTAMTCTTTGATTTTGTCCTATACTTCTT 

TTmATATTAGCAATCTACTTTAGGACAGAMTTTACTTTCCTTTC 

CCAAAGTCCTCTCTTAT 

Sequence 753 

TAGTTTTCTMTTACAGAAAGAAAAGAAGTTGMGTGGGTTTCGCCATGTTGAGCAGGCT 

GGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGGGATT 

ACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATTTCGG 

CCAGCTTMTCACACACCAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTACACTT 

CCTTAMTCTTTG ATTTTGTCCTATACTTC I I I I I I I ATATTAGCAATCTACTTTAGGAC 

AGAAATTTACTTTCCTTTCCTCTTGATTTTGACCAAAG 

Sequence 754 

CCGGGCAGGTACCTATATGATGTTGGCCATGCTCACTCACTCCTCCAACCCTCAGTTTAC 

ACATCTGCMAATGAGATACTTCTTTTCCAGTGTTGCTGTGGACATTAGCAGGCACACAC 

ATTTGGTGCTTGCACAAATGAGGTCCTAAGAGGTGGGTCCCTCTCATCTTACGTGAGGAA 

ACTGAAGCAGATTAGAAATGACCCAAGGAAACCACTCCGAGTTCAGTCTGGAGCCCACTC 

CCCTAGGTTTTAATCATCCCCCAACTCAGTCCCTATCTGCTGAGGTTCCTGGATCCAGAC 

GGTTCTTACCAAGGAACTGTCTGTCCTCACCACATGGATGGTTTTCCTGGCAGAGGTGTG 

CCCTGTGAGGGGGTCA 

Sequence 755 

GCCGAGGTACANACAAGGGGGCNACTGNCATGGGGGNGGNNTCTGGTCTTGTAGTCNGTT 

TGGAATTTTCTAAGTCAGGGTGGGGTGGGGGGACTGTGCACGGGTCATGTGCAGACTGGA 

ACCCATCTCCCCCTCGGTCTGCAAGTTAAAACAATTGGGTTGTCCTTCTCAGCATCTGCC 

AATGTCTCTTANTCAATCTTGGATGAAAAGGGCGTTGGAGGAGGAGGCTGGGAGGGAAAT 

CCAGACAGTTCTCCGCCTCTGACATCAGGTCCAGCTGTTAGCATCGTGCTGTGGGTCCCT 

GAACAAGAAGCAAAGTCAGGACT 

Sequence 756 

AGGTACCGCTGTGTCCGGGTGGGTGGNGNGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 
TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 
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CAACTGGGTTTCCCAAGCTATGTAAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 
CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 
TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 
TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGT 
Sequence 757 

AGGTACCTCTGGATATGTTACACTGAGANAGATACTTCATCACTTACATGATATTGCTCC 

CCCACAAATTTCATAACCTGAATCTAGTTATAAGGAAATACTATGCAACCCAAATTGAGG 

GACATTCTGCAAAACAACTACCTGTAATC I I I I I I I I I I I I 1 1 I I I GAGACGGAGTCTCA 

CTCTGTCGTCAGGCTGGAGTGCAGTGGCGCGATCTCAGCTCACTGCAACTTCTGCCCCCG 

GGGTGCGAGCGATTCTTCTGCCTCAGCCTCCTGAGTAGCTGGGACTACAGGCACACGCCA 

CCACGCCCAGCTMTTTTTGTTGGGGTTTCACCATGTTGGCCAGGATGGTCTCCATCTCT 

TGACC 

Sequence 758 

CCGGGCAGGTACTATGGTCCCCGGCAACCTCCCCTTCCTCCTGGGAATGCTCAAATGGGA 
AGGCA GCATG AAACGGTGAACAGGCAATCACTGGACAAAGTCACAAGAACTGGGCTTTAG 
AMTGGTTTTACCATTAGCAGTTGTGACACCTCAGAAGTGGCAACTCTGGATCTNAATAC 
CCTACCCTTNACCCTAAGNANAGGTACCTCCNCNATTTTNNCGGGGGGAAACNTTCTNGG 
GAANTTNCCCTTTCCNAAAAAGGGGGGGGGGGGCTC I 1 1 I I 1 1 I I I I I I I GGGGGGGGGG 
CCCCNNCNNCCCCCCCCN I I I 1 I I I I 1 INTNAAAGGGGGNTTNAANANANATTTCNTTNC 
TCNTTNTTTTNTNANGNNAAAAANTCNGGTNGNGGNG I 11 1 I I II I I I ANAAAAAAAAAA 

Sequence 759 

GGCGGCCGAGGTACAMGMAGGACTTGATAGCTATTACCTTGCTGCTATGTTTGTTNCT 

TNGNCTCACCAATCATTNTTNTGTATACCTAGCACTGCACCAGGCGCTGAGGTTAGAGAA 

ATAACTAAAACTGCGCCCTTCACCCCTGATGGCAGGATAGGCAAGGTTGGCACCATCGTC 

ACAGCAGGACCCTCATCGATGCCTTGGTGTGTGCCTGGCATGGNGTTTGCAGCAGTTTAT 

CACATNNAATCCTTACAGC 

Sequence 760 

AGGTACTCAGGCCTTACTGGGATTTCCTTTAAGACCTCTGGGAGGAAGTGTCAGTAGCTG 

GGCAGGCCTTCJTGGCAAGCATTCCTCCCTGGGTTGTGGCGGGGGCTCCCGGCCTGCTGT 

GTGGCAGCTGCAGGCTCCTGGGGACCTGAAGGAAAAGCTTAACCGTTCTCCCTTCCCTTG 

CTTGGCACTTAGAGCACTAGTTCCATTCCAGACATACCGATTATCTTGCCTACGTGGCAT 

AGAGGCCTAGGAGCCTCCCTGGGAGGMGAGGCAGGCCMGGTCTTGCCTGGCTGCTTTT 

AGGGGGAAAGATGTAGGGAGGAAGCTGCCTTATGCTTGGATCTGCAACGTTTGCCTGGAC 

CTGC GGAG CCTATTTTGGCCAGGGGGAGGGAGACAGAAATTANACCCNANGTATTNAGGT 

MTCCTTTTNTTGCCTTTGMCATTGCNCGGGNGTACTTTGNAAAANAAAAAAAAAAA 

Sequence 761 

CTAGGG CGAAT TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 

GGMTAGTTTTCTMTTACAGMAGAMAGAAGTTGGAGNGGGTTTCGCCATGTTGAGCA 

GGCTGGTCTCGAACTCCTGACCTCAGGTGATCAGCTCGCCTCAGCCTCCCAAAGTGCTGG 

GATTACAGGCATGAGCCACCACGCCTGGCCAAAAATCTTATAAATAATCCCCTTCTAATT 

TCGGCCAGCTTMTCACACACCAAATTCCTTTCATGAGATTAATCTTCCACAACTTCTAC 

ACTTCCTTAAATCTTTGATTTTGTCCTATACTTC I Mill I ATATTAGCAATCTACTTTA 

GGACAGAAATT TACTT TCCTTTCCTCTTGATTTTGACCAAAGTCCTCTCTTATGCAAAAT 

GAAAAATTACTCTTTTTTCMCTTTCTTTACCAAAAATACATCCTCATAAC I 1 1 1 1 I ICC 

ATCTCTCCTACTTACTGG 

Sequence 762 

ATAGGG CGAAT TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGTGAAAAT 
GGMTAGTTTTCTAATTACAGAAAGAAAAGAAGGGGGNGTGGGNTTNTGCCATGTTGAGC 
ANGCTGGNCTCGAACTCCTGACCTCAGGNGATCAGCTCGCCTNAGCCTCCNAAAGTGCTG 
GGATTACAGGCATGAGCCACCACGCCTGGCCAAAAATNTTATNAATAATCCCCTTCTAAT 
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TNCGGNCANCTTMTCACACACCAAATTCCTTTCATGAGATTAAACTNNCACAANTNCTA 
CACTTNCTTAAATNTTTGATNNTGNCCTATACTTN I I I I I I I ATATTNGCAATCTACTTT 
AGGACAGAMTTTACTTTCCTTTCCTCTTGN^^ 
TGNANAATNNCTNTTrTTTCAACTTTCTTTACCAAAA 
Sequence 763 

TTAGGGCGATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATGTAATGCTCCTGAACTGT 

ATGCTNGACACGGCTGTCTACNTAGGTTTTGTTCTGTGTATTTTATGACT AI I I 1 1 I I AA 

AAAGTAAACAAAAAAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAA 

AGCAGGTGAGTGGTTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCAGAATGGCC 

ATGGGCTTTGCCTTCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGT 

TCAACACTGCCAGTGTAGTTAATGTCACTGAATTATACACTTTAAATGGCTAACATGACC 

AATTTTATGTTATATATATTTTACTACCACAAAAAAACTAGCTGGCACCTAAAAACATTC 

CATT 

Sequence 764 

CCGCGGTGGCGGCCGAGGTACTTGGATGGGTTTTGTGTGTATGTTTGTGTGTGCACTNGC 
GTCCACCCTGTTGGGCTTAGTGMCTTTTTGATTCAGTGATTTAAAGTTTCTCATCAGAT 
TTGGAAAATTCTCAATTAC I I I I I I CTTTAAATATTTCTCTTGCCCTTCCCCTCTCTCTT 
CTTCCAGGATTCCAATTTCATCGATGTT 
Sequence 765 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTTCAAGCG 

ATT CTCAT GC CTCAG CCTCCCAAGTAGCTGGGACTAAAGGTATCCACCACCACGCCTGGC 

TAATTTTTGTATTTTTAGTAGAGATGGGGTTTCACCATGTTGATCAGGCTGGTCTCGAAC 

TCCGGCCTCAAGTGATCTGCTCGCCTTGGCCTCCCAAAAGTGCTGGGATTACAGGCATGA 

GCAGCTGTGCCCAGCTGGATAATTATTTAATAAATTGGGGAGCATAGGAAGCATAGTATT 

TGTGMGTGGGTAGGCAGGTGTGATGGGGGTAGGTGATGTTACATTTGGGGCATTTTGAA 

GTTG GTGGTTCTTCTGAGTTGAGCAGTCAGTCACTCTTCATn'GCTGCACCTTTATCTCA 

TTTTAGCCAACAGACATTGAATACCTACCAAGTCTTAGGTATTTGCA 

Sequence 766 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAAA 

GAAAACAAATACCAAGTATTTACAGATCCAGAGAAAGTTCACAAGAATGGGAGGATGCCA 

GTTCCAATGCTTTGTAAAGTCAAAAATAGCCACATTGCAAAACAAACAAAAAAAAACGAG 

AACGTTCCCGAGTGTGCCTCCAAAACATAAAGGAGAAAATCATACAGAAAAACCTCATGT 

AAGGGTTGGAACTTGAGCAACCAGCTATCCAAATACAGAGGGGAATCCTCGCTTAGCTAG 

GGCATGGCCTGAG 

Sequence 767 

AGGTACACACAGTGATTTGGGGTCCTTTTTCCTAMACAGCTTCTTTATCAGGACTTTGG 

AATTCTGGGTGAGATAGAAACACTGAAAACAGGGCGGAAG I I 1 1 1 1 CTTCTGGCTTCTTA 

GTCCATGGAGGGCTCAGCGTGGAGAGGATATGCCGTGGCATTCTCCCTGGGAGACCACAC 

ATGTTCCCGACAGCTCAGACCCCAGACCCGCACATGCTTCTTGACAGTTNAAACCCCAAA 

CCGNAGGNGCTCCCGACAGNTNAAACCCCANACCCCGCGTACCTGCCCG 

Sequence 768 

CCGCGGTGGCGGCCGAGGTACTTAATAATTCATAATTTAGCCATGATAGTATCTAAGCTC 

ACTTTCAGAATTATTGCATACATGCCTTAGGGMGMCCTATCCACTAATGCTTTTAATA 

ACTTACATAGATTGTGTTGCGGCMGTCMGTTTTAATATAGAGGAAAGGGTTTATCTTA 

TCATAGTAAAATAGTAGTGATGTGTTCATMTTTACTATTTGCATGGTATATTATCAAGG 

CTGTAMAGCTTGMTTTGCCTTTTCCACATCTTCATTTCAMTTMTTTTTGTGAGGAC 

CCAGAGAAGTGGGTAGAACCCAAATGCCCATGNGGGT 

Sequence 769 

GGCANGGTACAAATTCAGGGGAGGATGGAGCAGCTGCAAGCCTGGCTGGCATCCCATGCC 

AACAAGGTGACCCAGTAGATAAGTGACAAGGTGACTGAGCTGCCTGGTGCTCTTGCATAG 

AATTTTCMGTGTTAGAAAAATGTCTCCATGCCTTGCAATTTGTCTCTTGTGGCCAAGCC 
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TAACTCAGATGGAAATGCAGAAATCACCCGTCTTCTGCGTCGCTCACGCTGGGAGCTGTA 
GACCGGAGCTTGTTCTMTTNGGCNATTTGGGTTCNTCCXJCCCCGGGNNCNTN 
Sequence 770 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I I 1 1 I I 1 i 

I I I I I I I I I I I IGAGGNGGAGTCTCACTCTGTTGACCAGGCTGGAGTGCAGTGGCACAAT 

CTTGGCTCACTGCAACCTCTGCCTCCCGGGTTCAAACGGGTCTCCTGCCTCAGCCTCCCA 

AGTAGCTGGGACTACAGGCGCATGGTGCCACTCCCGGCTAGTATTTTGTATTTTAGTAGA 

GACGGGGGTTTTACTGTGTTGTCCAGGCTGGACTCGAACTCCTGAGCTNAGGCAATCCAC 

CAG CCTCAGCC TCCCAAAGTGCTGGCATTACAGGCATGAGCCACCGTGCCTGGCCTCTTT 

CATAI 1 1 I I I I I AC ACTTTTCATTTCTTCTTATTTT AAGTG NGCTGG ATAGGGGCTCCAG 

MCAGMTTCMTAGAMGTTGTGACAGTAGGMCCCTTATCTTGGTCCCTGATTTTAAA 

GGAGGGTTNAAAAAAAACCCCCCCCAA 

Sequence 771 

ACTTAGGGC GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACI ITTTTTTn 

I 1 1 I I I I I I IGGAGAGAGGGGGTTTCACTATGTTGCCTAGGCTGGTCTTGAACTCCTGGC 

CTCAAGAGATCCTCTTGCCTCATCCTTCCAAAGTGCTGGGATTACAAGCGGGAGCCACTG 

TGCCTGGCCTAGMGATCTGTTTTCTTTTCTCTGAATAATTCTTGTGACACTGTCTCTCC 

CTCCATCTCTTTCTGTTTCTTTGTCATTTTTCCCAGCTATCC I Nil I CCTTGNCTTGTC 

CTCTTCTCCCCTCCATCCTAAAACCTTTGATCACAAGCTAGTTTCCTTTCCACATCATCT 

GCTCCCCTCTACTAAACGCTATTTCGCCCCCACCTGCTTTCAAGCTGNGCTTGCCTCTGA 

GCCCCTCTTTCACCACGGCCCAAA 

Sequence 772 

NCTACTTAGGGCGAATTGGAGCTGCCCGCGGTGGCGGCCGAGGTACCTCATGCTAGACAA 

CCTTATGACTTGAAAACAAAATAATATTTGAAATGGAAATGGCCTCAGTTCCACCCCTGG 

TGCCCACATAGCATAGTGAACCTGCCCCTGCAGCATTGCCCATGAGTGCTAAGATCCTGT 

GCCCATTTGCATGTCTTCCTTAAACAAAAGACCGCCTTAGTAAGAAATTAGTAAACCAGG 

GAGATAATCMCTTATCCCCCAAMGATTTAAGCCTCTCATTTTGTTTAACCTTCATTGG 

GGATTTTAAATAGAAAACTAGGGCCCGGCAGGGTGGCACATGCCTGTAATTCCAGTACCT 

GCCCGGCCGCTCTAGAACTAG 

Sequence 773 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGG 

ACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTTCGCT 

TCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCT 

CCTCCAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCT 

CATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGT 

CATCAGTTTATCAACCAACAAGCAG 

Sequence 774 

AGGTACGCGGGGAGTGAACGCTCTCGGAGAACCCTTTCCCACGAACGTCCACTTCAAAAG 

AACGCGACGGAGCATTAACTCTGCCACTGACCCCTGGCCTGCCTTCGCCTCCTCCTCTTC 

CTCCTCTACCTCCTCCCAGGCGCATTACCGCCTCTCTGCCTTCGGCCAGCAGTTTCTATT 

TAATCTCACCGCCAATGCCGGATTTATCGCTCCACTGTTCACTGTCACCCTCCTCGGGAC 

GCCCGGGGTGMTCANACCAAGTTTTATTNCGAAGAGGAAGCGGAACTCAAGCACTGTTT 

CTACAAAGGCTATGTCAATACCAACTCCGAGCACACGGCCGCTCTAGAACTAGTG 

Sequence 775 

CCGCGGTGGCGGCCGCCCGGGCAGGTACACTACTGGCATAAGAGTAAATTGGTGAGAACT 

TTCTGGAGGGGTAGTTTGGCAATGTGTTTCCAAAAAATCTAAAAATTATATTTGCCTCTA 

ATCCAGCAATTATACCTCTAGAAATTAATACTAAGGAAAATCTTAAGAATATACCGTAAA 

ACTTTAGTTGTMGAAA7TTTTTTGTGGCCAGGCATGGTGGCTCACACCTGTAATCCCAG 

ACTTTGGGAGACCAAGGTGGGCGGATCTCCTGACCTCATGATCCACCCGCCTCGACCTCC 

CAAAGTGCTGGGATTACAGGCGTGAGCAAATTTTAAATAAGAAGAAACAGTCAACAGCAT 

CAGACATAGTAGGTATGTCCAACACCATAATGGCTGAAAAGTGCCCCCTAGTCTGGCAAT 
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TAGTAGGTCATTGGTTTATTAATAACCGGCATGTTAAAGTTG 
Sequence 776 

CCGCGGTGGCGGCCGGCAGGTACAAATCATACCTCCCMGGTATTGCTCCATTGTGTTTT 

TGTGCATTTGGTTTGGATTTTTATGGGGAATTGAAGACAAGTGGATCATAAAGTGCAAAA 

TAAAATGCTCTAGAAATGACAGATGGGGCACAATTTCCAAGAAAATTCATCTAGACAGTG 

GCAACACTGAGAAAAAAAAGAAACATTCAAGAAG 

Sequence 777 

GAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACI 1 I M I I 1 I 1 I 11 I I I I 1 1 

TGCTAGAGATGGGGTTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGGCCACAAGTGA 

TCCACCTGTCTCAGCCCCCCCAAAGTGCTGGGATTACAGGTGTGAGCCACCACTCCTGGC 

CCATGTTTAGGATTTATACCAATATTATTMCTTAGAMTAAGTTTCTAATAAATTATTC 

CACCCGMCTTAGGGTAACTGMTTTTAATGCTGATGTATTAAGCAGGTTCTTCCTGGGG 

TCTTTTGATTCTCAAGGGATCCTTCACTGNGGGTGGACTTCAAATTAATAGGAAGCAGGA 

AGGAGCCACCTGCACTGTTTTCTTGACTGGGGATGACACCNAAACCTT 

Sequence 778 

CCGCGGTGGCGGCCGAGGTACTATGAGAATTTCAAACAAAGAATGAAGCCATAAAACAAA 

MGACTGMTATTTGGCTCTGCCTGGCTCCCAGGCTTTCTACTATTCTTGTGACTTGGCC 

TCMCAAAATCTAAAGTGACTTGTTATTTGTGGGTCAGCTTTGTCCCATCCTTACCAGTC 

ATGGCTTTAGACAAAAGACTCAGCACCACTCACCCfNTGGGACAGTNTGACTGNGGTCTG 

AGNCCCCTGCTTANATATTAGGC7TAAGCTCAGTT 

Sequence 779 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATGAGAATTTCA 

AACAAAGAATGAAGCCATAAAACAAAAAGACTGAATATTTGGCTCTGCCTGGCTCCCAGG 

CTTTCTACTATTCrrrGTGACTTGGCCTCAACAAMTCTAAAGTGACTTGTTATTTGTGGG 

TCAGCTTTGTCCCATCCTTACCAGTCATGGCTTTAGACAAAAGACTCAGCACCACTCACC 

CTCTGGGACAGTCTGACTGTGGTCTGAGGCCCCTTGCTTAGATATTAGGCTTCAGCTCAG 

TTCC 

Sequence 780 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACACAGTC 

AGCATGCGCTGTAGCAATGTGCTTTGCAGCTGGAACTCCTATCAAGCATCCTAGGCAAGG 

CATGCACCCCAGCGCCAGAGAGAATCAGGAAGGGGAAGGTGCCCTGAACCTCAGACAAGA 

ACCCCTTCCAGAAACCACCACCAAAGCCATCACTGTGTTTCCACCCTCAGACCTGTGTCT 

CTTTAGCTTCTTGGTAGAAGGAAAGAAGAGGAGCTTGGGTGGGGCAG 

Sequence 781 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACCATGGACAT 

GCACCCCGGCCACAGAGAGGCGCACAACTTNTACAGAACACCCTTCCACCTGGTCTTCCA 

CAGCTGCATCAGATTCCTGGACTGTCACAGACATGACTTCAAACTTGAAAGTTGCAAGAT 

CTCCTGGAACAATTTCCACAATGCATACAACTTCATTCTTAGCCTCAAGCACTGAATTAG 

ACTCCATGTCTACTCCCCATGGCCGTATAACTGTCATTGGAACCAGCCTGGNCACTCCAT 

CCTNTGGANNNTTAAACNTTNAAMANNNAMNCNNNCCCCTNGGCNTTTTTAAAAANNN 

GGNNNCCCCCGGGNNNGGGAAA I I I 1 1 1 I I IAAANAI I II I 1 1 I CCCCCCCCCCCCNGGG 

GGGGGGGGCCCCCCCCCCCCCCI I I I I I I 

Sequence 782 

NAATTGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACTTCCCTGAGCAGTCGAAGTGG 

ATGCCCAGACCAATGGCCAGTGCTAATATCAATGCAATGATCCCAATGACGATGATTGGA 

AAAAACTTCAATGGCAGCAGTGACAGGATCTGTGCAGCAACAGCATCTGCATCTGGTGCA 

ACAGGACTTATTTTCAAATCATCAAGGCCAAAAAGCGATCGGAATGAGAAGGGGGCTTCA 

ACAGCAGGCGGATCATTTTCCCCCATGGTGACTATTTCAGGAC 

Sequence 783 

CNATTGAGCTCCCCGCGGTGGCGGCCGAGGNCTGATGTCCTACAGTCCTCTACCTGATCT 
ACGTTCACTGGAAAGTGTNGAGTCTCAGCAGGAAGCACCTTGCTCTCGTGTCCGGCTAAT 
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TCGAGTGCTTTACGTAAGTAGAGGMTTGCTGACTTTTGGGACATTTCTGGTCTTGCCAA 
AGTTCACCTTGTAGTAAAGCCCCCAAAGATACTTCCCAAATAGATGCTCTCTTGAAAATA 
ACTCAG 
Sequence 784 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCGGCCTATTTTA 

TATTCTGTTTAGTGCTTTCAACCCCGAAATCCACCTTCTAACATTAATATTGATATCCAT 

CCTCTCTCTCCTCCAACCTCTCTCTCTCTTGCATTTACTTTTAGATTTTCATTACTTTCT 

TTTTATTCTGATTCTTGTAMTAGTATAAACTAGATTCTTTTA1 1 1 I I ATTTACI UNA 

AATTTATGATTGACACATAATAA7TGTATATATTTATGGGGCACAACGTGATGTTTCGGT 

GCATGTATACATAGTATAATGATCAAATTAGGGTAGTTACCATATCCATTACT7TAAACA 

TTTATCATTTCTTTGTGGTAACAACATTAAAAATCTCATCTAGAATGGCGTGAACCCTGG 

AGGCAGAGCTTGCAGTGAGACGAGATGGGCGCCACTGCCTCCAGACTGGGCGGACGAGCG 

AGACCTCCCNTCTCAAAAAAAAAAAAAAGG 

Sequence 785 

GCTCCCCGCGGTGGCGGCCGGCAGGTACGAAATGAGAGAAATGGTTTAGTAAACGTATAA 

GACATCAACATAGNAAAGTATTCTATAGGNNTATGTGTTGGAATTACAAAGATGAAGAAA 

AGATACAGGCAAGTATTTGATATACTNAATTAAAAATAGCAAGATGTAGAGTAGNCATGT 

ATACAGTGATAGCAAGAACATGGATCCTTAAGGACAAAACTGAAACATAATGCAAAAAAA 

GAAAAATATGCAMTTATTTTTCGTATGATGTAAGTTGTAAATAT 

Sequence 786 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGTACNCGGGCCTATTTCTGAA 

TAACTCAGNGGCTTAAAATTATATCCCAAAGTAGNGGTATCACAGGGTTTCCTGATGAGG 

ATAAATGGGCCTGAAGTGCTTATGGGCACCCACTATGTATCATGGNAAAACTTGCACGTG 

TGTGTGTGTGTTGAGAGAGAGAGAAAAAANAATAGANAAAGTTGGTGAGAAAAGGGNAGG 

CTGI I I I I I GNNCCGAGGGNTGTNTGGTTGGGCTTT 

Sequence 787 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGNACACANTAAGTGAAGGGCC 

AAGACTGACGGCTGATAGGACAGGGGTGACCAGTGGTGGGGAGGGTAGTGGGAGCAGTCC 

ATCCTGGAATCTGGCATTCAAGGGGCGCATTGTCTGTGGAGGATTTAAAAATAATAAAAC 

CAACTAAAGGCAGTCTGCTTTTTATGGTCACCAGGCGCCAGCAATTCTAAATTTCAGTGA 

TAAAATATTCCTCCTCACTGGACACGAGAAGCTGGCTTNTCCTATTCCCCAGTACCTGCC 

CG 

Sequence 788 

GGNGGCGGCCGCCCGGTTTTGGACGCGGGTNTNTGCCCTNACI I I 1 I I AGCGGAGCAGAG 

GAAACATTCATAAGGAAATATGCGAGTAGAGCTCAGGAGAAAAGCAGGACTAGAGGCCCA 

AGAATCACAGGCCAGAAGAAGAAGCTGTAGCCTCGGGAATGGAAGAGCTCTCTGAAGGGG 

AAAGGGGAGAACAGGAATGTNCCAGGAGCCAAGGCTCATCTATAAGGGACTTNCACATTT 

AGGATGTAGAAGAAGGAAGCAGAAGCAGGGGATGACCAGAAATGGCCCCAGAGATGAGAT 

GAAAGTTAGGAGAGCGGNGAGCMGCCTTTAGGTTTCACAAGGGAAGGAGGGAAAGTAGG 

TGTTAGGTGCTGCCAAGATCAGGGAMATAAGCAGAAGACCAGGCCATTTTNANTTGCNG 

TGG 

Sequence 789 

CCGCGGTGGCGGCCGAGGTACCACAATCAACTCAATCACAACATATTACAACAAAACCCT 

TCATCTTTTTCCTTAACCCACTGTAACACAAAGCAG AG AATACAGATTAGC 1 1 I 1 1 I ATT 

TGTCTGTTTGACTTCATCTCTTACATACCTCTATTTCAGTATCTATGATATTTTCCTCTT 

CTTATCTGTTCMTGACAGTCTTCCCTTTTMTATTCTAAACTTGTCAAGCACAGCANTT 

AAAAAAGTATTCTCATGTATATATTTTATCTTTAGAGCATCGCATAAAGNCTGATACATA 

GGMGTTTTAGATGCATATTTACATTGGGTAGATGAATCCAGGGGAAAAG 

Sequence 790 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCAAGAGAGACGTCTCTTACTGCCTCATT 
AAGCATTTGGAGCTGTTAAACACAAATCAAGGCAACCAGAAAGGGCATCTTGGCTTCAGG 
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CTGGGCATAACCATCCCATTTGCCACATAAAAGTCTAGTGGCTACTCTGCACCCTTTCTG 
GGTAGAAGCAGAGTTAGTTTGGTCATGGGGGCCCCTGTGGGACAGTGTTGCCCAGACAGG 
TACCTCGGCCGCTCTAGAACTAG 
Sequence 791 

AACCCACTATAGGGNTNATNGGAGCTCCCCGCGGTGGCGGCCGCCCTGGCAGGTACTGNC 

TGTCTCAMTTTTTGGATACGCTGTCCCTCATGTCACCTAGCTGCTGAGAGCTTTGTGAT 

CCTAACAGGTGATGACTCAGACCGACACTGCATTGGTAGGAATTCCACAAATAGGTGCCT 

CMTGTGCCTAGATTGAMTATCAGCCTTTCCCAGACTGACCTGATGGGTTGACTTCAGG 

TGTGGTGTAMCACCTACATTTTAATGTAAACATTTCAGTGTAATCAATGAGAACTATCA 

TTCTGCTTTAATCACCATGAGTTCTGAMTMCAMGGATTTGTCTGACATTCATTCTAA 

GMATTCATTCTTACCTGACTAAGAMCTTTTTAACCTGGCACAATAATAAGAAATGACC 

TGGNAAGTACCTCGGCCGCTCTAAAACTAAGGTGGGATCCCCC 

Sequence 792 

NGGCGAATTGGAGCTNCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCAAAATGTAA 

GTTGGTAAAAGACATTGGACTCCAGCTATGTTCTTTAGAAAGAAGGTATTGGACTCTGGC 

CATGTTCTTCAGAMGAACATGCCTGGCTTTTTCACGATTTGATCAGTCTTCTTAGACCC 

TGAACCCCAC CATGA MTGGCTTCCCCAGACACAACCCGAGAGAGTTATGCTTTGTTTCT 

CAGCTAAAATATTTTGCAGATCTTAATTTCCTGGGTCATTGCATC AI I M I I I I I I I I I I 

T 

Sequence 793 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCMGTATACCTTTT 
CATTTAAATCATTGAACAGTTCACAATGGCTGTTGTAAAGTTTTTGCCTTGTTACTGAAA 
CC ATCAT CTCTTGGTCTATTTCTATGG AGTTA I Mil I CCTAGTTATTGGCAGTTTGCTT 
ATCTTTTATGTGTCTMTMCTTTTAATTGAATGTAGAACATTATGGATCTTATA I II I I 
GAGMTCTGMTGmAAMTTCCTTrGMGAGTTTGTGTmCTTCTGGAGGCAGTTTA 
CTTACTGGCTGTGAGCTCAGTCCTGTTGAGGCTGTTTNGCGACNNNCTGTGGCTTTGTCA 
GGGTGGGGTGGTGGATCAGACATCTGGTCATCAAC 
Sequence 794 

ACNCCCGGCCAAAAGGGAGNNCACAGGGGGGGGCCAATATATAAGGGGGGCAAGGAGGGG 

GGGNNGGGNAAAGANGNCAAACCCCNNCCCNNGGGGNNGGGGGGGGAGGGGNAANAAAGA 

AGGNGGGGGNGNCCANGCAACCNAAAGGGACNAACCCAACANNNAAGCGGGGGAAGGGGG 

CACGANAGAANANANANNANAGGNGANNCAAGAACCAANCAAGGGNGGGGGGGGAAGGGC 

NNCAAANGGGNNNAAAAANNGGCNCAGGNNACCNAANGGGGGGNNGGGGCNNAGAANGGG 

GGNGGGGGNNANGGAAANNAAANNCNGGANNNAACCAAGGGGAAGGGAGGGGGNGGGAAA 

NCAAGGCCCCCGGGGNCAAAACAAAAAGNGGGGGGAAAANACAANCGGGGNNNGGNGNCA 

GGAGGGGAGGNNANNAGCCGGCGCGNAAAAAAAAAAAACAANCCAGGGGCAANGGGGNNG 

GGGGNGGCNAANGCCNCNNGGGGNAAANCCCNANGGGNGNACCCNNGGAGGGNGGGGNNN 

GGGGGGGCNCNGAAGGGAAAAANAACCCCCAAGGGAAAANCCCANGGAANGGGNAANAAG 

GGGGGNGGCAAGNGGGNANCCCGNAAAAANAAAGGGGAAACNGGGGGANNGNCCCAAAAC 

CCCNGG 

Sequence 795 

CCGCGGTGGCGGCCGAGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAA 

CTAAGAAAACACCAACCATTCAACAGTATACTAGMTTCCTTTCAATGCATAATAGAAAC 

AAGMGGGATTAGAAAAGCATGTCATAATTTCCAGATAGCATAATTATTTACATTAAAGA 

TCCMGAGACTCAGACCTAGTAAMGATTTTGGCCACATTGTGACATTTGAGATCACATT 

AAAAAAAAAAAGGAAAAATCAAGNGATACTAACATCACCAATTAACATCATNAATTAGAA 

MTTTATCCATCAGCNGTATTNNNNNGGGCTCMTGCIsnTGTMTGCCGCATTTCGGGAG 

GC 

Sequence 796 

CCCCCCCCGGAGTTTCAGCGAAAANCCCGGCCGAGGNACCNCNCCNTTACNCCCAGGNTT 
ANAAACNCCCNGGTNTNGNCCCAGGAGAGGCGGGGGANCACCGCGCCNGGCCAGCANGGN 
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CCCCTNCTAAATNNNGNNGTNNAAGGAANCNGGACCCCAAANNCNCGCCCCCCACGAAAG 

GAGCCTGGGAACTACCAGGGCGTGAGCTCACCGNGCCCGGCCGCTCTAGAACGAGAAGGA 

ACCCCCGGGCTGCAGGAAAAACNATAACAAGCNAANCNAAACCCGACCACCCCAAGGGGG 

GGGCCCGGGACCCAGCTGTTTGGGCCCTCAANACNAGGGAAAAATTGCGCGCTAAGNGCN 

CAANCANGGGCCAAAANGCTGGNTNCCCCGGAGGGGAAAAAAGGACACTCCCGCGNCACA 

AATTACCACNCAAAACATAACNGAAGCCGGNNAANCAATAAAANNGGGGAAAAANACCCC 

NGGGGGTGGCCCTAAAAGGAGGGGGAGCCCCAACCACCAACAAAATAAATTGGGGG 

Sequence 797 

AGGTACTATCTCTTAGGAGAAGGCTGACTTGAAGGCTGTGAAAAACTAAGAAAACACCAA 

CCATTCMCAGTATACTAGMTTCCTTTCAATGCATAATAGAAACAAGAAGGGATTAGAA 

AAGCATGTCATAATTTCCAGATAGCATMTTATTTACATTAAAGATCCAAGAGACTCAGA 

CCTAGTAAAAGATTTTGGCCACATTGTGACATTTGAGATCACATTAAAAAAAAAAAGGAA 

AAATCAAGTGATACTAACATCACCAATTAACATCATCAATTAGAAAATTTATCX^ATCAGC 

CGGGTGTGGG 

Sequence 798 

GCGGCCGAGGTACAATTCAACAATTNNTGGTCCAGGATCATGAATGGGCCATTNNTAGTT 

CTGTGTGTGCTTAAACACA I I I I I I I GTGGGGTGCTGTGGATGTGTGGATGTAGCCAAAA 

AAAACCCTATTGTGGGNTNGGTCCTGGGGCAGAAAGTCTGGTGCCAGAGAGTGGGGTTCT 

GGGGGTCTGTCTTCATAGTTTGGGGTAGCACTAAAATCCTGTGAGCCTTTCTGGGCCTTG 

GTAACCTCCCCTGTMGTTAGCTGTTAGATAATTCAGCTGGGTAGCATTTTATACCTGGA 

TGATGTTCTAAAGTCCAGCCACANAAGGCCNNNGTCTGGCAGAGTGAGAATTNCCTTTGA 

AGAACCTTNAAACTGNTNCCCNAGAGTGACACAGGGGNNCCTNGGGGAAAANCNAAAAAG 

NNNTTGGGAATTCTNTNCAAAAGNAAGNCCCA I I I I I 1 1 I 1 GCCNNNATTTNGGCCCNCG 

NTAATNCCCCNCCCCAAGNAAAAANNAAAAAAANTN I I I I I I I 1 I 1 1 I I 

Sequence 799 

CGGTGGCGGCCGCCCGGGCTTGGTACCCTCTGTCACGGCTTCCCTTTGCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCAGANATTGTTGCCTCCTGAA 

Sequence 800 

GGGCGNTTTGGAGCTCCCCNCGGTGGCGGCCGGGCAGGTACTATCTGGAACNTGTAGCTT 

CCTTTNGCACTGCAGCATGGGMGCCAGAGTTGATGATTCATACACCAGCATTTACATTT 

TTCAGCATGAAAGTGGTATGTTCTTCAACTCACAACCCATTGGCCAGAACCAGTAACATG 

ACTTCACCTAACTGCAAACTAGCTGGAGAATTGTGGGAGAGCTCATGG 

Sequence 801 

CCGCGGTGGCGGCCGAGGTACCATTTAGCACACAATTTCCATGTCCCAAAAGCAACCCCC 
ATAAACAGTGAC TA I i I I I I ATGCTGTTTTTCTTTGCCCCMCACTTTTATCATTTGATA 
TGTTATATCTTGC I I I I I I 1 1 I I ICI I I I I I AATGGAGTCTCACTCTGTCACCCAGGCTG 
CAGTGCAGTGGCGCGATCTTGGCTCACTGNAACCTCTGCCTCCTGGGTTCAAGCAATTCT 
CCTGGGGGGTGGGGAGGTTTGCAAGTGATCCAAGATTGCGGCTCTTCACTCCAGACTGGG 
NGAAAGAACGAAACTCCATCTNAAAAAAAAAAAAAAAAANAAAGTACCTNCCCGGGCCGG 
CCGCTCTANAACTAGGTG 
Sequence 802 

CCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATGGTGTCAACTTATGTCAGGACCCATG 

GGCCCTCCCCATGCACACAGCACTCTTGGAATCTCATCCTTTTCCATGGCTCTGGCTCAC 

ACTTCCACAGCATTTACTCCTAAATATGCCCCCTGGGTTCAAGGGTGATTCTCGTGCCTC 

AGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGCCTCGT A l I I I I I I IGTGT 

GTGTGTTTAGTAGAGACGTGTTTCACTATGTTGTCCGGGCTGATCTCCAACTCCTGTAGT 

CAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTATAGACATGAGCCACCACA 

CCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAAGTGTGGGTAGTCATTAGTGC 

TGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGCACTTCCCGTCCACT 
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Sequence 803 

AACGACCGCCC6GGCAGGTACTAAGCCAGGACCTCAGTTAGCACTAAGCACTCTTACTAT 
TGCCCCCACCTGGCACAAAGCAAACTGAGATCTTAGTTGGGCCCATCATGTGTCATCTGA 
TTGTCTTAGAAGTTC 1 1 I 1 1 1 ICTAAGACAGAGTTTTGCTCTTTTGGCTCAGGCTGGAGT 
CCAATGGCACAATCTCGGCTTACTGCAACCTCCGCCTCCCAGGTTAAAAGCGATCCTCCC 
GCCTCAGCCCTNCGAGTAGCCGGGACCACAGGCACCCGCCACCACGCCCGGNTAACCTT 
Sequence 804 

TACTATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTCTCTAT 

GACTATCAAGCTCAGGCCTCTCCC 1 1 1 1 1 1 1 AAACCAAAGTCTGGCAACCAAGAGCAGCA 

GCTCCATGGCCTCCTTGCCCCAGATCAGCCTGGGTCAGGGGACATAGTGTCATTGTTTGG 

AAACTGCAGACCACAAGGTGCGGGTCTATCCCACTTCCTAGTGCTCCCCACATTCCCCAT 

CAGGGCTTCCTCACGTGGACAGGTGTGCTAGTCCAGGCAGTTCACTTGCAGTTTCCTTGT 

CCTCATGCTTCGGGGATGGGAGCCACGCCTGAACTAGAGTTCAGGCTGGATACATGTGCT 

CACCTGCTGCTCTTGTC7TCCTAAGAGACAGAGAGTGGGGCAGATGGAGGAGAAGAAAGT 

GAGGAATGAAGTAGCATAGCATTCTGCCAAAAGGGGCCCCAGNTTCTTAATTTAAGCAAA 

CTAAGAAG 

Sequence 805 

CCGGGCAGGTACAATGGACTTTGACAGTTCTTCCCAAACAGATCCTAATTTTAAACATTA 

GGTTTGCTTTGATTCTTTTCCTTGGGGCTAAGAGCTCACAAAGAC7TAGGTTCTGGTCAT 

GGCTCCAGAGGCCACACATTCCAGGACAAAGTCTCTCTACAGTCAACGCCTTAGTCCCAC 

ATCTGTAAAATCGGAATAATCATCCCTGATCCAGCTATCACATTGCAGTAGAGTGAGACT 

CAAATGAGATAATGGAAGACAGTGGGAATGATCATTTCCAACTTGGCCTGGCTGACCCAT 

TCCTTGTTCTAAAGTCAGCTCAGGTTTCACCTCTTCCAGNGAAGTTGACCTGGCACTTTC 

TTTTAGGATGGCTACTGCTCCTCTGGGTGCCCCGGGGCTCANTGTCTCCCCATCACCGCC 

CATGGCACACTTGGAGTGACTGGTCCTTTACTTTGNTT 

Sequence 806 

TNCGGGCAAGGTACATTGGCCCCAAAGAGNAGGAATTCCTTGTAGAGGAGCTTGTAGATG 

CTTNCCCTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTG 

AGAGTGTATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCC 

CCAGGGCTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCACACACCCCA 

GCCTTCACCCTGCGTCAGTGGACAAGGGGGTAGGAGCCTGCAGAGCAGAAAAGTACCT 

Sequence 807 

AGCNCCACCGCGGTGGCGGCCGANGTACGCGGGATATGTAGAACTTCACNNGTTTGAAGT 

TGGCTGATTAAATATACTMGTATTACTGAATCACTGCCCTGCCTTTTCTGCTTTCTTTA 

CAGACCTGTTTAGTATACACTGTATGTA I 1 1 I 1 1 I I I I I I I 1 1 1 GAGACTCCGTCTCAAA 

AAAGAGAAAATTATGGGCCGGGCACAGTGGTTCATGCCTGTAATTCCAGCATTTTGGGAG 

GCCGAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACTAGCTTGGCCAACATGACGA 

AACCCTGTNTGTACCTGCCCG 

Sequence 808 

CCGCGGTGGCGGCCGAGGTACGAGACTTGTCACCATGTGACATGGCAGCTTCAGAAACTT 

AGCCACTGCCAAAAAAAGAGCAGGCAGGGATAATGTTGTCCCATTGTCCAGTCAGAGAGA 

CCTGTTGAGTCTCTAGTTTGCCAGTCCCCAAGAGACCTTTGGAGTTTTGCTGGAGCCAGA 

CATCCTGCTTAGAGATGAGGAAGATCCTGCTGTTCCGTGGGGAGCTCTTGAGACACCCGT 

GCCACCACCCACCTTCTCCTGATTGCCACTTGCTGCCCTTTTCCCATTACCCTCTCCTGA 

CTCCATAAACATCTTCAAGTCTTCCCTTTCTCCACCCCAAAAAATGCCCACCTTGGAAAG 

GG 

Sequence 809 

AAATTAATTGGGGTTGNGCTAACTGCCCGGTTTTCAATCNNGNAAACCTTGTGGGGCCCA 
NNT GAATT AANANAATNGGNCCACCCCCCCGGGGAAAAGGGNGGTTTTNNAANTTTTGGG 
GCCTTTTTCCCCI I 1 1 I IAAAAAA 
Sequence 810 
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CCGCGGTGGCGGCCGCCCGGNCACGGTACGCGGGGATGTCTTCTGAGAGAGTCAGGGCAG 

CTGAAGACTGGGTGAGGGTGAGGGAAGCCGCTGGTGTCCTCCTCAGTCACCCGTGAGAGG 

ACTCCTNTGTGGAGCTAATCAACTGCAAGGAAGATTGTTCCCAGTGTCCAGACCTGAAGG 

AGTCTGGACCCATAGTGCANTGAGATTTGGGGAAGGAAGGATTCCGGATAGGGGTGAGCT 

TTNTGNTGATAAGCAAATGTGAAC 

Sequence 81 1 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGTGTTTATGGGAAGCCAGTAACACTG 

TGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTA 

GAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGAT 

CCGCCTGCTGTTGAAGCCCCCTTNTNATTCCGATCGCTTTTTGGCCTTGATGATTTGAAA 

ATA 

Sequence 812 

CCNTCAGGTACCAGANCTTAGCAGGGATTTTGGACAACAAAAGCTCTAAATCCTCTTGCA 

TCGACACGTTCMTTTGCACTGACCAATCTGTTGGCACAGTAACTGTTTATAAGCTAAAT 

TTCTACATTTTGGCTACAAGTATCCCAMTCCACCTTTTAAAAAATCCTAGGTAGATGCC 

ATCT GGTGTTMTGATTTGCACACCCCTTAAATTGAAAATATTTTAAATAAATCTCACGG 

TTTTATATAGTATCATTMTGTGTCTATTTTAAAAAGACAATCTGAGAATAACACTTCCC 

CTAATTGTTGTCTTAATAATGACCAAGAGCTGAGGAAAAATGATTCACACTGTTAGTTGT 

TTTGTTGTTTTGCTCACGGGGGAAGGGGGTGAAGTACTGGCTGTGCCTGGGTTTG 

Sequence 81 3 

CCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAAA 

GGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTCC 

GTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCAA 

GTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTTG 

GTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGC 

ACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGCC 

CAGCCAGGCACCACCAATTACCAGAGGAACAAA 

Sequence 814 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 

GAGCAGTTATCCMCAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 

TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCCCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATCACGTCTGAGATGGGAAAAAGGCTCTTCAGCTACTCC 

CAAAAGGGTATGCCCTGGGCATGGG 

Sequence 815 

CCGCGGTGGCGGCCGAGGTACTC 1 1 1 I I! I I I I 1 1 1 J II II 1 1 GAGACAGGGTTTCTGTT 
GCCCAGGCTGGAGTGCAGTGGCACAATCTCAGCCCACTGCAGCCTCCGCCTCCCCAGTTC 
CAATAATTCTCATGCCTCAGCCTCCCAAAGTGGTGGGATTACAGGCGTGAGCCACTGCGC 
CCGGCCACCTTTCTAI I I I I CTGGTTAACTTTCTAAATGTTTGAAATGGCTTCCAGTGAA 
TTTCATTTTATTATTGGGGGAMCTTCCATACTTATTTTCTn 

AGTATACTCTCCTCCCAAAATTTAGATAGTTGTATTTTTCTGATTATTCCAAATAAGAGT 
GCTGAGAGGCTAATCACAAAGAGCAACAGCCAGAGATTTACAAAGTGGTTCTCTTACTAT 
TGAACATTTTCACTTAT 
Sequence 816 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCTGCCCAGGTAAGATCAC 

TCGTGGGTMGMCATGAGGTTCTCACCCGTMGGCAGGATTTTTATAGAAGGAAGGTAG 

GTCTTTCAACCTATGTCCTCCTTCTGTTCCACAAAGTGGAAAGCCACAAGCCCTACAAAA 

GCCTTGCAAGTCCCAGAGGCTGCAGCCGTATTTATTCTTCAGGCCAAGACTCTCAGGACA 

GAGAGCACCCATGCACCCCGCAGGCTGCAGGCCATCTCCCTGCATTTGGGACTGTCCTGA 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

131/467 



GGATGGCGGCTTCATTTTTGTCCCTCCTACCTCTGA 
Sequence 817 

GMCCTAGGGCGATTTGGAGCTACCCNCGGTGGCGGCCGAGGTACATTTTGGCAAACCGT 

GMGGGCTTTCNTTTTNGCAGGTTGGACTTCCCCCCCCTAGTNGGCAGG A I I I 1 1 I I I AG 

GGGACCACCTGAGAMGGTCTGTTACCGTGCATAMCCTCCTTTMCACCTTTTAAAAAC 

TCTTCTGGGGGCCGGACTCAGTGGCTCATGCCTGTAATCCCACCACTTTGGGAGGCTGAG 

GCAGATGGATCACCTGAAGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCC 

GTCTCTACTAAAAATAGAAAAATTAGCCAGGAGTGGTGGCAGGTCCCTGTAATCCCAACT 

ACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCAGGAGGC 

Sequence 818 

GCCAGGAAACCCGTAAAAAGGGCCCNGTTTGTTGGCGG I M i ll I CCATAAGGGTTTCCG 
CCCCCCTTGACCGAGGCANTTAACAAAAAATNGACNGCTTCAANGTCAGAAGGTGGGC 
Sequence 81 9 

CCCCCCGGTGGCGGCCCGCCGGGCAGGTACTGGGAAATGAGGCAAAAGTNTNTCTCTTCA 
CTGCTAGCTCCTTGGGGACCAGCAAGCGGCTCTCAAGTTGCGTGGTGGCCCAACTGGAAA 
AAAGGCAGTTCGGTGCATCCTGGGAATATCCAGGTGAAAGTGTGAGATTTACCTAGAATA 
GCTTCTGGGCCTCTGGGGTTTTCACGCTGTCTCTGGTGAAGGTGTCCATTTTAGAACTGA 
AGCAAAAAGGTTTCMTCCGTTCCGTTTTCTTTGTTTTAGCACTTACCCCAGNNNCCTCC 
ATAACAAAGGTGGNGCCCCTTCAGGGAAATTAAA I II I I I I I I CNTAAAGGCCTTGGCAT 
TAANCCN I 1 1 1 I 1 1 I GNGGGNNGGGNAAAN I I I I II I 1 1 II 
Sequence 820 

TAGGGCGNTTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACAT 

CCAGGNCAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACGCATTCCGCTTCTG 

TCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTC 

CAATTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATT 

CCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATC 

AGTTTATCAACCMCMGCAGTTCCAGCACCCAGCACTTTTTACCTGAATTTtAC 

Sequence 821 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGCATGCAAAACTCCAGATTCCTATCTTC 

TTTGGGGGAAAAGCAAATTGGMGCTCTGACAATGCTGGGCTTTACTTTCCCACATAGCA 

ACCATCAGTTGGAGCTGAGACACCTCTGCTCTCTTTAGAAAGAATTATTAATGCTTCAGT 

CTCCATTATTGCTTCCCTMCAGTGAGGATAAGTTATTGGCATCAANCCTGGCCGGTTTA 

NCTTGGGGGTTTATTTTNTNNNTTTGGGGCCTNAAAACCCCGGGGGGNNCCTTTTTGGCN 

CNGNGGGGGGGGGGMNTNTNNNANNANGGNNGGGGGGGGGTTTNTCTCNNCCCCCCCCA 

Sequence 822 

CCGGCAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGTGTTTATGGGA 

AGCCAGTMCACTGTGGCCTACTATCTCTTCCGTGGTGCCATCTACATTTTTGGGACTCG 

GGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAAATAGTCACCA 

TGGGGGAAMTGATCCGCCTGCTGTTGMGCCCCCTTCTCATTCCGATCGCTTTTTGGCC 

TTGATGATTTGAAAATAAGTCCTGTTGCACCANATGCAGATGCTGNTGCTGCACAGANCC 

TGTCACTGCTGCCATTGAAGTTTTTTTCCMTCATCGTCATTGGGATCATTGCATTGATA 

TTAGCACTGGCCATTGGTCTGGGCATNCACTTTCGACTGCTCAGGGAAGTACCTCGGCCG 

CT 

Sequence 823 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTGTT 

AGCTATTATCATCACCCTCCTTGCTAGGCAGAGCAGGACAGTGGGGAATTGATGTTTCCT 

CCCCTCCATCTCACAGGTGGGGCAGGGGTGTGCTGAGAAGAGAACTTGGGACTCTTGGCC 

CCTGTTCMTTCTCTGCTTAACCTGCTAGGCAATTTGGGCCTCTGAAAATTCAGTAATCC 

TCATAGCAACTTAGACGTCACCTGGGCCTGTGGTCCCCTTCCTAGCCTAGGAGTCAGAGC 

ATGMGCTCCATCTGTCACATTGGTTTGTTCAGAGAACTACACATGCGTTTTAT7TTAGC 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

132/467 



AGCATACAGGTTCCCACTTAGGCATTGAGAGGACATAGGAAGCTGTTTAACTTCCTA 
Sequence 824 

ATCACTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAGTTATTTTA 

AATTCCACTCATAACTTATCGGCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGG 

AGCGGGAAGTGCAATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGA 

CTACCACACTTCGCTAAGGTCACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTC 

ATGTCTATAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGT 

TGGAGATCAGCCCGGACAACATAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAA 

TACGAGGCATGGTGGTGCATGCCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAA 

TCACCCTTGAACCCAGGAGGCAGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAG 

CCTGGGTGACAGAGCGAGACCTTGTCTNAAAAAAAAAAAAAAGAAA 

Sequence 825 

CCGCGGTGGCGGCCGAGGTACAGATGTATGGATCTCATAGCATTGAGGGGTCTTTCAGAT 

TATGTTTTCAAACCCCTCACTTTCTCTTTTCAGATAAGACCACAGCGACCTGGGAAAGTG 

CAACGTCTTAGCCAAAGACACAGAACTATTTAGCGACACTGTCTAGACTCTAGTTTCCAT 

GTCTCCTGACTTCAGTCTAGTGTTCCACCCCTGCCGCCCACCCCTGCCCCATCCTCATTC 

CTCCTGTAGGAGAGGCCAGACCTTTGCCTGCTGCAGCTTGTGGCTCTTCTCCTGCCTTCA 

GTTNTTCCATTGCCTG 

Sequence 826 

GGGNNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTACCTGTCTGGGCAACACT 

GTCCCGNNGGGGCCCCCATGACCAAACTAACTCTGCTTCTACCCAGAAAGGGTGCAGAGT 

GGCCACTAGACTTTTATGTGGCAAATGGGATGGTTATGCCCAGCCTGAAGCCAAGATGCC 

CTTTCTGGTTGCCTTGATTTGTGTTTAACAGCTCCAAATGCTTAATGAGGCAGTAAGAGA 

CGTCTCTCTTGGGCAGTACCT 

Sequence 827 

CNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGGTGTGGCTACTACCGTTACA 

ACTGCCTGTGCTTGGACATGGACCCTCTGCAATATGCGGCAGTTTCATTCATTGCCCCCT 

ACATTCTACACCAAGTAGAAATGGAAGGCAATTGGATACTTCACAGACAAGATCTAAGTG 

GAGAAGGAATGCGTCCTGTGGCTGCAGAGATCCTTGGAGCTTGGAGGGGAGAGCTTGAGC 

CCCACTGATGATGACCTCCCACAGCTCGCCAACTCAGCCCTCCCTAAGTCCCCATCGGGG 

GCCAATTCTCACTCTGGGGTTGGGGGGACTCCACCATAGCTCATCCATCATAGGGGATGT 

TGGTATCTACTGTGGGTTTGGGTAGGGCCCGATGTGCTGAGGATGGCTCCX3CCACAAGCA 

AGAGATGTGGGATTTGG 

Sequence 828 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCAATTTGGACCCCAGCCTGGTGGAGCANGTCTTTCNAGATANGACCCTGAATGCCTNAT 

TCCATTGGGCTGGGGCTTCCAGCTACCAAGTTGGGTGGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTTA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCCGGCCCCCCCC 

Sequence 829 

CGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCTGATCTACTCCTCTCTACAACAAC 

CTTGTGGGTGACGTTATTATCTCCATTTCACAAATGAGGCCACAGAGGTTCTAAAGGGTA 

AATGACGATGATGATGAGAGGTAAGTGATAAAACAATGTCTCCTGACCACAAATCCTGGA 

ATTTAAACATAAGNGTAGTAAACATGAACTCTAGGAAGCCTCCTGGGGCTTCTNCCTGTG 

TCTGGAGCCCCTGCACATGCCCAAAGGAAGTCCTTTTGGTTCTNCGNTCAGNAGAGAAAG 

GGNGCATTTCACTAAAAGGGAGGTGGGGAAACAAGACTGGTGGTAGGG 

Sequence 830 

CCGCGGTGGCGGCCGAGGTACATTATTCATATCCAGCACTCCCTGCGGCTGCTGCTGGAG 
GAGCAGTTATCCAACAAGGACTGTTTCAACCTCATCGCGTTTGGAAGCACAATTGAAAGC 
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TGGAGGCCTGAGATGGTTCCCGTGAGTCACAACAATTTACAAAGTGCCTGGCGGTAGGTT 

ATGGGCAGAGACTTCGTGGGGCTGTGTCTGAGGGMGGTTTGCAGGCATTGTTTTCTCTG 

TCCCCCTCTCCACCAAGAAGTAGCTCTCTAGAGTCCCTGACCCCAAACAGCCATGGGCAG 

AAATCAGAAAACAGCTTCCTTCTGTCTGCTGCTCTCCXCACCTGGCCATCTTCACTTTAT 

GAGAGTAATGACATCGACTCCATTCACGTCTGAGATGGAAAAGGCTCTCAGCTACTCCCA 

AAAGGTATGCCCTGGGCATGG 

Sequence 831 

CC GCGG TGGCGGCCGAGGTACGCGGGTAACAGGAGTCTTTGCTGAGTGATCATCTGTTTA 

TTCTTTTACTCCACAAATATCGAATGTTTACAGCGTGCCTGGCACTGAGCAGGGCTGGGG 

TTTCCTGACCATATGGACCTTCCTGGGTATATCTGTGGGGCTTGAATGGTGTGTGACCTT 

GTGTCCTGCCCG 

Sequence 832 

CGGGCAGGTNCGCGGGGGTGTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTC 

CGTGGTGCCATCTACATTTTTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCC 

GGATGTCAGAGGTCCTGAAATAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGC 

CCCCTTCTCATTCCGATCGCTTTTTGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACC 

AGATGCAGATGCTGTTGCTTGCACAGATCCTGTCACTGCTGCCATTGAMGTTTTTTNCA 

ATCATCGNCATTGGGATCATTGCATTGGATATTAACCCCTGGNCAATNGGCTTGGGCATT 

CAATTTGACTTGNTAAGGGAAGTNCCTCGGCCGNTNTANAACTAGNGGGATCCCCCGGCT 

GGANGAATTTCMTTTNAACTTATTGATACCGTCCANCCTTGNGGGGGG 

Sequence 833 

ACCGCNGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACTTTTNNGGACAAG 

GTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACC 

MCTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGAC 

CCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTG 

GGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTTATCA 

AC 

Sequence 834 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACCTTACCACCC 

CATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGCTCAATGTAAACGTGTG 

CACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTTCTCTGCACCCCTCCCT 

CCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGCTGGGGCGCTGGTCTGA 

TTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCCAAAAGGTTGGGACAAA 

CACAGCAACAAGCCCAGCTCCCTGAATTTCAGTGATTCATTTGTGGGGATAAAGGAGTGA 

ATG 

Sequence 835 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATAACTTATCG 

GCCAAAAGTAGTCACATGGCTCCACCTAATCACAAGTGGAGCGGGAAGTGCAATCCTACC 

TTGCCTGGGGAAGGTATAGAGATAGACCAGCACTAATGACTACCACACTTCGCTAAGGTC 

ACATAATAAATAAGCATCAGACATCAGGTGTGGTGGCTCATGTCTATAATCCCAGCACTT 

TGGGAGGCTGAGGCGGGCAGATCACTTGACTACAGGAGTTGGAGATCAGCCCGGACAACA 

TAGTGAAACACGTCTCTACTAAAAACACACGCAAAAAAATACGAGGCATGGTGGTGCATG 

CCTGTAATCCCAGTTACCTGAGAGGCTGAGGCACGAGAATCACCCTTGAACCCAGGAGGC 

AGAGGTTGCAGTGACCGATATCATGTCACTGCAGTCCAGCCTGGGGTGACAGAGCGAGAC 

CTTTGTTTCAAAAAAAAAAGAAG 

Sequence 836 

GNGGNGGCGGCCGAGGTACTTTAACANGCCATACTCCAGTCCCAACAATGTTAAATGCCA 

AAGCAGTGTTGGTAAAAGCCTCAAATGGTGAAAAGGACAGAAACTCAAACCCGCCCTTGT 

GCCAGTAAGTAACTGTTACTTATCTCACAAAGCGCTTGGCTCTGGAMCAATCTAACTCT 

GAGCTGCACGTGGAGTCTACATGGGAATGTGCAMGCATGTATTTCCTTTTAGGTGCAGC 

AGAGGTAACCGAAATTCAGATAAGAGAAAAAAAATCCAGATTTCAATGCAAGAGGTGGAA 
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, GATCCACGMGATACTCGTTACTATTTGGTTTCTAGGAGCAGGATTGCCACTAGATATGA 
TGGAGAACAAAAATGAAGAGGTGTTGTGTAACAAAACAAAACAAAACAAAACAAAAAAGT 
AGAMGAMGAGCAACAGGCCGGGCCGCAGTANCTTCATGCCTGTAATCCCAGCACTTTT 
GGGGAGGCCCAG 
Sequence 837 

NTTGCGTTGCGCTNACTGNCCCGCTTTCCAGTNCGGNGTAAACCTGTCGTGCCCAGCCTG 
CATTAATGAAATCGGCNCAACGCGCGGGTGAGAGGCCGGTTTGCGTATTTGGGCCGCTCT 
TCCCGCTTTCTTCGCTCACCTGACTCCGCTGCGCCTCGGGTCGT 
Sequence 838 

CGCGGTGGCGGCCGAGGTACTAGGCACTGAAGATACAATAAGCAATCCAGNAATNCCTCT 

TTGMGMTTTATTCTGATGAAATAGAGACAAGCCTATTAAAGTATTCAAGGCAACATTA 

CTTAAATATTTTATTTTATTTTATTTATTTTTAGATGGAGTCTCGCT 

CTGAAGTG 

Sequence 839 

AGGTACCTGTAGTCCCANTTACTTTGGAGGCTAAGGTGGGAGGATGGCTTGTGCCCAGGA 

GGCGGAGGTCACAGTGAGCCGAGATCACACCACTGCACTCCAGCCTGGGCAATAGAGCCA 

GGCCTTGTCTCATAAATGAAATATAAAATATAAAATAAAATAAAATAAAACGTTGTTGGC 

AAAGATGTATTCAAMTAGTATGTAGMCTTACCTGGAAGGTMTTTTGCAATATTTACC 

MMTATACTT7TTCATCTAGTAAGGTCTCCTTCTCAGAATCAGTCTTAAAGCCATTCAA 

ATACAGATTTATGATTAAAGATTTAAGTCCAAAAATGCTCATTATAGCAATATTTATAAT 

AGGAAAAATTGGGGAAAAACAATTATACATCCAACAGTAGTAAGAGTGTGACTACATTAT 

AGTATTAGTATGTMTGGGATTGTCACAGAGCAACACACATGTTTTTGMGMTATTTAA 

GGGCGTGATAAATATTAAATGTAAATGTAMTTGMAAAATGATATCTGTAGATTTTCAT 

TATGCATTTCTTTATGAAATTTTTNGATATACACAAMNAAMTAGTCATGCATTTGCTT 

CATGACGGGGACATATTTCTGAGAAATGTGCTGTTAGTCGGTTTC 

Sequence 840 

GGCGGCCGCCGGGCAGGNACANCTTCCGTGGGGGGGCNNAAAACCCCCCACCNAAANCAA 

AANAGCANCAAAGGAAGAATTNNTTAAGGGCAGGGGGGGGAAGCCCCNAAAAACCNGANG 

GCAAANNCCAGAGCNGNGGNNNAGCNCNAGNNNCNGGGAAGAGCAANCANNACGCTTTTG 

TGAGGTTTTNGGNGNNGGAGGGGGGGAACCCCCCCGANAAAAAACCCAGNAGCCCCCCCA 

CNNAACAGGGGANGAANAAGAGGAAGGNAAAGGANNNAANNAAGAGCCACAGNCCNGGAA 

ACANGAAANCANNANNNCAGACANGCCNNNGAANANGNNGCCNACNNNAAAAGAAGGNNN 

GGNGCGCCANNGGGGAGNAAANNGGAANNANNAAAAAAAAAAAGGAAAANGNGAANGAAN 

GAGGANAANANNNGGGGGAGGGNAAANGGGGCGGGGGNNNAANGCCNAANAANANNNAGN 

NNNNGGGGGAGGGCCNG 

Sequence 841 

AGGTACCTTACCACCCCATCCCCAGAGCATTGCATGGGGTGTTTGGCACACAGTAGGTGC 

TCAATGTAAACGTGTGCACTGTGGCATGTTAGAGCCAGACAGGATCTCATCCAGCCCGTT 

CTCTGCACCCCTCCCTCCCCTCTCCAAGTAGCCCTGCTGTGGGTTCAAGTAAAGAGGGGC 

TGGGGCGCTGGTCTGATTGTGTGGGTGATTTGGGGAGATCTCTTCCTCTTCCGGAACCCC 

MMGGTTGGGACMACACAGCAACAAGCCCAGCTCCCTGMTTTCAGTGATTCATTTGT 

GGGATAAAGGAGTGAATGATAAAGTGAAGGACGACTGTCCCCGCG 

Sequence 842 

TAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTACTCCCAAAAGGCTATGAAATN 

GGGGAAAACCCCAGGTGATTCATGCCTGCTTAGCTGCAGNATNTCAGTNGCANTAGGTGG 

AACCCCAAACCCAGNGCANAGTGCCAGNGTCTGCTTNGGTGAGATATGAGTGTCAAGTCT 

CGAACCAAGCAACCTATCNAAAGCCTGNGACACTCCTGGCCACAGGCGGNTGGTANAGGC 

ATAGNANACTATTGCCCAGGTGACGTGACTTCACAGATGCTGGGAAGCCTGCTGCCCCAT 

CCMTACMTACTGCCCACTGTGCATAGAMCCAGATTCCAMGTTAGAGCTTCGTTTTG 

GCCATGAGTGCMTTTCACTGCAATGTTTTATCTTACTCMCTGCCAGGGTCAATTTAGG 

TGGTAGGGCTMATCTCCTTCTTTATATTGGTCCMATGATTTTTCCTGATGCTGCATTC 
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, CCGGA 
Sequence 843 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGCTTAGACCAGGAACACAGGGAGGTAGAG 

GGCAGCAGAGCAGGGACTGGCTTCAGAGCCAGACAGGTGGCTATGTGACTTAATGTGTCT 

GAACCCTGGTATCCTAGTCTATTAAATGGTATAACAGCAGCTTCTAGTATGTAAGTTCCT 

TGTCGGGAGMAAACTGTTTTGCTCATGGCTGGAGCCTTAGCATGTTGCATCATATTGAA 

CATGTMTAGATGCTCMTAMTATATTTTTAAGAATAAATAAATGTAAATGAAAATTAC 

TTCACAGTGTTTCTGTAGAGATTTTATAAGATATGGTATACACAATGCATAACATAGGAA 

CTGACGCTCAAAMTGCCAGTTACTTCCATCATTGNGTCATAGGCTTTTATGTTCATTAT 

CCTGCTGCATCATCCCAAAGAA 

Sequence 844 

CCGCGGTGGCGGCCGAGGTACGCGGGGAGGTCATGCCCGTGTGAGCCAGGAAAGGGCTGT 

GTTTATGGGAAGCCAGTAACACTGTGGCCTACTATCTCTTCCGNGGTGCCATCTACATTT 

TTGGGACTCGGGAATTATGAGGTAGAGGTGGAGGCGGAGCCGGATGTCAGAGGTCCTGAA 

NTAGTCACCATGGGGGAAAATGATCCGCCTGCTGTTGAAGCCCCCTTCTCATTCCGATCG 

Cm ll IGGCCTTGATGATTTGAAAATAAGTCCTGTTGCACCAGATGCANATGCTGTTGCT 

GCACAGATCCTGTCACTGCTGCCATTGAA 

Sequence 845 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTMCCCTAAGGGATTCCTACAGCTTTTCT 

GCATGTTAAATAGTCTGTTTTAGCTTATTCTCTTATTACTTGTCTTGGTTTTTACTT^ 

AAGTTTGCTTMTAATCATGGGMTATTTTAGATTTTAAAATACAAAATATACAAGCTAA 

ACTTGAGAGCAGTTTTTAGTTGTAGAMCTGTTTCTTGMGTAATTGACTTAGCGTTTGC 

TCTGCCTCTTTCTTTTCTTACCTAGGTAGGTAGTGGGGACTCCTTCAATTATCTGAGCAA 

TTCAAATCTCAGAATGTAGTGTTGGGTAAATTGAGGGTT 

Sequence 846 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGNGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCA 

Sequence 847 

TGGAGCTCNCCGCGGTGGCGGCCGNGGTACTCCAAGCAGTCCCAAAGTGGGAGTNCTTAA 

AACACCATGGGCAGGTGAATGGCTGACCAGGTGGAGGTGCACAGTGCACCATGACAAGAG 

CAGTGGAAAATGGGTGAATCTGAGATGCCTGGAGGCGAGGGGGAAAGAGCACATCACAGA 

GGACAACGTCCANNGGGACACCCTTTTATA 

Sequence 848 

CCGCGGTGGCGGCTGTGGACTGAAGGGTGACTGGTTCCACTGTGGTCTCCATGGGAACAA 

GTTGTTTCTGGAGTCTTCCAAGGAGAATTTCTCACAGTGGACCTGATCTCTGGGCTGATG 

CTGGGTTCCTTGGAGCTCATGATTTTGAGAGTGGTAGACATTTCTGGGCTTCCTGGGGAT 

GTGCCTGCTGGACTGCTCCCCGTCTCCTCTGCTGGGGCAGGCCACGTGGAATTTCCTTGT 

GCTGCCTGGCTTGACATCTTTA 

Sequence 849 

CCGCGGTGGCGGCCGAGGTACCTGAAGAATCTCTCTTCAGCTCTCTTCTCCTGGAAACTT 

GAGTGGGGCAGGAGGAAAAGCGGAGCTAGGTGTCATTTTAATGAGGAACATACTTGTCTC 

CTCCATTTATCTGGCCCTCCCTGATGGCACTCCAGAATTCCAATCCCACACGATTAACAA 

CATAGTTTCCCTTTTCTGCTTGAAGGTCCATTCTCCTCTCAATTTCAAATCACCTGAGAT 

ACAAAGCTGCATTTCCCCACAAGAACCAGTTCCCTCTCCTTTCCTTCAGTGCTACTGTCC 

TTCTCTCAGACCACCAAGCTTAAAAACTCCAGAGGCTCAAACAGCAAAGATGGCAGCCCG 

CTCCTCCCTCTGGGGAGTTCTGGCCCAGGGAGTTTTCAAATTTCTGTAGGCGGAAGAATA 

CTAGCGGGGAGTGGCTGGAGACCCCAGTTGGTAGGGNTCCACATTTGGGGGAAGTGAGCC 
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, CAAGCTTTTTN 
Sequence 850 

CCGGGCAGGTACATGAAAGTAAGATCACAACCACAGGAACCACACAAAATTCAAGGCACC 

AGAGGAGCCCAGACTTGGCTGGCMTGCCTGTTTTGGAGCTATTCCACATTTCTGGAAGT 

.CAATGGGAATACCGGAATATGAAAACACTATGAGGCCGGGCACAGTGGCTCACGCCTGTA 

ATCCCAGCACTTTGGGAGGCCGAGGCGGGCGGATCATGAGGTCAGGAGTTCGAGACTAGC 

CTGGCCAACATAGTGAAACCCCATCTTTAATAAAAATACAAAAAATTAGCCGGGCGTGGT 

GGGGGGTGCCTGTAATCCCAGCTACTCCGGCGGCTGAGGCAGGAGAATTGCTTGTACCTC 

GGC 

Sequence 851 

CCGGGCAGGTACTTTTTTCTTTTTTC I I I I I I I I I I I I GAGTGGGGGCGGGGTTTCGCCA 

TGTTGGCCAGGCTGGTCTTGAATCTCGGGTGATCTGCCCGCCTCGGCCTCCCAGGGTGCT 

GGGATTGCAGGCGTGAGCCACCACGCCCGGCCTCGATATATTCTTACAGTGGAATACTGC 

TCAGAMTACTGATGAATCTTAAAAAACATGATGTTTAGCAAAAGAACCTTGGTATAAGG 

TTCTTGGTATAAGGGATACATACTCTATGATTCCATTATATGAAATTCTAGAACAGGAAA 

AACTATAGTGAAAAACAATCAGATTAGTGGTATCTGGGGTAGAAAGTAGGAGGAGATTGA 

TT 

Sequence 852 

CNANAGGGCTTTTTGGGGGCAAAACCGCGGNGGCGGCCGCNCNAGAACNAGNGGANCCCT 

NTTGGGGGGGGAAAAAAACCCCAAGCCCACCGANACCGNCGACCNCGAGGGGGGTNCCGG 

NACCCAGNGNNNGNCCCCTAAANAGAGGGNNAANNGCGCGCNNGGCGNAANCANGGNCAN 

AGCNNGNNNCCNGNGNGAAANNGNANCCGCNCACAATTTNTCnTTTKlTAGNCGAGCCGGG 

AGCAGAAAGCCCNAAGAAAAAAGN 

Sequence 853 

AGGTACCCACAGCCCTTTCTTTTGGAATTCCCTAGAAGGGGTCTGTGCCACATACAGGAA 

GTAGGGAGGGTGTCTTTGCAGCATATTTCTTCTCTTGGAGTTAACTGCGAACGTTGCACG 

GCGACCTCTTGATCCATTCTGTGAAAGCCCCAAGCCTGTCATGCAATAAAGACGTCCAGT 

TTCACCGCAGCAGGGAGGCCGCATGAMTATTCCACTTGAACAAAACCACTTAGCAGTTT 

ACATCAATGCTTACCCTGTCGCATTGAAAGTGATGTGAACCCACACCCAAGAGCCCCCAA 

ACCAGCACGTTGATACCMGTTTCCCCAGCTGCATCCAAATCAATTCCTTCTT 

Sequence 854 

CNTTTGGGGGCTAANCCGCGGGGGCAGCANCNNCACCGGCAGCGCGCNATTCTTTGGGGG 

GGTGNAAAACCCCCAGACCCCCGAGCNCNNCCACTCGNCNCACCANTNTAGAGACTGGGG 

CCGNTCCNCNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnn i mm i GGGGTTTTNAGNGAGGGGAAIMCG 

CNCGCAAAAAAANAATCNCGGCCANAGGGGGGCCNGGGNGAAANTGTTANCCGNGCACAA 

NNNCACACAANANACGAGCCGGGAGCAAAAAGAGGAAAAGCCNGG 

Sequence 855 

CCGGGCAGGTACGCGGGCTACACACACAGTTCGGATGCCAAGGGTGACACCCCATTCCCT 

TCACAAGAGGCGGTTCTGTCACAAATCAGCACTCCACCCCCACCACACCTCTCAGTGAAT 

GAAGTGCTGGTGGTCTCACTCCCCTGGTGACCCTTAGCCGTGGGATGGGGTGGTTACACT 

AAGGCTTCAAGCTGAGAATGGCCATCATGGCGGGAGGCTGTTTGCAAAGGCACCTTCTGT 

CATCCTGGGGTTGGCTAAGTCAACTCCACCCCTTCCCAAAAAAAAAAAAAAAAAGTACCT 

Sequence 856 

GCCAGGATTCAAACCAGGGANTTTGCTCCAGCACTCCGGCTCTTAACCTCAACCGTCTGC 

CTCTCCACAAACACCAGGATCAACCACCAAGACCAAAAAAACAGTCTCACAAACCATCAA 

ACATTGCACTTGGTGGCTCAGGACCTTAGCTTCGTCTTAMGGTCCCTGTTATGCTTTTT 

CTTTTTGCCCCAGTGTGGAGTGGTCTTCGTGTTTGTGAGTGCAGGGGTCAGGGGTTGTGT 

CTTTTCTTCTTGTNCCCTTCCAAGAGGTGACATGTATCCTTGATACTGGAAGGGCCCTT 

Sequence 857 
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. AGGTACGCGGGCACTCCAGCCTAGGCAACAGAGCCAGATTCI I II I I I I I I I 1 I I IAAAA 
AGTCTTTTAAMAATTCTTTTATTGTGCTGATTTTATTGTGTCATGAAGTGTAATATCGC 
ATGTAGGTATGTGTCAAGTATACAGAGTGTCAGGCATACGGTGTTCAGTCATAAGCAGTT 
CTGGCCTTTGGCCCTGCACTGTTTGTTGGCTTTTTAGGTAGGAACCTTCTTAGAGTAAGA 
CTGTCATGCTAAMTTGTAGCMTCAMTGTGCCCCCCATACAACTCATTTGAGGTTGAG 
ATTATGTTGCTAGAGTGGAGGAGATTGGAGTGTTCTAAATGCTAACAGTTTGTCTTGCCT 

Sequence 858 

GGCGGCCGCCCGAGAACGAGGGNAGCCTTTGGGNGGGGGGAAAAAACCCAGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGNACCCANCGCAGAGNCCCCAAAAGCGAGG 

GGCAAGNGCGCNCAGGGCGGAAANAGGGCNACAGCCGGGGCCGGAGCNAAACAGGCACCC 

GCCCACANTTNNTTACAACANACNANCCGGGAGCATAAAGCCGCAAANCCAGNGGAGCCA 

AAGGAGGGAGCGAACCCACAG 

Sequence 659 

GGCAGGTACAACCTCCATCCCCTGGGCTCAAGCGATCCTCCCATTTCAGCTCCCCCTGTA 

GCT GGGA CTACAGACACACACCACCGTGGCTGGCTMTTTTTGTATTTTTTGTAGAGGCA 

GGGTTTTGCCATGTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCAAGTGATCTGCCTGCC 

TCGGCCTCCCAAAGTGCTGGCATTACAGGCATGAGCCACCATGCCTGGCTGGGACATTAT 

TCAMTTGMGTGAGGACATGATGTTAAAAAGTTCTGGGCMGTATTTTACAAGTTAAAA 

TACAGATGTAAGAC7TGACTTGATCAAATGCCCAGCTCTGTAATTTCACCTAAATTG 

Sequence 860 

AGGTACMCATTGTGAAMTTTTCCTATCATTTCCTCTAAAACTCTGCAGGCATGGAGGC 

TGTCTGCTGAGATATAGCAGGGMCMTTTMCCCTTTGTT7TGCACCCACAAGATGAGC 

ATTACCAAATTCCCAACAGAGACATGCTGGGGCTGTGCTGTGCCTCCACCX)TGCCCCCTC 

CACAGCCAGCTCCGCGTGTCTCAGTCTGTAACTGATAGCATTCCACTTCCTGCCTGTACC 

TCCCCGGC 

Sequence 861 

CNGCGGCGGCCGAGGNACACNNANAANAACCTTTTAAGGGGNGGAAAAACCCAANCCCCC 

CGNCCNACCNCAGNGACNNGANGNATNTACNGGAAACAGGGCGCAGCCCNAGGAAGGACA 

GNNGAAGNCCNNACNGNGCAAGNCNAAANNNNAAGGAAAANNAGNCCCGCGANGAGNNNC 

CNCANGCTTTNNNTNTGCGGGGACCAGNCAGGGGNCGNCCCCGACANAANCAGGGCNCGC 

CGNCGGNGGNAGAGNCACNNNGCAGGGGNNGGNGAAGCNGCNCCCANCCANGGACCNNGG 

CCGCNCNAGAACNAGNGGANCCCCCGGGCNGCAGGAANNCGANANCAAGCNGANCGANAC 

CGNCGACCNNGAGGGGGGGC 

Sequence 862 

AGGTAC I I I I I I I I I I I I I I I I I I I I I I I GGNAGAGATGGGGCTTTACCATGTTGGCCAG 

GCTGATCTCCAGCTCCTGACCTCAAGTGATCCACTCGCCTTGGCCTCCCAAATTGCTGGG 

ATTACAGGCGTGAGCCACCGCACCCGGCCCAGTCTTCATATTTATAAATAAAGTCTTGTG 

GGMCACAGTCACATTTATTCATTTACATTTTGTAAATAGCTGCCTTCAAGAGGTAGAGT 

TGAMGTCTTTMGACAMGATCAGGTGGCTTGCAAAACCTAAAACGTCTGGTCTTTTAC 

CTMMAGTTTGCCAACCCCAAAACAAGAATATATTCGATTGTAATCTTTACAAAGGTGT 

TGGCTATATTTCAGTGTGT 

Sequence 863 

NCTATAGGGCNAATTGNAGCTCCCCGCGGTGGCGGCCGAGGACAAATTCAGTCCCAATAC 

TCAATACGTATTATAGATGACTATGAGTGCAAACCTTAGGATGNGATTNTCTGAATAATN 

GNTCTTTGTAGGATTTGGTTACATTATTTAMATGAAAAAGATCTAGTTTTAGTGTGAGC 

TCAG TAATGNTAATNGGTTMGTTCATTGCGMTCTTGAGTTTTAGATMGTAGTTATTT 

TTTTCMTATCACTTCTGTTTTTAGTGATATTATATCAAGAAACAACGTATTCAAGAACC 

ATGGCTGACAGTGCCAGATATACTTAGGGATAAACATCAAAATGCAATTATAGTTGCTAT 

AACGTTAGATACTCGGAATCAAAATTTATTTGCANGCTGACTTGATAAACTAAATGAA 
Sequence 864 
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TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGAGTCAGCAG 

AMTGTGTGCTTTAAGCAGAGTCACAGGGGCCTGGGGCTGAACTGAGTCATTTCTCAAAG 

ATATCCCTGCCTGGGATGATGATGGCTCTAATTGAAGCTCTGGCATCATCTGGGGCTTTA 

TGAGCCAAGGGAGATAAGAAGAGCCACAGCAAAACCCTTGGGTCTACAGTGCAGGCTGCA 

ACCMGGCAGCATTTGCTAGAATATTTGTGATTATGTGTTCAACCTACAACCT 

Sequence 865 

CTA CTTAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATATTAT 

ACCI I I I IATTGTTGTTATMTTATTATGGGGTATTTCTAATTAATATGATGTTGAAACC 

TGTTTGGCACCTTCTGGAAGCTACCAAAAAAATGACACTCCATTGAAGTGCTTAAAAGCT 

GTTCTCATAAGAATTCTACTGGCCTATTGTAAAAAAAGAAAAAAAAAAAAGAAAAAGAAG 

MAGACACMAGAAAATMTCTAMCACCAAAMCTAMCACAATTCCAATCCTTTTTCT 

GTACCT 

Sequence 866 

ACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGGAAATCTGGACAGTG 

CTGCAC AGATT GATACATTAGCCTTTGCTTTTTCTCTTTCCGGATAACCTTGTAACATAT 

TGAMCCTTTTAAGGATGCCAAGAATGCATTATTCCACAAAAAAACAGCAGACCAACATA 

TAGAGTGTTT AMAT AGCATTTCTGGGCAAATTCAAACTCTTGTGGTTCTAGGACTCACA 

TCTGTTTCAGTTTTTCCTCAGTTGTATATTGACCAGTGTTCTTTATTGCAAAAACATATA 

CCCGATTTAGCAGTGTCAGCGTATTTTTTCTTCTCATCXTGGAGCGTATTCMGATCTTC 

CCAATACMGAAMTTAATAAAAAATTTATATATAGGCAGCAGCAAAAGAGCCATGTTCA 

AAATAAGTCATTATGGGCTCAAATAGAAAGAAGACTTTTAAGTT 

Sequence 867 

CCGCGGTGGCGGCCGAGGTACATAACATGATATCAAGGAAATGCTTGAAACAAACTTTCA 

CAATAAAGTCAGAAAAAAACTGTAAAAATTGTCTGCAATCCAAGAAAAAGCACGTGCCCT 

GTGTGTAGG GGGA MGAGGGAMGCACTTGCAGTGTGACTTTATGTGGTCTTTCCCCAAG 

TATTGCTACGTTTTGACCTTTGGCCCMCTGMCAGGTGAMTGCCCTTCACATMGTTT 

CAATCCCCAAGAAACTAGCTGGAATGCAGGGGACTGTAGACACACTCCTGGACCAAATGG 

CATCGACTCTCAGAATCCAAAATGGGCCCTGCCCTCATTCTGAGCTTACGGCCCCAAGCA 

TATTCTAAACAAAGC! 1 I 1 1 I AA 

Sequence 868 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGGATATCAC 

CTGMTTATTAATGAATGCCCAGGAAGTMTTTTCTTCTCATTCTTCTAAAACTACTGCC 

TTTCAAAGNGCACACACACCGCGTNCACATACACTGCATTCGTTGCTCCAGTATAAATTA 

CATGCATGAGCACCTTTCTGGCTTTTAAGCCAATATAATGGGCTGCAAAATGAAGACACC 

ANAGTGTATGCATACAAATCTCACTGTATTAAAGATGCAGGTTTTCTAATTGTACCT 

Sequence 869 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGGCGACGCGCGGGACAAAG 

GGAAGCGAAGCCGGAGCTGCGGGCGCTTTTTCTGCCCGCGGTGTCTCAGATTCATTCTTA 

AGGAACTGAGAACTTAATCTTCCAAAATGTCAAAAAGACCATCTTATGCCCCACCTNCCA 

CCCCAGCTCCTGCANCACAAATGCCCAGCACACCANGGTTTGTGGGATACAATCCATACA 

GTCATNTNGCCTACAACAACTACAGGCTGGGAGGGAACCCGGGCACCAACAGCCGGGTCA 

CGGCATCCTCTGGTATCACGATTCCAAAACCCCCAAAGCCACCAGATAAGCCGCTGATGC 

CCTACATGAGGTACCT 

Sequence 870 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACGCGGGGG 
ACCCGGAAGTTTAAATTCCGCTCCCTCCACGAAAGAGTTGTAGTGAGTGAAAATAAATAT 
TAA MCA CACGGAAATGTATTTTCCTGGCTGCAGCACCNGCCATCTTGCCTCGGNAGGAC 
TCATTTTNAAAAACAGCAGCTTCTTGAAGCCCCANAACGCATTCCTGTGCTACGG 
Sequence 871 

CCGGGCACGGTACAGAGCCCAAGACAAAAGATAGGCCTGTGAGGATAACATCTGGTATAT 
CTGACCCTTCCCAGCATGGCCAGGAGGCACAGCCAGGCCAGGGAGGGCATACTGGGTTTG 
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GCTTTGCCCTGCAGCTGTTGGCCTAGGTGCTGCGGTCATACATATGCCCTNAGGCCTTTC 



TCCCCCANGCTGGAGTGCAGNGGCGCCATNTNAGCTNACTTGCAAGCTNCGNCTTCCGGG 
T 

Sequence 872 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCATTCCGGGTCATCCGCA 

GAAATTCCTCATAGATGGTAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACA 

GGGAGTCCACCCCGGTGTGGTGCCTGCTTGNGGACAGACCTGAATGTTGAAACTTGACAG 

TCA GAAAM TMCTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATA 

TTCCTTTTGTTCCTCTGGNAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGT 

AGGTTGGTGATGGTAGAAATTCAGGTACGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTT 

GATGAACTGATGACTCC 

Sequence 873 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCCGCCCGGGCAGGTACAATGCTCACT 

GGGAACCAAAGTCAGGCATGGGGCTGGGCTTTAAGGAGCACAAACAAAAAGGAGGGACTA 

GAAAACTTCAGAAAGGTATTGGTGGGGGATGTTGCGGGGGGACAGGGGACAGCGAGGATG 

TGGGATCCCGAGATCGTCCAAATCCCTATGTGTAGACATATGTGTATAAAGGCCTTTAAG 

AGACTCAGGCTGATGGGGTATCTGTAATAAATCAAACATAATATAACAGCACGTCAAGTG 

ATAAGGGGACTCTGGAAAAACAAGCAGCAAAAGGAGCAGTATCAAACTCCACAGAAATTC 

ACAAACATCAAGACACCAAGAAAGCTGCATTNATTTAAATCAAGGTGACAGGCTGGGCTC 

TGTAGCTCCAGCCTGTAATCCTAGCACTTTGGGAGGC 

Sequence 874 

CTACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 rTTTTTT 

IS I I I I I I I I I I I I I I I I IGGGATGGAGTCTCGCTCTGTCACCCAGGCTGGAGTGCAATG 

GCACAATCTTGGCTCACTGTAACCTACACCTCCCGGGTTAAAGAGATTCTTCTGCCTCAN 

CCTNCT GAGAAGCTGGGACTACCAGGGGATCCCGCCCCACCCCGGGTAGGTTTTTTGTAT 

TTTTTTAGNNAGAAGACAGGGTTTTCCNCCCATATTGGGCCCAGGGCTTAGGTCTCGGAA 

CCTCCTGGACCCTTGNGGATCCTGCCCACCCTTGGGCCTNCCAAAAATGCTGGGGATTAT 

AAGGNGGGGAGCCACTGTGCCCGGGCCAACMTAMTTTTTTTAAGGGTTAGTCAAACCT 

AACAACAAAANTTTAAAAGGTCAATCAGTAGTTCTAAC 1 1 II I 1 I I I I 1 

Sequence 875 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGACTTTGTAA 

ATGTGATTCAGGGCCCCCAGCACCCCTGTGTCTGCAGAGTGCCTTCAAAACTCAGCTGTT 

CCGGCCGGTGCCAACCTGTGAACTTCCCACCATATCCCAGAATCTGCTATTCCCCAAACC 

ACTTCCCAGTTTCCTTTCAGTMTCTTTCTGAAGGAGCCAGGACAATAGGGCCTGTTGTT 

TAGTGMTTTCTTTATT 

TTGTTTCCC I I I I I I rTGCTTCATTTTGTTTAAATTTTCAGGTATTTAGCTCCCCTTTCA 
TATTATTTTTAMTTTTTTAATTACCTGTTGTAGGGGTGTTCCTCCAGAAGCAAAGAGCA 
AAATTTTACTGTTGTGATGTACCT 
Sequence 876 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACGCGGGATTGA 

TCAAMGCTTTGTAACCACAGGAAAAAATAAACTCTTCCATCCCTTAAAGMTAGAATAG 

TTTGTCCCTCTCATGGGMTTGGGCTGTATGTATATTGTTCTTCCTCCTTAGAATTTAGA 

GATACMGAGTTCTACTTAGMCTTTTCATGGACACMTTTCCACAACCTTTCAGATGCT 

GATGTAGAGCTATTGGGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCT 

AGAGATAACTCGGGA 

Sequence 877 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCTTA 
AACTGCTCTGCTCTTTAAACCAAATACATACACACATACACAGATATAGTTAGATACAGA 
TGTGTGTGCATATAAAAATATGACACTCCTTAGTAAAATATTCCTCTAGACCTGGGGTTC 
ACACATCCCTCCTCCTGATCCGTGCTGGTGCCTACTCAGGCACTACTTTGCAGATTTCTC 



CATGGCTACCTACCTAGAACCCAGATTC" 



GAGACGGAGTCTCGCTCTG 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

140/467 



TTCTATGAGCTMGGTTTTTCTGAGCTAAGGTCAAGCGGTGACTTAGCAAGTTGAACGTG 

TMTGMCCAMCCTGTTTTTCCATGGMCCAATMTAATTAATCTAGAGTGAGCCATTT 

GGCCTCCAGAAACAAAGAGATTTCCATCACAGAGTGTTGGTGAGGGGTCATGAGTAAGGC 

GGGGGGGCAGTGAGAGCAAGCTGTTTTATTGTGAGAGTAGCAGGCAGGCTGAATGAGAAG 

GGGTAGCTGTT 

Sequence 878 

CTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGGTACCATTCC 

GGGTCATCCGCAGAAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCG 

AGAAGTTACACAGGGAGTCCACCCCGGTGTGGGTGCCTGTTGGGGACAGACCTGAATGTT 

GMACTTGACAGTCAGAAAAATMCTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGC 

GCATCCTCMTATTCCTTTTGTTCCTCTGGTMTTGGTGGTGCCTGGCTGGGCTTTGTCC 

TGGGAATATGGTAGGTTGGTGATGGTGAAATTCAGGTAGGAAGTGCCTGGGTGCCCCGCG 

TACCTCGGCCGCTCTAGAACTAGTG 

Sequence 879 

CTNCT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI II 1 I I I I 1 1 I I II 1 

TTT7TGGGGAMTMCAGGGGAGAGCAAATTTCTAAAMCTTGGGGTTTTATAGTMTTT 

CTGATTTTCATGTTTAGAAAAAGAMTCACATTAAAMTATGCTTTm 

ATAGGATACACTATAATATTATTGTAGTCCAGAAAATCTGTATACTATAATTCCTAGGGA 

AAAAGAGAAAATTATTAGTGTCAAAATACCTATAATTCCCACAGTTACCATATACATTTT 

TAAAA ATTGT TTAAATACACAAACAATGATGATGCTGTCCTACTAGAAATGACAGGAGCN 

AGAGCTTTTACCTTTCTTTCAMATGCCTTTAACCCCTTTCATTATTNCCMGGTTCAAA 

ATTTAAANATTC I I I I I I I I 

Sequence 880 

CNCTACTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGGCGCAATCT 

CAGCTCACTGCAACCTCCACCTCCTGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAG 

TAGCTGGGATTGCAGGCATGTGCCACCATGCTCGGCTAATTTTTTGTATTTTTAGTAGAA 

ACGGGGTTTCGCCATGTTGGCCAGCTGGTCTCCAACTCCTAACCTCAGGTGATCCACCCG 

CCTCGGCCTCCCAAMTGCATCTCTGGTCTTTAAATGCCCTTTGCTGTATATTCTATAAC 

ATCAAGTCTCAGATCTGGTTTGACCTCAGTTGGCCTCTTMTAGTTTTCCCCTATGAACA 

TTCTGGTCTCCCAGTAAGCCTGTAAGCAGCTGAGACTGGGAAACCATCTCTTATATCCCA 

CATCGTCCCAtG 

Sequence 881 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCACACTGGTAAAGAGTGGCAAGGTAGC 

CTTTGTAACCAGATATATCTGATCTCAAAATCMTTTTCTTAATTTAACCCACGTCAGTC 

AGTCAAATGCTAAGGCTCTTCAAGCTACACTTGGTTCCTCCACCCTCTAAAAGGTGAGAA 

CTCAAGAGAGCTGGGTTCTTTGGGACCTTATCATATTTTTCCCCTCCCTAGGCCTTGATT 

TCCCATTTGGAAAATAAATCAGTGAGGGCTTTCTAGTTAAAAATGCCAGTTGAAGCCAGG 

CTTGGTGGCATATACATGTAGTTCCAGTTACTCAGGAGGCTGAAGTGGGGAGGATCGCTT 

GAGCCCAGGAGTCCAGTCCAGGCAACATTGCAAGATCTCATCTCTAAAAACTAAAAAATG 

GACCAG 

Sequence 882 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAATGAATTGAC 

AAATGGAAGAAAATTTAAGGAGAGTAGCCTAGAGAACTCATTCTAGAAACTAAATAACTT 

MGTCMAMTTATTTCCTATATTGCCTCMGCCCTGCAGATAGCTTTGCTATGTTTGTG 

TATTTGCACATTGCACTCCAGCCTGGGCGACAGAGACTCTGTCTCAAAAAAATAAATGGA 

ACMTCACAACAGAAACATTCCCTTATTCACTGAACATTTCAAACCCTGAAAATGTGTAA 

TGAGAAATGACAMTTTTTAAMGTTTAATTACTAAAGAAGACAAAAAATGTCTATTATG 

AATAGACCAATTCTCAATTGGTAGAGGAACTTTGAACTGGAAAGGAACCCTAAAGAAATC 

TCCTGTCTACCCCTGTTATTACAGATTAGAACCCGAAAGTCCAG 

Sequence 883 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTATAATTATAATGATTTCAGATAGAACATGCA 
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ATTAGCCTTTTGAMTCCMCTTCTGTGCAMATTTTAGTATCAGAAAATACGAGATTTG 

CAGGGGGAMCATCAGTAMCTACCATTMTGTCMTGCCCAGTTTTGACTTTTGTTAGC 

CTGACACTCCCAAACAGTTGTAGAATCCGATAGATGACTGATGGCAAAAGATTGTGAACA 

TGTGGAAGAAAATCAGTGGGATTCGGTGCTGATGAATAGGTTGCCTTCAGAGTA7TATTG 

ACAGAGAGCTTGTGGMCTMTTCTTTATTTTTGATGTTGTGGGAATTAACACATCAATG 

GTGGTTATGGGAACTACCAATGGGTTCCTACAAT 

Sequence 884 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATTCTAAATTATAAG 

AAMTAT ACATT TGCACTTATTMTATAGAAATTCATTTTGTGTATATTTMCATAGCTT 

TTAMCTATTTTACATTAGCTACTTTCATTATGGTTTCTTGMCT^ 

AAATGTATTAAACTTATCAGTMCATAAAMCTTATTTTGTTTCACCTAACGAATACTGC 

GTTTGTAAAMTAAATTTAATATAGAATATATTTTTAAATTAAATATTTGAATATAAAA 

AGCTCTMGAAAGMGCAMTTATCACTGMCATATTTCTTATTATTTCTGGCTTTGAAT 

TAATACGTMCTTAAATTGGCTTAMTGATCCAGMTATTGGAGGAATATGATACTTTCA 

CATAATATACTATGMCCTGTTCATATMCTCTGGATTGGCTACCTAACCTTCTGNTTTA 

ATG 

Sequence 885 

CTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAGTCACACCCAG 

CCAG TCAA TAAC TGAGA AATCAAAATAAAATAATAATTTCAAAGAATTACATAAATACAG 

GGCCTTTTGAGATTTTTGGCMTTGTAAACAAAAACGAATGGTTTTTACMTTCAGTC 

ATTCTACGMTATTTATTTGGCACCCATGTTAGGCACTGAGGCTACACAGCAGTGAAATA 

GGCCTAGTTGTTCTCAACTAGAGAACATAGTTGGTTAATGTAGCTGCACTGAATTGTAAG 

CTGTT TAGM GATMTATACCCTGAGGCTTTTTAAAGTATACTATTACTATAAGGAAGTA 

A MTTA TTTTATACTTATAAATTTTGTTTTGGATTATTCAACTG 

AATTTTATGGGCGGTTGGGGACAAGGAAGAGGTATAATGCTA I mm I CCTTTTCTTT 
TTT 

Sequence 886 

CTACTATAGGGCA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI ITTTTTT 

I I I I I I I I I I I I I I AAATTTCCACCGGTGCCGAGGCCTCAGTGGAGCCTGGCTGGCGGCT 

TGTTAGAGCCTGCAGCCTACCTGTCCTGCATAGGAATGAAGCCGGGAGGAGTTACATGAT 

ATGCCCTCGTTGCAGGCCGGGGACACAGCTACCGCATTCAGAGACCAGGAAACAGAGCAA 

AAGCTGTTCTCANAGTGCGGCTGAGCGAGGAGCTACAGGGGAATGGNGGGGGCCAAGCTG 

CATGGMGATTGTCCCATTMCCTGGCTTTTTACCAGGGTGGTCCTNTCCCTAACCCCTA 

AGAATCACACCCTGCATCCAAACGGCAGCAACCCCAAAA 

Sequence 887 

CNCTACTATA GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI 1 1 I M 1 1 I I I I I 
< < I M M I I I yMAG CTGGGATATCTTACAGAGGAAGGAMMTTMCCTTTTTTACTTT 
CTTTCTCACTTTTTAAATCAGCCAAAGTCAAAGCCCGTTTGCCAACCTGCATGTCCATGC 
CTGTAAGCCCTTCTNTTGGCCAAGGAAGAAAGGAAGAAAGAAAAAAGAAACCCAGGGGCC 
TGTATCCCCTGATTAAACACAGCACAGCACTCCAGGCAGACATGCCCGGNGGCGGCTCCT 
TTGCACCATTGACCTCAGGCCAGACACCTCAGCGCCAACAATGGGACCTCGGCCTTCCGG . 
CTAGGTTTGCCCCAGGCTGGGCAGGAAACCAGCTCGGCCGCTNTAGAACTAGGTG 
Sequence 888 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGACATTGTGGCTAGCCAACCACA 

TGGTCAGCCTCAAAGTTGAGAGGCTCAGTAACCCTCCTATCCCTAGAGAATTCCAAAGTG 

TGGATGTAATTTAACCTAGGAAAGCCATTGGTGACTATCTGTGATCCTCTGGAAAGTATG 

CTATGTTGGGGTATATCTTTGCATCCAAAGCCAGAGGGGAACCACAATGGCCTAGTAAAA 

CCGGTGGGTCCTCAAATGCCCACTTAAGCCTCTGGCCTNTTGGAANTTTGACCCATAGTG 

GGCCGTTCAGCTTGATTAGAGCCGGGGAAAGAAAGAAAATATTGNCA ) II I I I I INTTGA 

AAAAAAATTT 

Sequence 889 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

142/467 



CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGATCTATGCTGC 

TATGGGTGGAGAATCGACATCCTTTGAAACTGGCCACAGGCAGAGCTAAGAGGATGACTA 

AAAGGTCCCTTGGGTGGGTGCTAATGAGCAGGGGCCCAGGAAAACCTCTGTCTTCCCGGA 

GAGCCCTCTTGCATGAGTTTCGGCTTTGCCAAGATTCCAGGGACTTGAGGACAGCTATTG 

AGTTATGGTTACGTGACTGCCACATTGGGGCTTGGAGGCATCTGGCAGATGGTTGGGAAT 

GGGCTGGCACCACACTAATTAGGCCACGATGATCCAGTTTGACTCAGGGAAACCCAGAAG 

TCATAGTNCTCTTTGCAGAATGACACAAGGATGTCAACATGCTTTGNTTGTGTACCTCGG 

CCCGCTCTAGAACTAAGTGGGATCC 

Sequence 890 

CCGC GGTGGCGGCCGAGGTGCATATATATATATACACACACATATATATATTTATGTATC 

TTTTAMCATATMTTACTCTCTTTMTTCATTTGGACTTTTCATCTAMCTTGCTCT 

TTGCACAGGTCTGTTAGGGTAAGATATGTTCCTACCTTGAGAAATGTTGTGAATATCTAG 

CGAAACACCAAACATCCTCAGCTGACTAATGTGGTATCAGACTTTCTGGTTGCAAGGTAG 

GGGTGAATAAGGCAGGATGGGGTGCGGGGGTGGTGCTGGAAGAAGACATGGCATCAGGTT 

GGGTTTTGCAGGATACTGAAATTGTCTAGGGGCCTTGGCTGTGCAAAGAGCCTTTCCGTC 

Sequence 891 

ACTTAGGGCAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACGCGGGATTTCT 

CAGATAGTTATGCGCAGCTCCAGGCACCAGATTCTGTGCTGGGTGCAGGCAGGACCTGGA 

GGGCGTCCTCMGTGTTGATCTGCAGGGACTGTCTTGATCTTTCCAGCAGTGTCATTGTG 

GGCACGTGACCTGAGCTTTCTGAGCCTATTTCCGCATCTGTAAAGTGCTATCCACTTCCA 

CCTCCTGGGCTGTCGTGCAGATGTAGGAAGGAATTGCACTCACACACTCAGCATGAGACA 

GGCGCTCAGTAAAAGCCCGTCCAGGGGATATGAGATCAGTGAGGGATAGGAAAGCAAGGG 

TGGGTAGAAACAGCAAAACCCTTTCCCA 

Sequence 892 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAGCATTG 

GACACACAAAAATACAGGCAGCTTCTTCCCTCAAGGAGGTCACAGGTGGGTGTGTCCATA 

GCAAAGCTGGGAGGAAGTTGTATGAAGGAGCCTGAAGACAATGGGGAGCTAGGGGAAAGT 

TCTGAGTAGAAAGGAACATGTGGACAAAGGTTTGAAATGATGAAGACTGATTAGGAAGTT 

CATATTATGAAGCATAATTCAAGCTTTCTCTACGATGTTCAAATCCCATCTCTCCTACTT 

ACTAGANAGGTGACATTGGGCCMGTTACTTATCTCCTCTGCTCCTGTTTATTTGTGTTC 

AAAAACAGGGACCTCTCTCACAGTGTGATTNTGAAGACTGGACAAGAAAATGGGAGGTTT 
TG 

Sequence 893 

CTACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAGTTTGGAAGTTT 

AGGCAAAAGTCATTTCTTCCCTATATTTTGTCATGCTTATCTCCTGTCTCTTTCTGTTTT 

ACAGATTAGCMTAMCTCCTTMAACCCAMAGGTTTGGGCTTCTGTTCCTTTCACTTG 

CAGTCAGACATGGAGTTAGTGGTAGAAGAAACAGAAGGGGTAACCTGCATGGTGACAGCT 

ACTGAGGGGATGGATAGGAAAGCAGGCTGAGTCCCTGGGGCCAGTGGTTACCAAAGCCAA 

GGAGAGGGCAAGGGGAGCCCAGTGGGCCTGG 

Sequence 894 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGA 

GGGACATCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACACGACACATTCC 

GCTTCTGCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGT 

TCTCCTCCMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATG 

CCTCATTCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGG 

AGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTNTACCTGAATTTCA 

CCATCACCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGGCACCACCAATTACCAG 

AGGAACAAAAGGAATATTGAGGGATGCGCTCAACCAACTNTT 

Sequence 895 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAGTAGTCTAAAACAA 
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■ GTGACTTTACTACTTATTCTTCTGCATGTCCTTACCAGCTTCTTACCTTCTTCAGGTTGA 
GCATGAGATCAGCTTCACAGGGGATGGGGTCCTTAAGGGI 1 1 J I 1 1 CCATACTAGTTTCA 
GCCTTAACMTGAGTTTTCAACCCTTAAACATGAAAAATAAATAGTGCGAAAGAGGGGAG 
GATGGTAGAAATGCTTTAAMTTACCTTTTGTAMTTTTACTTTGTTT 
TGCCTTGCTTATCAGGGAAGTCCTACAAACAAAGAACTCCACGGCTTCTTCAAGTCTTCC 
MGGGAACAGGGTCCCCCTGGTTCCTAAAMTCAATGGGMGTAGGTTTTTGGTAACCAT 
CTACTGGTCMNGGGNAACCATTTCTACCTGGCGGTTTATTACACCTTTGCTAGGCTTCT 
TTTTTCTTTTTCATTTTTAAAAMTAATTTTT 
Sequence 896 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTTCATATCCCAGTTCTA 

GMTCAGTTCA7TTTCTAAGGAGTCCTGGTTCCTTTTA7TGGAAACCAAAATCTGGGCAC 

CAGGTGTGCTCCCATTCTAGTCGTTTTCTGACCACATAACTGCTAACAAAGATGCTTCAC 

TCTGGCTACACTGATGTGMCTTTGAACTTTAGCAGAAGAGCTCAGCTCTAGAGAACAAT 

GAGCTCCTACATTACCTTTTTTCCTCAAAGAATAAGTAAGTCTAAGCAGAAAAAAAATAT 

GCAAAGAATTTTCAGTATGAATGAAATAAGACAAACCATCAGGCTTGCTGTATTGTAAAC 

CAACACAATATAGTTATAACAGATCTGTAGAAGGGATCCTTAGAATAAGAGAGGCATTTG 

TCGGGGGGTCATCAGGGAGAATACTGGATAGNATCTT 

Sequence 897 

TAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA CI I I 1 1 I » I 1 1 1 1 I 1 1 1 1 1 1 1 
TTTTTTTCTTC ATGGCT ACATCTGMCAGCTACTGAGGGATATATATGCCMCTTTGGGA 
GTTGCACAGCTTTTTTGAGGCCATTTTNTMNATGACT^ 

TTGGAATAGCCACAAGTTGTTGTAGCCAAGGTTTGNGNGNGTTTTTAATACAATTCTTAA 
AATTTTAGTAGGCTTCTCATCTGTANATAGATTTGAAGGGGNGGGGTTGCCCCTCCACAC 
CTGTGGGGTGTTTNTCGTAAGGNGGGACCAGAGACTTAGGA 
Sequence 898 

CCGCGGNGGCGGCCNAGGTACACCAAATGGATTACAAGCAGCATCCAGCAGAAGACAGAC 

CCCCCAACCCTGCCCACCAGGGCTCACACTCTACAAAACCCTGAGGGCCTAGAAATCTGT 

AAATGCATCGNCAAGCACTGGGGCTGATTTGCAGTAATTCTCTAAGCAAGGCAAACATGA 

TCTAGCTTTGAAGGCAGCATGAAGGCAGCGGGTTGGNGAGAACAATCTNTCCTTAAGAGA 

AGAAGAAACCTGGGGCGGANGGAGTTTTCCCCGG 

Sequence 899 

AGCTCCCCGCGGTGGCGGCCGAGGTACATGTTANGGTCTTGAGTTAATTGCTCTGTGGCT 

GTGGATTTTTATTTGATGTTCTGATCTCTTCCTTCCAGTTTGATAAATTAGTGTAGAAAG 

TGGAAGAAAAACATGCCGGCGCAGCCTiSTGCGCTTTGTCAGGTTAACAGAATGGAGTCCT 

GCTCTGGCATCAGTCAGTGCTGTTGTCCGAACCCTCTGTGGCTCCTTCCTCCCTCCCTGG 

GGCCCAGAGCTGCAGACGCTAGAGGGGTA 

Sequence 900 

GCTNCACCGCGGTGGCGGCCGCCCGGGCAGGTACCCTAAATGTTAAACTGAGGGATGAGT 

GAAACAATATCAGGATTAATAAATMACACATTCTTGAATTCCATCACTTAATAGAAGTG 

GCCATTTGAATGCTGGCAGGTNGGAAGAAAAGAGGAGGACAAAGAACCCCAAAAGTTTGG 

CATCATAACTACTGCCACAAG 

Sequence 901 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I I I I I II I U I II I 

TGTATTTTTAGTAGAGATGGGGTTTCACCGTGTTAGCCAGGATGGTCTCGATCTCCTGAC 

CTTGTGATCTGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCG 

CCTGGCCTTTCCAGGGTATTCTTTTAACGTGGTCTTATTTGCCTTTTTGMTT^ 

ATCTATCAGCATCATATACCACCACTGGAATATAAATTTGAAAGAGAGTCCTGCAGATTA 

TATACATGAATCTACTTAGGCCTAATAACCAAGCAGTCCTCAGTGGCAGATCAATGAAAA 

GTGAAACTAAAGGCAAGTGAAGGGTAGGAGAGATTGGCCAGT 

Sequence 902 

GGCAGGTACCCACCTCCTCGGTCACCCACAGAGCCACCAAAGATTCCATGTCCCAGAGCT 
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• TCCCATAGCAGACCTGAAAAGTCCATGACCTGAGCTTTGGCCATGGTAGTGGAGTGGAAC 
AGGAAATAGTCCAGCA GAGGA GTGTGGGGGAAGGGGGCAGGAGAGGCACAAGAATAAGGG 
AGACCTGGACTCTGGCTTTTTGGGAAAAGGAACCAAGCTCATAGCAATTTGGCTGATNAC 
ACAATCAGA I I I I I CCAGGTTAAGCTTCCTTTCTG 
Sequence 903 

CCGCGGTGGCGGCCGAGGTACCATCTACTGMTGCCCAGTTTTGATCTATTTCTAAATGG 

AGCAAACCAATTCCATCTCCTAGAGCTGGAGACTGTATCCAGGCAGTGTGTGGACAGAAC 

GGACAATCTTTTCTGCCAAGGGCCTATTTGAGTGGAGCACCCCCACACGGGTTAGACGGG 

TCGGCACGGGGCTGGTGGGTGAGGAACTCAGGGGTCAGTGCAAGCTGCAGACCCTCATTT 

GGGGAACGCTCTCAGCACAATGCTCTTACAACTACAGGGTGCACTCCAAAATGGAGTTCA 

AGGAAAAAAGGCTAATGAGAAATAAAATCTGAAAAAATAACTTAAAAAGTTTTGCT 

Sequence 904 

CCGGGCAGGTACGCGGGGGCCCTTTGGATACCTGCACTCCCCATCACCGCACTCCCCATC 

GTGGCACTTCCCTTGTTGCAGTTTTATGGAGTGTGCGTCTGGCTCCCCAACTAGACTTGA 

ACCGCTTGGGTGCATAACTCGGGACTTGACCATTTGCGTCTCCCTACGGCCAGCTCAGCC 

TCCGCACACAGGGACCTGCAGAGAGTGGATGTAGCCACTGCCCCAGCGTCCCTGGGCTCT 

GAAGAGMGCCATTGCCCTTCAAGAGCCACCCTCATTTCCTGGGCACTGGTTTGGAAAAA 

ACGAAGAAAAAGAGACACCCAGCTCACCTCCA 

Sequence 905 

CTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACGCNGGGCAACTCATTCATGATATTGGG 

AGAAAAGCAAAGCAAAAACTGCAACAAAATCTCAAACCCTTTCTGCAGCAGCAGATGGCA 

MCAGTGATCAGAGGAGAAGGACCCTTCCAGCATTAGAAGATTTCCAAAGGCTGTTCCAG 

TAGGGGCTGTGGGCTTCTGGGAGCCCAGATGCCCCCTGATGGTATATTTGAGTTTGTGAG 

GTGGAGGCCAGGTGGCAAGANACTGCNNGCCAATGTCAATGAAAAGCCTGGGAGGAAAAA 

GAGATTTCTGGGA 

Sequence 906 

AGGTACTT TGCTA ACCAGCATTTTGGCTGTGTTATTGGCAGTTTTCAAATTTGAA 
TTGC CATC < I I I I GAGAGTGCATAGACATTTMTTTTMGAMTTTATAGAATTGGACTT 
TTTTGTCTCTATACATTT GTAG GTCAGATGCACATTTGT^ 

AGAGCAAMAT GTAM GTTTTGTATGTAGAGGATAATTGTATGATGATGATAAACTAATT 
AGGTATTACAGTTTTCTMCGACAGAMTTTGTMTMTTAGGTAACTTGGTTCATATTA 
AAATATTTGATACATAGGCCGGGCATGGTGACTCATGCCTGTAATCCCAGCACTTTGAGA 
GGC 

Sequence 907 

GGCAGGTACCACACCCATCTTACCCTCTTCCCTCTAGGTTCTGACATTCAGCTATCTTGG 
TGGGAGGCTGGGGAGCACTATTGGGGATGAGGGTAAGGTGGAGTTTTATAAAGCTCTCCA 
G GTGACTCAGAGACCACCCTC ATTCCACCTGGTCACAAATCCCTGAATGGGAAACAGGTA 
Ci i I i i 11 I M l I I l l l l I i l l 1 1 GCAAAGAATTGTAAAATTTATTGTATAAGTATTGCA 

GCTTTTCANAATGTCATCATTGCCACTMTGATTACTGATACACMCAAGCAGTTTCTTC 
AGG CCTGTG G ATTGG CATC 
Sequence 908 

AGGTACTTCCCTGAGCAGTCGAAGTGGATGCCCAGACCAATGGCCAGTGCTAATATCAAT 

GCAATGATCCCAATGACGATGATTGGAi-tAAAACTTCAATGGCAGCAGTGACAGGATCTGT 

GCAGCAACAGCATCTGCATCTGGTGCAACAGGACTTATTTTCAAATCATCAAGGCCAAAA 

AGCGATCGGAATGAGAAGGGGGCTTCAACAGCAGGCGGATCATTTTCCCCCATGGTGACT 

ATTTCAGGACCTCTGACATCCGGCTCCGCCTCCACCTCTACCTCATAATTCCCGAGTCCC 

AAAAATGTAGATGGCACCACGGAAGAGATAGTAGGCCACAGTGTTACTGGCTTCCCATA 

Sequence 909 

AGGTACAGCAAATTAAACCCAATAACAGGAGGAGGAAAACACCTAATTAATATAAAAATT 
TCAAGGATATGTTAAACAAACAAACAATTACAAGAAACTCCTGTCAAGTTAAACAGAGAG 
AGAGAGAGACAGGGAGAACCACTAACAGGAAAGAGAAAAAGATATACAAGCTATCCCGCA 
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GAAATTAMAAGAGACTAAGAATATTACAMCAACTTTATTTCATTTAGATGAAATGGAC 

AMTTTATTTTAAAACACAATTCGCCAAAATTGACAGAAGTGGMGTAGAAMTCTATTT 

CTGTATTTATTAAAGATACTGAATCTATAATTAAAAATATCTTCACACAGAAAATTACAG 

TTTCAAATG 

Sequence 910 

AGGTACGATCTGAAGAMTGAMGGCATTGAACTTTGGTGGGTAMTTGGGTCTTTTCCA 

GCAAAGGTATAAATCCTTAAAAGCCAAGATCATATTGTTTGATTTCTCTGGGCTCTCTGC 

TGGATACAGTGCCAAGTCCATAACTGTATACCCCATGGACACTCTATGTTAAATGGAGAT 

TAATGTGTAAG AGGTG I I I II I I I I I I GTTTTGTTTTG I I I I I I AATTTGGAAAAGAAGC 

TTAAAGACCACAATGGGTGTGGCATTGGCTCGACCCACAGATCTGCTTAGTCTCAGACAG 

GCACTTTGAACCAGTCTTTTAAAATTGCGTCACAACAAC 

Sequence 91 1 

AGGTACAGATCACTATGGCTTGTCTTTTCTCCTAACTAATGTAAAATTCCCAATAATTCA 
TMCT T GTATGAGGACMC AGTTGTGTGMTCTACCCTGGTCCTTCTGATNATTTTTAAT 
TTTTTNAI II II I I I I 1 1 I IGGGGACAGAGTCGTGCTGTTATCGCCCGGGCTGGAGTGCA 
GTGGCATGATCTCGGCTCACTGCAACCTCCACCTCCCAGGTTCCAGCAACTCTCCTGCCT 
CAGC TTCCCGAGTAGCTGGAATTACTGGTGCCCACTACCACACCCGGCTAAI I 1 1 I I GTA 
TTTTTAGTAGAGATGGGGTTTCACCATGTTGGCCAGGCTGGTCTTGGACTC 
Sequence 912 

AGGTACAAATTGTCGTTITTATTCCTCT^ 

TTTTTATTGTTGTTGTTTGATCCCTMCCCTACAAAGAGCCTTCCTATTCCCCTCGCTGT 

TGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTGACTTCTTGCTTCAGT 

CATCAGCCAGCAAGAGGGAACAMACTGTTCTTTTGCATTTTGCCGCTGAGATATGGCAT 

TGCACTGCTTATA 

Sequence 913 

TGGCCAGNTCAAATNACAACCCCCCAACCCCCCCCCCCCCCCCCCACAAACAGACAAGGA 

CACAGNTCACCANACAATGGATGTNCAGGNANTNGATATCAGCAGATATNTTAGNCCTNT 

AGATAGGCTAATTTNANTNAGCAAAGGAAAGAGGAGGTANCATTAGNCAGATGGGNTATT 

NACCTCTGAATTAGATGGCACTTACCCANCTTCTGGNACAGNCCTGCTGGNGGCGTCTAG 

ACTAGTGATCCCGGCTGANGATCGATTAACTATCATCCGCGACCTCAGGGGGGGCCGGAC 

CCACTTTTGTCCTTA 

Sequence 914 

CGAGGTACGCGGGACACTGGTGGGGGAGAGTCCGACGCGCCTGGCTAGGAGCGCCGACCG 

CAGGGCCTCTACGGACCTTACTAGAAAAATGAAACCTGATGAAACTCCTATGTTTGACCC 

AAGTCTACTCMAGAAGTGGACTGGAGTCAGMTACAGCTACATTTTCTCCAGCCATTTC 

CCCAACACATCCTGGAGMGGCTTGGTTTTGAGGCTTCATGCGAGAAAGGGGAATGGGGA 

ATGGCTGCTTAACGGCATGTC I I I 1 1 I I I I I I I GAGACGGAGTCTTGCTCTGT 

Sequence 915 

CGCCCGGGCAGGTACGCGGGGACTTGACTTAAACTCTGGGGCCCGGGAGGCCGCCGGTTT 
TCTCCCCGCTTGCCGGGGTGGTCCTCTTCCCTTTGTCGGACCAAAGAAGTAAACACTGTG 
TGGAGAGGGACTGCGTGTTTGGAGGGAAATGGGAATGTACCT 
Sequence 91 6 

CCCCGCGTCCGCTCTCTGTCGGGGTCCCCTCCATCTCGCTGCTGCTGAAGGCCGCGAGGG 

CGGCGGCGATGGCGGAGGCGGCGCTGTTGCTGCTGCCTGAGGCGGCGGCGGAGCGGGACG 

CTAGGGAAAAGCTGGCTCTCTGGGATCGGAGACCGGACACGACGGCGCCGCTGACCGACA 

GGCAGACGGACTCGGTATTGGAGCTGAAGGCGGCGGCAGAGAACTTGCCGGTGCCAGCTG 

AGCTTCCMTTGAAGACTTGTGCAGTTTAACATCCCAGTCACTGCCCATTGAACTGACTT 

CAGTAGTGCCTGAATCTACAGAAGACATTCTCTTGAAGGGCTTCACTTCCTTAGGAATGG 

MGMGMAGMTTGAAACCGCACAGCAGTTTTTTCTCATGGTTTGCAAAGCTGCAAACT 

CAGATGGATCAAGATGAAGGAACTAAATATAGGAGCAGTGTGATGCTATATTGAATGATG 

TAAACAGTGCTCTTCAGCATCTGGAGTC 
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Sequence 917 

GGCTGTGGCCAAGAAACGCAGGGACCGCTCTCTCCCCCGGGCTTTCGAAATCTTCACAGA 

CAATAAAACCTATGTCTTTAAGGCCAAGGATGAGAAGAATGCAGAAGAATGGCTCCAGTG 

CATCAACGTGGCAGTTGCCCAAGCCAAAGAAAGGGAAAGTAGAGAAGTAACCACATATCT 

GTAGGGAATTTATMGTCAGCCATGACMTTATACACCACAGGCATTGTATTATCATTGC 

CAATGTCAAGAAAMGAGCTAMTTTACCMGCCATGGTTGGNTTTTTACTAAATACCAT 

GGGMTTTGTTGGTCCTTTMGAAGAAGGGCCTTAAAATGGCAGGGATTTCTTAGTNAAA 

TGNCMTACTCTMCAGCTTTAGTATTGACTTTAGAATATATCTGATGCCCACAAAAATT 

AAATAAAAGGGNTTNGAGGAGGTTTGCCCNAAATAAGTGNGGGGCCCGGAGGGGAA 

Sequence 91 8 

AGTCNCCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTACGGGGGTAGCT 

TTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACCATTTC 

CTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCATTGGCA 

GTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTGAACAT 

GTCCCAGACAGTMCTTTGGTGTAMTGCTTCCTGTTTTTCTGCCACAAGCCCTTTGGTC 

TTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCAGCCTG 

ACACATATGGGACTGCTCGACAGGTCCACTGTCCCACGAGCAGAAGCTGGTCACAAAGCT 

TGGGAAAT 

Sequence 919 

GGG AGTCG ACCACGCGTCCGCGGACGCGTGGGCGAGTGCCCAGTGACCCTTTCACGGGGG 

TAGCTTTTACTCCGCACTCTCAGCCCCTGCCTCACCCCTCCCTCAAGGCCCGGATTGACC 

ATTTCCTGCTCCAGCACTCCATCCCTGGCTGCCACCTGCTTGGGAGAGCACAGACGGCAT 

TGGCAGTGATCCCTTCTTCCATTGTTCTGCCCTCTCAGAAAAGGAAGATAGAGCAGGCTG 

AACATGTCCCAGACAGTAACTTTGGTGTAMTGCTTCCTGTTTTTCTGCCACAAGCCCTT 

TGGTCTTACCCACTACCTCAGAGCACACTGCTAAGAAAATGAAAGCCACCAATGAGCCCA 

GCCTGACACATATGGACTGCTCGACAGGTNCACTGTCCCACGAGCAGAAGCTTGTCACAA 

AGCTTGGAA 

Sequence 920 

AGTCGCCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCG 

TGGGGGATGGATGACAGTCCACCAGAAAAAAGTTAGTGGAGCGGGGACAAGCAGGGTTGC 

AGAGTGGAAGAAAAATGTTCCTGTGAGAAGAAACTGTCCAAAGAGTNTGAAGAGAAAAGG 

GAACAGGGTGAATTTGANGCCCTACAAGAAAAACAGGAGACCATTCAACAGGAGACGCCC 

AGGGAGCAGGTGGCTTTGTGGGCCTGATGTCCAAGAAAGAAGTNCTGGTGGTAAACAGAG 

ACTTGTGGATTGCAAGCTACTGTTGTCTTTCTATTGAAA 

Sequence 921 

TCGGAGTCGCCACGCGTCCGGCCAGGCGTGCCTGGAAATCCGCT7TCGCAGCGCCCCCTC 

GTAGCCCGCCTCCGCCCGCAGAAGGCGTTCCCTGGACAGAGAAGCGGGCGCGCGGGGGCG 

GGCGCGTGGGGCCTTGCCG GAGA ACCTGACTCTCCGCAGCAGCAGTGGAAGCCAGAGTGA 

CGCGTTGTGTTGMCACCAGTTTTCTGGAGCGCTGTGTGTTCTCAACAGCTGAGCAGTCT 

GTTTCTCCAATCAGGTTTCAAAGCCACTTCAACTGCACTGGCCCCTGTGGGTCACTGCTG 

CACCGCCCTGGCCCATGTGGGTCCCTGAGGAGCGACCTGCCGGGGCCACCTGGCTGGACG 

AAAAAGACACACTTTGGGACTTAAGCCGTGAGAAAAAAACTTCATCAGTAAGAAACAAGT 

CAATAGACAAGTAAAAGACTAGGAGAAAATATGCATAAAACATAAAAAGTGACTTGGATT 

CCTGATCTTGGAGTATTTAMGAATTCCTATAACTTANAAAAGGTTTCMGTTTTTTNAA 

ATGAGCAAAANGG7TTGGGTAAA 

Sequence 922 

TCCGCAGGCTGGGGGATCCCAAAGGGTGCGCTCCAGCCCCCAACCCAGGCACTGGGACTC 

TGGTGGCACCCTGGGTGGCAGGCAAGCCTTGAAATCAAGTGCACGAGCCTTGGAAAGGAG 

GACCGGGAGAGTTATGGCATTTATGAATGAAGAAGAGAAAGAGAATCACTCGGATGGGAA 

AAGTTAACTGGATTGTTCCCACCTGCATGGATCACCCGGGTAACTGCAGTGGGACCGAGG 

GGGCGAGGCTGCGGGCTGGGGGATGTGCCGGGTTTCCTTGTGTTGCCACGAACCCAGAGA 
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. GGGAGAGGAAGAAGATGGAAAGAAAAAGGAAAAAGGGAAGGAAAGTAAGAGGGGAGAGAG 
GGGGAAAACTTGAGGATGAAAGAAGAGACAGAAGAAAGAAGAAGACCTTGAGAGAGGGGA 
GGAGAAAGGAAAGGAAGCNGGAAGGAGGAATTGGAAAGTGAAGGAAAGGGGGGAAACCAG 
GCNGAGAAAGAAAAGAGAAAGGGGGAAGGGAAAGAAGAAAGGGGAAAAGNAAGGGGGGGN 
GGTTGAAAATCAACNCGAAAAAAAGAGG 
Sequence 923 

CNCGCGTCCGGCTGTGATGAATGAGGTCTAGGAAATAATTTGCATGTGTCTTGGGGGACA 

CAACAGTAACNGAGAGGAAATACATTATTACAGCAACTTGCGACGTACTAATACCTGTCA 

GTGTTGGCCCCCGTAAGGTATGTAAGGCACCTGNGANGTGCCCAGTNAGTNCCTTGGTGN 

AAGGCCAACATGTACTAGTTATGTAAGTATTGGTGTCTGCTTTAAAAAAGGAGACCCAGA 

CTTCACCTGTCCTCTTTAMCATTTGAGMCAGTGTTACTCTGAGCAGTTGGGCCACCTT 

CACCTTATCCGACAGCTGACTGTTGGATGTGTCCATTGTCGCCAGTTTGGCTGTTGCCCG 

GACAGGACAGGACCTCCATTGGGCGCAGCAGCAGGTGGCAGGGGGTGTGGCTTGAGGGTG 

GGTGGCAAGCGT 

Sequence 924 

CCCCGCGTCCGCACAGATCCTTGAGCTCCGCTGCAGGATAGTACAGTTTTACCGCAGAGG 

GAATCTGGAACAGTGGAATCAT GTGT CTGCCCTGTGTATTGCAGTTTGTATTGCCACAAG 

CTATATTTATAC CAGT GTCACCCTTTTCTTGTAGAATATACTAATAAATCTGTGCCAACT 

C TACC TTCTCACTTTTACCTCTGACGTCATTCTTTTTTTCTGAAAGAGGTMTAATTCT 

GTTTTGATAGACTCTGAGGATTATGTGMCAGGACATTTTTC 

ATACTGTCMGGTACTTGCT TGTGT CTGAACTCTAGTGCACTTATGATTTTGTAGACCCA 
TGTGAAATTTAATAAGATACG I I I 1 1 I I I CCTTTCTTTGGTGTGGTAGTGCAGCAACAGT 
TTGGTCTGCATTTGTTAGAAGTTTAACTCCTAACA^.CCCAAAGACCTATTTA 
Sequence 925 

GCGTCCGACCCCAAAGGGAGGGACCACATTGCACACACTGTAAGAAATGCACTTTCCGAG 

GAAGGGGATGGGGGAGCCCGGACACCCAGAGCTCCCCGAGTTGGGGGTGCCCGTCTGGAG 

CGCCCCCGTCAGCCCCTGGCGGTGGGAGGTGAGAGCGAGTGGTTTAAGTGCCTGATTACC 

ACCACCCGCCCCCCCCTTTGTCCAGCTGGGACACGGAATGGCCGCGGGCCTCCTCCCCCT 

CCCCTCCAGCCTCTCCACCAGCCCCTCCAGTCAACCCTCATCGCCGTGCCCCCCCAGAGC 

TAGAGAGATGGGGCCCCTGCGTGGCCCGAGGGGCAGAGCTGGGCGTCACTTCGCAAGCGT 

CCTGCCCTGCCGGGGCGCGGGGGTGGGCTCTGGGGAAGCCGGTGCGCCCCCCACGCCTNC 

GCTGCCAGTGCCTTACATTCTGGAGCGACCCCCCTCCCTGGTGCCTCCCAGCGAAGGGGG 
ACCCGC 

Sequence 926 

AGACAGCTCAAGCCTTGCCACTTCGGGCTTCTCACTGCAGCTGGGCTTGGACTTCGGAGT 

TTTGCCATTGCCAGTGGGACGTCTGAGACTTTCTCCTTCAAGTACTTGGCAGATCACTCT 

CTTAGCAGGTAGGTGCCGCAGACCCTGCGGGTTAAGAGGTGGGGTGGGGGGCAGTGCTTG 

CCAAGGCCCTAAACTGGGAGCGCTGGGTGAGGGGAACAACCCACTTTGGAGGGTTCTCTG 

AGAGATAGATACACCCCATATCCTGGGCCCAGCTCGTGCACACAGCTGGAGGTCCAGAGA 

CCCAGTCCCCTCTGCTCCGTCAGCCAAGTTCCAAGAAGTTGAGCAGAGACCCTTCTGGGA 

GCCTGGCGGGGTGCAGCGGCCTCCCCTGCGGGGCCTGTCACCCGGCCGGGCGCGTGCAAA 

CGCCTCTGGCGCCTNTNTGCGCGGGAGGGGAGATAAGCGTCTGAGCCAGGGAAAGCCGCC 

GGGCTAAAACCCGCCTTTTCCGGGGGCCCC 

Sequence 927 

CGCGTCCGGTCATATACMTGTCATTGTTTGGGACCCGTTTCTAAATACATCTGCTGCCT 

ACATTCCTGCTCACACATACGCTTGCAGCTTTGAGGCAGGAGAGGGTAGTTGTGCTTCCC 

TAGGMGAGTGTCTTCCMAGTGTTCTTCACTCT7TTTGCCCTGCTTGGTTTCTTCATTT 

GTTTCTTT GGAC ACAGATTCTGGAAAACAGAATTATTCTTCATAGGCTTTATCATCATGG 

GATTCTTC7TTTATATACTGATTACAAGACTGACACCTATCAAAGTATGATGTGAATCTG 

ATTCTGACAGCTGTCACTGGAAGCGTCNGTGGAATGTTCTTGGTAGCTGTGTGGTGGCCG 

ATTTGGAATCCTCTCGATCTGCATGCTCTGTGTTGGACTAGTGCTGGGGGTCCTCATCTC 
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GGTCANGTGACTTTCTTTACTCCACTGGGAMCCTAMGMTTTTTTCATGATGATTGGG 
TGTATTCTGGGTCACTTTCTCTTGCCATAAGCTATNCTCATTCCAGTAGTTT 
Sequence 928 

CCACGCGTCCGGACGCTGCGTGGAAGCGGCGGAGCCGGAGGGAAGCAAAGGACCGTCTGC 

GCTGCTGTCCCCGCCCCGCGCGCTCTGCGCCCCTCGTCCCTGGCGGTCGCTCCGAAGCTC 

AGCCCTCTTGCCTGCCCCGGAGCTGTCCCGGGCTAGCCGAGAAGAGAGCGGCCGGCAAGT 

TTGGGCGCGCGCAGGCGGCGGGCCGCGGGCACTGGGCGCCTCGCTGGGGCGGGGGGAGGT 

GGCTACCGCTCCCGGCTTGGCGTCCCGCGCGCACTTCGGCGATGGC I 1 1 1 I CCGCCGCGG 

CGACGGCTGCGCCTCGGTCCCCGCGGCCTCCCGCTTCTTCTCTCGGGACTCCTGCTACCT 

NTGTGCCGCGCCTTCAACCTAAGACGTGGACAGTCCTGCCCGAGTACTCTGGCCCCGAGG 

GGAAGTTA 

Sequence 929 

CGACGGCCANGGCGCCTCCGAGTTQCCCGCCAGGACTCGGAGGGCCAGGAGGGCGCGACC 

TGGGTGGATATTTTTGTTGGACGGCGCAACTCTTGGGGTGGCCCGGGAGCGGCGGAAACC 

GAGCGAGAGAACCAGGAGGCGCTGCGCAGAAGGAGGCCCGGGGGCTCCGAGGCGTTGAGG 

GGCTCGATCTGCGTTCTGGGGTTGGCAGCCGAGAGGCCGCGGTCCCTGAGTGCCAGAGGT 

GGTGGTGTTGCTTATCTTCTGGAACCCCATGCAGCCAGATCCCAGGCCTAGCGGGGCTGG 

GGCCTGCTGCCGATTCCTGCCCCTGCAGTCACAGTGCCCTGAGGGGGCAGGGGACGCGGT 

GATGTACGCCTCCACTGAGTGCAAGGCGGAGGTGACGCCCTCCCAGCATGGCAACCGCAC 

CTTCAGCTACACCCTGNAGGGATCATACCAAGCAGGCCTTTGG 

Sequence 930 

CGTCCGCTTTNAGACCGGMGACATTTAAAAGCCAGTTTACGTACANGMGCATGGTTTT 

AGATTAACTGCCTGTTGGTACAGCTAGAMCATTGCAGCCCTATCGCTTATTTATCTTGC 

ATGTTGCTCTGCTTTGCTATGAAAMTATCGT7TTATGATAAMCTTGTTGMTTTTGAT 

ATGTATTCGGTTATACTCTTAGGGAAAATAATAGAAATTAGAGTGAGAGAAAGTGCTATG 

TATATTAGGCTITTCAGATTTTATAGATATAGGCTTMGGGAGGGTGGAGGTTCT^^ 

MGTTGAATGACTACTTAMTTTGTTGATGTGMTTTMGTTTTAMGATTATTATTMT 

TAACTCTTCTCTTTGTC TTTGC ATTTACCTTCCCAGATGTTCCAGCCTATCATTTTACTT 

ATTCTCATTCTTGTATTATTTTCATCACTTTCTTACACAACMTATTTAAACTTGNCTTC 

CTTTTTACACTGGTTTTTGGTAC 

Sequence 931 

CACGCGTCCGTGGAGTATGTGCCATCTGCCAAAGTGGAGGTGGTGGAGGAGCGCCAGGCC 

ATCCCTCTAGACGAGAACGAGGGCATCTATGTGCAGGATGTCAAGACCGGAAAGGTGCGC 

GCTGTGATTGGAAGCACCTACATGCTGACCCAGGACGAAGTCCTGTGGGAGAAAGAGCTG 

CCTCCCGGGGTGGAGGAGCTGCTGAACAAGGGGCAGGACCCTCTGGCAGACAGGGGTGAG 

AAGGACACAGCTAAGAGCCTCCAGCCCTTGGCGCCCCGGAACAAGACCCGTGTGGTCAAG 

CTACCCGCGTGCCCCAC 

Sequence 932 

GGTTCGCCCACGCGTCCGCCCTGCTACCCTGGGAGAAGCCTCAGCTTTCTGGGCAGAGTT 

TGTCTCCCTGTCATTTATACTCTCAGGCTTTATACATTTACACAGTAAGTTCTCCCTCCT 

GGAGGGTTAAAAGGAATAATTTCAACAGGGTGAAGGCCTGGCACGGTGGCTCACAACTGT 

MTCCAAGGACTTTGGGAGGCTiGAGGTGGGTGGATCACCTGAGGTCAGGAATTTGAGACC 

AGCCTGGCCAACTTGGTGAAACCCTGTCTCTACTAAAAACAAAAATTAGCCAGGTGAGGT 

GGCACACACCTATAGCCCCAGCTACTGGGGGAGGCTGAGGCAGGAGAATTGCTTGAACCT 

GGGAGGCAGAGGTTACAGTGAGCTGAGATGGCACCACTGCACTCCAGCCTAGGTGACAAA 

GCAGCAAGACGCATTCTNAAAACAAAACANCAACAACAACAAAAAACGGGAAAACA 

Sequence 933 

CNCGCGTCCGGTCCAGTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTG 
GTTCTTTCGGCCGTTCACGTCATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGT 
GGCCTCTCGTGGCTGGCGGCAGGTGGGGTGATGGTGCTGGCCTCGGCGCTGCTGTGTGTG 
ATTGTGTCTGTTCTGACCAACGTGCTCGTGGGTGGAAACACCCCAAGGAAGAACCCCATG 
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CATCCCAGCTCAAGGTGGTCAGAGCTAGACCTTCTTATTCTGTTGGGGACGGCGGGCCAC 
GTCTTGAGCCTGGGCGCCAGCAGCTTCGTGGAGGAGGAGCACCAGACCTGGTACTTCCTT 
GTGAACACCCTGTGTCTAGCTCTGAGCCAAGAAACCTACAGAAACTACTTT 
Sequence 934 

TCGCCCCGCGTCCGGTTATTTTACCCAGAAGCCGGATAGAGAAAATATTACAGAGAAAAT 

CACATATCACATGGGCTCGAAAGATGTAGAGGTTTTTGACAAATGAAGAACAACCATAAC 

AGGTAGAGGGAACACCATGAACCAGGGCATGAAACTGAAAGTGCATAACATATTCTAGAG 

AGAGAAGGGTGTGGGCATGAGTTAGGGCTGGAAAAACAGGTTGGAAACAGATAAGTAAGG 

GTCTCAAATGCMTGTCAMGAGCTTGCAGTTTATTTTCCAGGCAATGAGTAGGCAGCCA 

AAAAAAAAAAGTAAGGATG I I I I I I I I I I I I I I CCCATGGCATCATATTTAAGAGGATGG 

ATTTAMTTGTGTGAGACCAAAGCATAGAGACTAGATAAGAGGGCGATCATTATTTCAAA 

AAGAAATAATGAAGATCCAATGAAGGAAGTGGGAAATTAAAATAGGGGAAGAGAGGTA 

Sequence 935 

CCGTCCGGTrTTTTGTCTCAGAGTTCTTCAGGCTGTACAGGAAATGTGGTGCCGGCATCT 

GCTTCTGACGGAGTCTCACTCTGTCGCTCAGGCTGGAGTGCAGTGGCATGATCTCGGCTC 

ACTGCAACGTCCGCCTCCTGGGTTCAAGCCATTCTTCTGCCTCAGCCTCCCGAGTAGGTG 

GGACTACAGTGGCCATGTGTCTGAGATCTAACCAAGGGAACATGGGTGGAACTGATGTAA 

GCCACTTTGACACCACMAACCTCCCATGGGTTCTCTCTCTTCTTCTCTGGTGTACTTGT 

TGGATGGAGAAGATGCTGAGAAATAGTGGGAAGTCCTAGGGGATGGAAGAACCCAGGATT 

CTGMTACTCCATTGGACCTTACGTTTTGGAATCAGGNATGATGCTGGCCTTCATAAAAT 

GAGTTATGGAGAMGTCCCTCTTTTTCTGGTGTTTGGAACANGTTTTCAGAAANGMTTT 

GNTACCCAGCTTCCNTCTTTTGTACC 

Sequence 936 

CCGGTGAGCGCCCCGCCCGCTCAGCCGCCAGATCAACCTTAGCGCTGGGGCGCGGGCTGG 

GGTCGCCAGGCGGTGCGTTCTGCCCGCGCGGGGCTGAGAGTTAGGGGCCGGGGCCGGATC 

CGGGGCCGGGGGTCGCGCCGCTAGCCGCCAGCAGCGCAGTCCGGGCCGCCACCCTGCACC 

CTCCGCCCTGTTTCTGCACCCGTCTGGGTTCTTGTGCCGCCGCCCGCAAGCCTTCCCGAG 

CTCAGGGTGGTGAGCTGCGGAGACCCGTGATAATTCGTTAACTAATTCAACAAACGGGAC 

CCTTCTGTGTGCCAGAAACCGCAAGCAGTTGCTAACCCAGTGGGACAANGCGGATTGGAA 

GAGCGGGAAGGTCCTGGCCCAGAGCAAGTGTGACACTTCCCTCTTTGACCATGAAACTCT 

NGGGTGTCTGCATTGCTGATGGC 

Sequence 937 

GTCCGCCGGCATGAGCTGTCCATGAAGGATGAGCTGCTTCAGTTCTACACCAGCGCTGCG 

GAGGAGAGTGAGCCCGAGTCCGTTTGCTCAACCCCGTTGAAGAGGAATGAGTCGTCCTCC 

TCAGTCCAGMTTACTTTCATTTGGATTCTCTTCAAAAGAAGCTGAAAGACCTTGAAGAG 

GAGAATGTTGTACTTCGATCCGAGGCCAGCCAGCTGAAGACAGAGACCATCACCTATGAG 

GAGAAGGAGCAGCAGCTGGTCAATGACTGCGTGAAGGAGCTGAGGGATGCCAATGTCCAG 

ATTGCTAGTATCTCAGAGGAACTGGCCAAGAAGACGGAAGATGCTNTCCGCCAGCAATGA 

GGAGATCACACA 

Sequence 938 

CCCGCGTCCGGAATTCCAGTTGTGGATGAAGGAAATGGTGTTATGACTGCCTCAAGGTTT 

TGTAGCAAGTCATAGGGAACCAAAAGAGGAATCTTGTTTTCCTCAGAGGTCATGCCAACT 

CCAACTCCCGTTCCCTAAACTGTCTCTGAGCCATAGACTAGTAATGGACTCTTCAAGCTC 

TACCATTAGGTATCTTTTAAAGAMGCTGGTTATTACTATTTATTCA I MM i I CTCTTC 

TGTGCAGTGCAAAAGATATGAAACATCGGCTAGGTTTCCTGCTGCAAAAATCTGATTCCT 

GTGMCACAATTCTTCCCACAACAAGAAGGACAAAGTGGTTATTTGCCAGAGAGTGAGCC 

AAGAGGAAGTCAAGAAATGGGCTGAATCACTGGAAAACCTGATTAGTCATGGAATGTGGG 

CTGGCAGCTTTCAMGCTTTCTTGAAGTCTGAATATAGTGAGGAGGAATATTGACTTCTG 

GATCAAGCTGTGNAAGAGTACAAGAAAATC 

Sequence 939 

CGTCCGGCCGGCGACGGCGGCAGTGGCGGCCCGGCCTGCAGGAGCCCGACGGGGTCTCTG 
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' CCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGCGGCGAGACATGAGGAGA 
CCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGATGGAGGAGAAATACGGCG 
GGGACGTGCTGGCCGGCCCCGGCGGCGGCGGCGGCCTTGGGCCGGTGGACGTACCCAGCG 
CTCGATTAACAAAATATATTGTGTTACTATGTTTCACTAAATTTTTGAAGGCTGTGGGAC 
TTTTCGAATCATATGATCTCCTAAAAGCTGTTCACATTGTTC 
AMCTTGGGACTGCATTTTTTATGGTTTTGTTTCAAAAGCCAT^ 
ATTACCAAACACCAGATAATTGGATCACTAAAAATTCCTGGTAGAAAAGAATTTAAAGAC 
AAA 

Sequence 940 

TCCGAAAGNGTACTGCCATGANCCGAGATAGGAGACACATAAGAGGACAGCAGAAGCCCT 
GGCCCTGGGGAGGCTTCTCGGAAGGCCTGGCTTCACAGGCAGGCCACAGAAGGATATCGC 
GGGCACCGTGCACCCAMGCAAGATAGTGGCTTCCCTTTTATATCCAATCTAATCCTGAT * 
TGGATGTCCCTGAGGCCCCTGCTGGAAACAGCCATAGGAGAGGGCCCATGGCAGTAGGGG 
MAGAAGGMGAAATTCCCTGCAACAAAACTTCAGCTAAACTTTGATTTGTGTATTGTTT 
ACATAATAATTTTAAAGGGTACATAATGTGTAAAGAGTTTGGATAGAACCTCTCTTCATA 
CTATGGTTTTCGTAAAGG ATCTGTTGTTGTTACGGATTCA I I I I 1 1 CCCTCTATTTTTAT 
AAAGAGCAGCAGAGTTGTCTTCTCAAAACGGCTGCCAAGCTCTGCTTCTTGGGAAGAT 
Sequence 941 

CCCGGTCGTGCGGGTCGGGCGCGGGCGGGCGCGGCGGCAGTGGCGCTNTCAGGTGATTGA 

CTGGCCAGCTGCCTGAAGGAGCGCCAGGTCCTCCTTGCTGGCAGGTGGCGAAGCCCATTG 

GGGCGGCGGTGCAGACCCGCGGCGGCNGCTGCGGCGGTCTGGCTCGGGAGGCGTTCCTGG 

GGCCAAGGCCATGGCCCCGCGGCTGCAGCTGGAGAAGGCGGCCTGGCGCTGGGCGGAGAC 

GGTGCGGCCCGAGGAGGTGTCNCAGGAGCACATCGAGACCGCTTACCGCATCTGGCTGGA 

GCCCTGCATTCGCGGCGTTGTGCAGACGAAACTGCAAAGGAAATCCGAATTGCTTGGTTG 

G 

Sequence 942 

CACCCACCCCAGATGCCGCCTGGCACCAAGCGCAGCCGCCAGCTGCCGCACTTTCCACTT 

GTATTGATCACCTATNANNCCCGCGCANAACGGCTACGNCCGAGCGGACCGCGGCCAGCG 

CGCCAGCCCTTGGCACNCCCTNGGAGCAGAAAGGGCTCCGGGAGGAAACTCCTTGGGAGC 

GCCCTGTCCGGANTGCCCTTTGCTCTCTGCAGTGTGATTTCTTTCTGTTCTGGGAGGAGG 

AGGAGGAGGANGAAGAGGAGGANGAGGNAGAACGANANNCTGCCCTTCCAGAGGTTGGTG 

AGGGAGATCGCGCATGGATTTNAAAACCNACCTGAGGTTTCAGAGCGCAGCCATCGGTTG 

CNCTGCANGAGGCTAGCGAAGCGTACCTGGTGGNTCTGTTCGAAAGA 

Sequence 943 

GTCCGGTTTTGAAACAGAAATGTAGGCATTAGACTTCCTGGGCGGCAGACAAACCAAAGA 

GCGGAAATTCATGCAGCCTGCAAAGCCATTGAACAAGCAAAGACTCAAAACATCAATAAA 

CTG GTTCTG TATACAGACAGTATGTTTACGATAMTGGTMGCTTTCACATTTGATTTCT 

TCTGTTTTTCCAGTAACTGTGAAGGGAAATTGGTAGGAGGTGTTGTAACAGGGCAGGACC 

CAAATGGGAACGGGGGGATGACATTGGTTTGTCAGGTACCGAGCAAAGAGTGAGGATTTT 

GGAGTCTCCCTTCTGCTGCTCTGATGTTTTCCACATGCTTATTTCTTTGCCAGGCACTGG 

AGATGCAGTCAAGAAGTGGGAAGTGGCTCTTACTTCTAGTCTGTGTGTGTATAAGTCACT 

TAAGATGGCCGTGTTGACTGCTTCTTTGGGAAATGCCCTGAATAGGAGCATGTAGGGGAT 

GCTTACCGAGGCTGGGGAAGG 

Sequence 944 

GCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGGCCCGACCCCGAGCCCGACC 

CCGAGCCCAAGCCCGAGCCGGAGCCCGAGCGAGACCCCGAGCCCGAGCCCGAGCGCGACC 

CCCGGTGCGGCGCGGCTACCCCGGCGGAGGCGGNGGGCGCGGGGCGCGCTCTGAGGCCCG 

GGGGATGCGCCGCCGCCTCGACCATGGGCGCCGCCGCCTCCAGGAGGAGGGCGCTGAGGA 

GCGAGGCCATGTCCTCGGTGGCGGCCAAAGTGCGAGCAGCCCGAGCGTTTGGAGAGNACC 

TGTTCCCAGAGTCACCCTGAGAACCGCAACGGCCGCAGATCACCTGCTGGCTGATGCCTA 

CTCTGGCCACGACGGGTCCCCCGAGATGCAGCCGGCCCCCCAGAACAAGCGCCGCCTGTC 
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CCTCGTTCTCCAACGGCTGCTACGAGGGCAAGCCTTCTCAGAGGAAGCCCAAGCATTAGG 

AAGCCCGCAGGC 

Sequence 945 

CGCGTCCGGCACGGGGGAGTCTGTGGTGGCCNGTTTACCTGGGCATCTGGCTGAGAGGAA 

GAAAGGCCAACCTGATCCTGAGGGGACCCAGACATATCCTTTGCACTGTCCCTAGAGGGG 

CGATGAGCTTTGCAGCATTMAAMTGGTGAAGGGGGGAAATATTTTGAACCAAAGACCA 

MTGTTAGGCCGCCGTTATATTTGCAGAAGCTTTGAGAACCATGCGTATAGCCTCCTGCA 

TTCTCCCCTCTCCTAGGAGCTCTTTTGTCTCTGTCC7TACGAGGCGTCATACAGAGGCAG 

TGGGGTGGGCACAGATGAGCAGAGTGGATGGTTCGGTGGGTCCCCACGAGGGCGAGTGGT 

GGTCATATGTGATGGCACCGTGTTCACACACCCTCCTGTGTACCCCCCCAGGGTCACCCG 

AAGTCCCCACACGCTGGCTCTCCACACCCCTCCTGTTCCAGAAAGCATGTCCCG 

Sequence 946 

TCGACCNCGCGTCCGGCACTCCCTCTGGCCGGCCCAGGGCGCCTTCAGCCCAACCTCCCC 

AGCCCCACGGGCGCCACGGAACCCGCTCGATCTCGCCGCCAACTGGTAGACATGGAGACC 

CCTGCCTGGCCCCGGGTCCCGCGCCCCGAGACCGCCGTCGCTCGGACGCTCCTGCTCGGC 

TGGGTCTTCGCCCAGGTGGCCGGCGCTTCAGGCACTACAAATACTGTGGCAGCATATAAT 

TTMCTTGGAAATCAACTAATTTCAAGACAATTTTGGAGTGGGAACCCAAACCCGTCAAT 

CMGTCTACACTGTTCAMTAAGCACTAAGTCAGGGAGATTGGAAAAGCAAATGCTTTTA 

CAC 

Sequence 947 

ACCCCGCGTCCGCTTTTGCATCTGGATCATTTTTCTCTTGCCCCACCATGTAAGAAGTGC 

CTTTCACCTCCCACCATGAACCTGAGGCCTCCCGAGTCATGTGGAAATCGCCCCCAGCCA 

CCCCACCCCAGAGGGCTACGTCCTGGCAGAGCTGGGTTTGGTTAGTTCTGAGGGCTGAGC 

TGGCCCAGCAGCTCCAGACCTCCAGACCTTGCACTCACCTGTGAACTTGACTCTGCAAAC 

TCCTCCAAGATGCGCCCACCACACTCCCAGTGAACAACACCTACAGGAGCTTGGAGTTCT 

ATTCTCAGATACATCAGCTTCCACATTCCTGTGTGTCCCAGCTGGAGAAGCAAGAAGTCC 

CAGACCATGTGCTAAGCACACGTTGGGGTGGGGATGAAATCCAATTGGTGGTGTGTGAAT 

CCATGCTGGATTCATGAAGCTGAGGCCCAGAGGAGGAAGCTTTCTTAATCAACTTCTTAA 

CATG 

Sequence 948 

TAAMGCCAT6GTNATTTGTGCACTGTGCAGTTTCTTATTAGCAAAGGTGCCAATGTTAA 

CAGGGCTACAGCCAATAATGATCATACAGTAGTGTCGCTGGCATGTGCAGGAGGCCACCT 

GGCAGTTGTTGAGCTTCTCTTGGCTCATGGGGCTGACCCTACTCATCGACTCAAGGATGG 

TTCMCAATGCTCATTGAAGCTGCAAAGGGTGGCCATACTAATGTAGTTTCTTATCTGTT 

GGATTATCCAAATAATGTTCTGTCAGTTCCCACCACAGATGTGTCTCAGCTCCCTCCACC 

TTCTCAAGATCAGTCTCAGGTGCCACGTGTGCCAACGCATACACTTGCCATGGTTGTACC - 

TNCCCAGGAACCTGACAGMCTTCACAGGAGAACTCTCCTGCCCTTTTAGGAGTGCAAAA 

A 

Sequence 949 

CCACGCGTNCGGTCGGCCTGTGCGGCGCTGCGGCGGAGCGGGCCATGGCAGTGGGGAGGG 

GGCGAGTGTAGTGCTGCGCGGGGCAGGCGGGAGGTGATCGAGAGAGGCAGGGATGGGGGC 

GCCGGAGTGGAGCGGTTGCGGCGGNCTGGGCTGCTGACTGCGCACTTGGAATAGTAGCAG 

GCGGCG GCGG CGGAACGCCAGGCAGTGTATGTTTTAACTGGAAAAAGTCTTCCATGAAAA 

CCGTCACTTTTAAMAAATAAGGTAATGCCCATTCTGTTTTTTCCTAAAAAAGACCTGAA 

AAT GGGGG GGCCGMCACATTCCTTAGGGGCCCCCGGTGG^ATTGAMTGTGCCTTTC 

AAG I I I I I CATTAAATGCNCTCCTGGCTTATTGGGCAGGACCATTCCCTTTGMCAATCC 

TGGGGGCCGGGCTGGGATTTCAACAAGAATTAGGCAATTCTTGGAATGGGCCTTCCAATA 

ACCCTGNTGGGGAATTTTTCCCNTTTTNGCCCCCAACCTTGGGGGMTTTNATTATTTNC 

AAGNTTTGGGGMGGGTTACCCTTCNGGGGGMANGCTTMCCCAATTTTTC 

Sequence 950 

TTNGGGAGTCGCCACGCGTCCGGCCGGCGACGGCGGCAAGTGGCGGCCCGGCCTGCAGGA 
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GCCCGACGGGGTCTCTGCCATGGGGGAGTGACGCGCCTGCACCCGCTGTTCCGCGGCAGC 

GGCGAGACATGAGGAGACCCCGCGACAGGGGCAGCGGCGGCGGCTCGTGAGCCCCGGGAT 

GGAGGAGAAATACGGCGGGGACGTGCTGGCCQGCCCCGGCGGCGGCGGCGGCCTTGGGCC 

GGTGGACGTACCCAGCGCTCGATTAACAAAATATATTGTGTTACTATGTTTCACTAAATT 

TTtGAAGGCTGTGGGACTTTTCGMTCATATGATCTCCTAAMGCTGTTCACATTGTTCA 

GTTCATTTTTATATTAAAACTTGGGACTGCA I I I I I I ATGGTTTTTGTTTCAAAAGCCAT 

TTTCTTCTGGGAAAAC 

Sequence 951 

NNTCCGGAGTCGACCNCGCGTCCGCGGCTGCTGCCTGCTCTGGAGGCAGGCTGGGCGGTG 

GCGGCCGAGACTGGCGGGGGTGGACGCCCGGGCCGGGCTGCGCCCGCTTCTTGCAGCTGT 

GAATTCCTTTGGACAATTGATGATATTTATCATTGTGCCCAGTTTCTACAAATAAAAGAT 

GGGTGGATTATTTTCTCGATGGAGGACAAAACCTTCAACTGTAGAAGTTCTAGAAAGTAT 

AGATAAGGAMTTCAAGCATTGGAAGMTTTAGGGAAAAAAATCAGAGATTACAAAAATT 

ATGGGTTGGAAGATTMTTCTGTATTCCTCAGTTCTCTATCTGTTTACATGCTITMTTGT 

ATATTTGTGGTATCTTCCTGATGAATTTACAGCMGACTTGCCATGACACTCCCATTTTT 

TGCTTTTCCATTGATCATCTGGAGCATMGMCAGTMTTATTTTCTTCTTTTCCAAGAG 

AACAGAAAGAAATAATGAAGCATTGGATGGA 

Sequence 952 

TCNCCCCGCGCCGGTTTTGATACAGAATGAAAGTGCGTAGTATTTT^ 

TTGCCTTATACATATAGCAAGCCCTCAATAAATAAATATTGAATGAATGAATGAGTGAGT 

GAAGAATTTGTTTATMCAGTCTGTCATCTTGATAACACTGGAATGTCTTTGGTTCTTCC 

ACTTCATCCTTTATGTTTTAMCTTACACACACCATTCTTACACGTCACTAAAGGAAAAT 

ACCAGTATATATTGGCTAAAA I I I I I I I I I I GTTGTTCAAAACTGAAACTCAAATGCCTA 

ATTGGGCTAGGGGTCCTCTTAAAGGAGGTTGATGTTTGTCAMTGGGTTATTTTTTAAAA 

GCAGTAGATAATTGCTTATTTCAAGGCMGTAAATGAATTTAGACTAAGCTGTTCATAGG 

ATTCATCATTTTTTCCCCTCTCCCAAAGTAATTTGTAAGCCGTAAAC 

Sequence 953 

TCGCCCCGCGTCCGTGATTTCTCAGTGTTCTCCTTAGATACCAAATACAAAGGACGAGGG 

ATCAAGCTCAGCGAAAGTATCAGGCATTTAAGGTATCAGGCAGCAATGCGGGGAAAGGTG 

AATTTTCTTCMTCAGCATAGGATGGTTAGGGMGAGCATTTATCACTTTGGTTCTTATC 

CTTCAAGCCAGGGGAAAAGCAACAGTGAGGACATCAGAGACAAAAGCATTTATAGAACTA 

ACAMCACMACGTTTGACAAGTGAGAAAGCTTTATTAAAGCACACATACATGTCAGGGG 

GGTGGGAAACAAAAGAGCAAGTTACAGCCCGGGATCCCAAGTTATGCCTTCCATTACAAT 

TGCMTCCACACCMATCAATCTTTGAAAACATTCCTCCATTCGGTTCATACATACAGTA 

GAAACCACTGTGGCTGCCCTTAATCCAGTGTGCTTATAGGAAATCAGTTAGCAGCTGACT 

CTGTTGAAAG 

Sequence 954 

CGTCCGGACCCTTATTAAGAATATCCCAGGAAGATGGTGATGAACAGCCTCAGTTTACTT 

TTCCACCAGATGAATTCACTAGCAAAAAAATTACAACAAAAATATTACAGCAGATTGAGG 

MCCATTGGCACTGGTGTGAACMTTMCCAGCAMTGTCCTTTTCTAATACCATTTGAA 

ACTAGACAGCTTTATTTCACATGTACAGCATTTGGCGCCTCAAGAGCAATAGTATGGTTA 

CAGAACCGACGTGAAGCCACTGTGGAGCGAACGAGAACCACAAGCAGTGTTAGGCGAGAT 

GACCCTGGAGAGTTTCGAGTTGGTCG1CTCAAGCATGAAAGAGTAAAAGTTCCACGTGGC 

GAGTCACTGATGGAATGGGCTGAGAATGTCATGCAAATACATGCAGATCGGAAATCAGTT 

CTTGAGGTTGMTTTTTAGGAGMGAAGGAACTGGCTTGGGACCCACATTAGAGTTTTAT 

GCTCTGGTG 

Sequence 955 

ACCACGCGTNCGGGCAGAAATACGGCGGCATGTTCTGCAACGTGGAGGGCGCCTTCGAGA 
GCAAGGACGCTGGATTTCGATGCCCTCAGCGTGGGGCAGCGGGGCGCGAAGACTCCTCGG 
AGCGGCCAGGGCAGCGACCGAGGATCGGGGAGTCGGCCCGGGATCGAGGGGGACACCCCG 
CGCAGGGGCCAAGGCCGGNAAGAGAGCAGGGAGCCCGCGCCCGCCTCCCCCGCCCCCGCC 
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GGGGTAGAGATCCGGAGCGCCACCGGCAAAGAGGTGTTGCAGAACCTCGGCCCCAAGGAC 

AAGAGTGACCGTCTNCTTATCAAGGGAGGCAGAATCGTCAATGATGATCAGTCCTTTTAT 

GCTGATATTTACATGGAAGATGGCTTAATAAAACAAATTGGAGACAATCTGATTGTTCCT 

GGAGGAGTGAAGACCATTGAAGCC 

Sequence 956 

CCCGCGTCCGCTA CTGTA CTTTGCAGTTTGATGTTTATTMCATTCTTTGGGCACCTAGC 

TACMTATMCTCAATTTTCTGTGAAAAACTATTMTCATCCTATTTTTTCCTTGTCTTT 

MTATG AGATAAATTTTATACCACTGTTTCTCAMCCATCTGTTGTGAGGGACAGTTTTG 

CTTTTTAATTTCCMTTGTCAGAGACCMTACTTTTGTAAMTATMTTAAAAACAAACA 

TAAAAATAAACTTATTAGAAAMTGAAATAAAAGAGAAATGAMTMGAATMTTTTATT 

ATTAGATTTMCAGATCAMTATTATTTCAATACTCAGATCAAATGTGCAATAAGACAGG 

GTTGCAAAAMTGCACACTTTTTTTATTAAAATCATTTATATM 

TMTATTACAGTTGCAACTTTCTGGTGNTTCTCAACTATGACCAAACAGGAGGGTACAAG 

TAAAGGAGCAATCCCAA 

Sequence 957 

GTCGACCACGCGTCCGCAGCAAAAGTGCCTGGCTGAAGGACACTGTTGACCCAAAACTGG 

TGACCCTCAACCACCGCATTGCTGCCCTCACAGGCCTTGATGTCCGGCCTCCCTATGCAG 

AGTATCTGCAGGTGGTGAACTATGGCATCGGAGGACACTATGAGCCTCACTTTGACCATG 

CTACGTCACCAAGCAGCCCCCTCTACAGAATGAAGTCAGGAAACCGAGTTGCAACATTTA 

TGATCTATCTGAGCTCGGTGGAAGCTGGAGGAGCCACAGCCTTCATCTATGCCAACCTCA 

GCGTGCCTGTGGTTAGGAATGCAGCACTGTTTTGGTGGAACCTGCACAGGAGTGGTGAAG 

GGGACAGTGACACACTTCATGCTGGCTGTCCTGTCCTGGTGGGAGATAAGTTGGGTGGCC 

AACAAGTGGATACATGAGTATGGACAGGAATTCCGCAGACCCTGCAGCTNCAGC 

Sequence 958 

GTCGACCACGCGTCCGCGCCAACTCCGGAGGCGCGGTGCTCGGCCCGGGAGCGCGAGCGG 

GAGGAGCAGAGACCCGCAGCCGGGAGCCCGAGCGCGGGCGATGCAGGCTCCGCGAGCGGC 

ACCTGCGGCTCCTCTAAGCTACGACCGTCGTCTCCGCGGCAGCAGGGCGGGCCCCAGCAG 

CCTCGGCAGCCACAGCCGCTGCAGCCGGGGCAGCCTCCGCTGCTGTCGCCTCCTCTGATG 

C GCTTG CCCTCTCCCGGCCCCGGGACTCCGGGAGAATGTGGGTCCTAGGCATCGCGGCAA 

CTTTTTGCGGATTGTTCTTGCTTCCAGGCTTTGCGCTGCAAATCCAGTGCTACCAGTGTG 

AAGAATTCCAGCTGAACAACGACTGCTCCTCCCCCGAGTTCATTGTGAATTGCACGGTGA 

ACGTTCAAGAC 

Sequence 959 

CCACGCGTCCGAGGGTGGGGAAAGGAGGAGAGGAAGAGCACTCCCTTCCCTGGCCCCTCA 

TCCAGCCTCCGGTGCTGTAAAACGCAGGCGCTGGGCCGCGGGCGGAGCTGAGGACAGGCC 

TTGGCTGGTCCCAGGATGAGCGACGAGTTTGGTTTTAGCTGGGGATTGTGCTGGCATCCT 

GCGAAGCTCCTCCCAGCCGGTCTCTCTGTGCTCGGTTGTCTTGGGGTGGGGCCCATCCGC 

CGAGGTGGGGACCGATAGGAGAAGCCGGTGGGTTGTACCCTTACACTTGTGGAGTCTCCT 

CTTGCCTCTACCTACTCCGCCTTTGTCCTTAAGGTTTrTGCAGGCCAGTGCCAAACACAC 

ACTAACTGTCCTGGCCTCTCCGTGACACAAGTCTCTTCCCAGCCTTCCTC 

Sequence 960 

CCACGCGTCCGCGGACGCGTGGGGCCGGGACAACTGGTCTTATCACGGAGGCTGGGGCCA 

NGGCAGCCCTTCGGTTCGGGTGGGCCCATGGACCCCAGTCCAACGCCGAGGGAATAGGAC 

CATCCAAAAGCGGAACCTTCGCCTCAGAAAAAGGGTGCGGGACCCCTCCTCACCGTGCGG 

TCACGCGTGGACCCTGCCAGCAGCCAGGCCATGGAGCTCTCTGATGTCACCCTCATTGAG 

GGTGTGGGTAATGAGGTGATGGTGGTGGCAGGTGTNGGTTGGTNGCTGATTCTAGCCTTG 

GTCCTAGCTTGGCTCTCTACCTACTTAGCAGACAGCGGTAGCAACCAGCTCCTGGGCGCT 

NTTGTGTCAAGCAGGCGACACATCCGTCCTNCACCTGGGGCATGTGGACCACCTGNTGGG 

CAGGCCAAGGCNNCCCCGAAGCCAACTGA 

Sequence 961 

NCCCCGCGTCCGGGAGGCTCCATGTTGTCCCCTCAGCGAGTGGCAGCAGCTGCCTCAAGA 
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GGAGCAGATGATGCCATGGAGAGCAGCMGCCTGGTCCAGTGCAGGTTGTTTTGGTTCAG 

AMGATCAACATTCCTTTGAGCTAGATGAGAAAGCCTTGGCCAGCATCCTCTTGCAGGAC 

CACATCCGAGATCTTGATGTGGTGGTGGTTTCAGTGGCTGGTGCCTTCCGAAAGGGCAAG 

TCCTTCATTCTGGATTTTATGCTACGATACTTATATTCTCAGAAGGAAAGTGGCCATTCA 

AATTGGTTGGGTGACCCAGMGAACCGTTMCAGGATTTTCGTGGAGAGGGGGATCTGAT 

CCAGAAACCACTGGGATTCAAATCTGGAGTGAAGTTTTCACTGTGGAGAAGCCAGGTGGG 

AAGAAGGTTTGCAGTTGTTTCTGATGGATACCCAGGGGGCAT 

Sequence 962 

GCCCCGCGTCCGCTTCTCCGAATATAGCAACGTCCAGCAGTGTCCACACTGTGGGAACCT 

GGACTACCACTTCGTGAAGCCATTTTCCTCCTTCAAAGTTCTCGAAGCTTATTGATGAAA 

GCTTTGCTTTAGTMTAGCTATTTTATTGATATTATTACTTTATTACATATCTT^ 

GGAMCATTCTGTGACATTMTTTCCTTTCTAATTTAAAGGAGAGTTACTTTGTTGTATG 

TGTGCCACTAAAATAGGGGCTGCCCTTGCCCTGTCTTGATTCCCGAGTGTTAATCTGTGG 

TTTTGACCAGAGCCCAGATGGGTMTCCTGTGCATTTGGGTTGGGGGTTCACTCTTACCA 

AGMTCTTTGATGCAGCTTTMGATGGTGGGGAGATGGGGGTTGAATTTAGGGAAAGAAT 

NTTTGTGGGTTATAAACTAAGAGCTTGATAGGAGTTGGAAGGAAACTCTTACTAAAATGT 

TAACtTTCTAAAAACCTTCTTTTANATCTTNCTTGGGCCTTTGGAAAA 

Sequence 963 

GTGTTTTGGGGATGCCTTTCCTTACCAGATTCTTCTAAAGCCCAGCTGCACCCACCCTTA 

AGTGGGAGATAAGGCTTCTGCCCGCGGGCTCTGCGTTCGTCCACCCGGCCCCACGTTTGC 

TGTGGACTAAACAGGAGCCACTGGACTAGAGTACACTTGACTCTCGGCTCTGCGGACCAA 

AAATTCCAGGACTAAGGAATAGCAAGGTTAGGCTGAAACAGTCCACACAGGGCTTGCGGT 

MACGTCTTTTCAGGAGCCACTCGCCCAGTGCAGTAAGTCGTGTACTTAGTTGACTCGAG 

CGCTCCAGGGAGACGCCCGACCCTACTCTGCGCCGCCCCGGGGCACCAGCTCTGCTTCCT 

CCAGGTCCACTGAGGCAGGCACGCCCAGCTCTGGGACAGGTCAGTAAACAAGCCACGAAC 

CGCGCCAGGGATCAGAGAACCCANAGTCCCCGCCAGCTGCCGGCACAAGCCAATCGCAGC 

GCANCCAGGCGGC 

Sequence 964 

GTCTAAGGGATCCAGGTCCTGTGTCTCAGGGACCTCTGATGGGATTGAATCCAAGAGGAA 

TGCAGGGGCCTCCAGGCCCNCGGGAGAACCAGGGTCCTGCTCCCCAAGGGATGATTATGG 

GCCACCCGCCTCAAGAGATGAGAGGACCTCACCCTCCAGGTGGACTACTGGGACACGGCC 

CTCAGGAAATGAGAGGTCCTCAGGAGATCCGAGGCATGCAGGGGCCTCCACCCCAAGGAT 

CAATGCTGGGACCTCCCCAGGAATTGCGAGGGCCTCCAGGCTCACAAAGTCAGCAGGGGC 

CGCCCCAGGGCTCTTTAGGACCTCCACCCCAGGGTGGCATGCAAGGACCCCCCGGACCTC 

AGGGACAGCAGAACCCAGCAAGAGGGCCACATCCATCTCAAGGGCCAATACCATTCCAGC 

AACAGAAAACGCCTCTGCTAGGTGATGGGCCCCGGGCCCCCTTCAACCAGGAAGGACAGA 

GCACAGGCCCCCCACC 

Sequence 965 

TGCGCATGCGCGGAGCGCGGCGCGCGCGGCGGTTGGGCCGTTGGCTGTTCGGCCCTGGGA 

TCCGCCGCCACTCCGCGATCAGACCGCTCTGTGCCGCGAGCCGCCGTGAGCACTCGGATT 

CAAGCCGGCGCCAACGAGTCCGGGGGCATCGCCCGCAGCGGCCAAGCTCATGGCCGGCTG 

AGCGGGACGCCGCCTNCGCCTCAGCCACCGCCGCCGCCGCCGNCTTCTTCTCCTCAGCCG 

GCGGCGGCCCGGGCCCAGCAACCATGGCTGAAGACTACTGGGACGGGCGCCTGCGGCGAA 

CAGGAGGAGAAAGGGAGGTCGCGCGGCCTCATTCCGGGCCGCCGCCCCAGGCGCCGCCGC 

GCCGGCCCCGCGGCTCTGAGGTTGCTCGCGCGCCCCC 

Sequence 966 

TGGAAMTNTTTTGGMAAMTTACCCTTGGGACCTTGNTTTTNAANCCCNAGGTTCCCN 
GTTNNGGCAMTAMANAATGNNNGACCCGGGATTTNGGGNTTNNAMCCGGGGGTTTTT ' 
AATTTCCCCNNNNCNNGGGNCC I I I I I I I I I NCCNCCCCCNCAAGGGGGNTTTGGGAAAN 
NAAANCCCCCCCC I I I I I I I I I NGGGGGNGAAANTTCCCCGGGTNNNNGCCN I N I MH 
TTTTTAAA 
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Sequence 967 

GTCCGCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGAAAGATG 

AGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGMCGCATTTACTGTC 

ACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAATGCAAA 

TTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATGGAGGAT 

MGMCATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGTAGCTAC 

AGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTTCAGATC 

ACAGATGTGAAATTGCAGGATGCAGGGGGTGTACCGCTGCATGATCAGCTATGGGTGGTG 

CCGACTACAAGCGAATTACTGTGAAAGTCAATGCCCCATACAACAAAATCAACCAAAGA 

Sequence 968 

CGTCCGGGMCTCAGCMCGGTTTCTTCATCCAGGACCCGATTGCTCTGGTGGAGAGGGG 

GGGCTGCTCCTTCCTCTCCAAGACTCGGGTGGTCCAGGAGCACGGCGGGCGGGCGGTGAT 

CATCTCTGACAACGCAGTTGACAATGACAGCTTCTACGTGGAGATGATCCAGGACAGTAC 

CCAGCGCACAGCTGACATCCCCGCCCTCTTCCTGCTCGGCCGAGACGGCTACATGATCCG 

CCGCTCTCTGGAACAGCATGGGCTGCCATGGGCCATCATTTCCATCCCAGTCAATGTCAC 

CAGCATCCCCACCTTTGAGCTGCTGCAACCGCCCTGGACCTTCTGGTAGAAGAGTTTGTC 

CCACATTCCAGCCATAAGTGACTCTGAGCTGGGAAGGGGAAACCCAGGAATTTTGCTACT 

TGGAATTTGGAGATAGCATCTGGGGACAAGTGGAGCCAGGTAGAGGAAAAGGGTTTGGGG 

CCGTTGCTAGGCTGAAAGGGAAGCCACACCACTGGCCTTCCTTCCCCAGGG 

Sequence 969 

GATTGGAGGAGTCACATCCCCTCTTCAGCCGCAGCACCCCTCCCTCCCATCCTCTAGCTC 

TTCCCGCGGTGGTCGCCTCCCTCCGACCCTGCTCTCCCCTCCTGGGCCCCGCGCAAAGCC 

CCCTCTGTTCCAGCTCCCGGGCCTCGGCTGCCTCCCCGCCCTCCCATCCCTTCCTCTTCC 

CAGGGCCTGGAGCGCTCCCTTACATTCCTGAGATGCCCTTCCTCGGGGCTGTCCCCCTTT 

GCCTCCCCAGCATCCCATTTCTAGGCCTTTTTCAAGACCCTTCCAGAGCGGCCCCTTTCC 

AGCTCCCTTTCTCGTTTCCATTTCCAACTTTGCCTCTTTTGCCTCTTT 

TCCMGCTCCCCTCCCTCTTTCCCTTCGTCTTACCCTGCTTTGATCTACGCAGCCCCAAA 

CTCAAGCTCCCCGCTTTCAAGGTGGTGCGAGGTTGTTGGGGGGTGCGGAAGGGCCTGCCA 

AGTCCATTTTTCGAGGGG 

Sequence 970 

GTCCGAGATCGCGAGCCGCCGCCC I I I I I I I 1 1 1 I I I I I ATAAG ATTATTAGTATAAAAN 

GGGGAGACGAGGTTAGGGCCCTGGGAAAGGTGGGAGATCAGCCAGAGACAGGTTTCCCAG 

AACAGAATGTCTGGCCTTTGTGGTGAGGAGGGACTGTGGTATGAGCCGCAGAAGCGGGCC 

AGGGGTAAACCCTCCTGTGCGTCCTTCCTTCAGCCTGGTCCTGAGGGTGACCCTTTGATC 

CTGGGTTCTCCAGGTAGGGCTGTGAGCTGTGAGTTGGATCCTTTTGGTGAAATGGTCTCT 

CTCATCTGGCCTGTCACTCAATGTGGAATAGAGTGAGTGAGTTCTATGGGTTCTAAGTCC 

TGCTCTGGAACCATAAGTMGTTATCCTCTCTGGGCTTCAGTTTTTCATGGAAAGTTGCG 

TTMGMTCTAGTTTAAGGCCAGGCATGGTGGCTCACCGCCTTGTAATCCCAGCACTTTG 

GGGAGGCCAAGGAAGGTGGATCATGANGTCAGGAGATCGAGACCATCCTNGCTAACATGA 

TNAAACCCGTGTCTTTACTTAAAAAATAC 

Sequence 971 

CCTGCCAGTGGTGAGCACCTTCGGCCTCCAGGTGCCTTTCTTCTTCTTCGCGGCCATNTG 

CTTGGTGAGCCTGGTGTTCACAGGCTGCTGTGTGCCCGAAACCAAAGGGACGTCCNTGGA 

GCMATCCGAGTCCTTTTTCCGCACGGGGAGAAGGTCCTTCTTGCGCTAGGTCAAGGTCC 

CCGCCTGGAGGGGGCCAAACCCCCA 

Sequence 972 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGTCAGCCTCCACCTGGAAGAGAGCTANGGG 

CCGGGCAGGCCGGGCAGCTGCCACCCCGCCCGGCCCGACGCCCCGCATGCCCCGAAGTCC 

CTGGCGCCCACCCGGCCGCGGCCCTGCGTGTGACCCGCGGGTCGATACCTGGCAGCCCCA 

GTGCTGGGGCGCCGCGGCCCTGCTCGCCCAGGAGGAGAGCGAGGGCCCCACACTGAGTCT 

CTTGAAGCCTCACGTTTCCCTGGGGGGGTGCTGCATCGTCGGGTGTCCCTCACCCCACCT 
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GGGGAACCTCTGTCTTCAGGTCACCCCTTTTCAGGGGCCTGG 
Sequence 973 

CGTCCGGGACCCTGCTCATGGAGAACATCAGCAGCTGGCTGCTCCTTCGCTGACGCCCTG 

GGCTACGTGMCCTGCCGCTCACCTTTTTCTGCCGGGCAGAGCTGGATAGTGAGCCCGAG 

CGGGTGGCGTCCGTCCTGGAAAAGCTGAAGGAGGACTGNAACAACACTGAGAACAAAGAA 

CGGAAGTCCTTNCAGAAGGAGCTTGTGATGGCCCTACTGAAGATGGACTGCCAGGGCCTG 

GTGGTCAGACTCATCCAGGACTTTGTGCTCCTGACCACGGCTGTAGAGGTGGCCCAGCGC 

TGGCGGGAGCTGGCTGAGAAGCTGGCCAAGGTCTNCAAGCAGCAGATGGACGCCTACNAG 

TCTCCCCACCGGTGACAGGAACGGGGTTGTGGACAGCGAGGCCATGTGGAAGCCTGCGTA 

TGACTTCTTAC 

Sequence 974 

TCACCACGCGTCCGCGAAGCGTGCACCGCTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCC 

GCAAGCCCAACGCCGGGGGCAGCCCCGCTCCGGTGCGCCGCCGGNCGGAGGCCTCGCCGG 

TGCAGAAAAAGGAGAAGAAGGACAAGGAGCCGGGAAAACGAGAAGGAGAAGAGTGCCCTA 

GCCCGGGAGCGCAGCCTCAAGAAGCGCCAGTCGCTGCCCGCCTCCCCACGTGCCCGCCTC 

TCTGCCAGCACCGCCTCTGAGCTCAGCCCCAAATCCAAGGCCAGGCCATCCTCTCCCTCC 

ACATCCTGGCACAGGCCTGCCTCCCCCTGCCCCAGCCCAGGGCCAGGCCACACTCTGTCT 

CCAAAGCCACCGTNCCCCCGAGGCACCACTGCATCCCCCAAGGGGCCGGGTTCGGAGGAA 

GGAGGAGGCAAAGGAGAGCCCCAGCGCCGCANGGCCCGAGGACAA 

Sequence 975 

TCCGCAGAAACGGACTTTCTCATCATGCTTTCCTATGGTGGGTATGAGGGGCCAGCTGAT 

ACCAACCAACTGGCCTGTATCTATCTATCTGGATTTGACTTGAATTTTTAAAATGTGTAT 

CGTTTAAAAAAAMMTGTTTGCAAATTTTGCACATAGGATCTTGCACTGTTCATTTTCA 

GTGGGGTGAGTCTTCACACTAAAAACACAAGCAGAGCTCCTGGGAAAAGAGACTGGAAGT 

GGTTCAGGATAAAGAGATCCATGGTGGGCAGGGCTCTTAGGTCACAGAGCTCTAGAAGCA 

GCTGGACTTGAACCCACAATGGCTTGTGTAMTTCGTAAATTTCATGGTTTCTAGGAAAA 

GCTGCATTG 

Sequence 976 

GCGT CCGG MGAACTGTGGMCTGCGGTTGGCAGACMTTTTGTTACAAATATTAAAAGG 

AMGATTTTGCCAATATGACCAGCTTGGTGGACCTGACTCTATCCAGGAATACAATAAGT 

TTTATTACACCTCATGCTTTCGCTGACCTACGAMTTTGAGGGCTTTGCATTTGAATAGC 

AACAGATTGACTAAAATTACAAATGATATGTTCAGTGGTCTTTCCAATCTTCATCATTTG 

ATACTGAACMCMTCAGCTGACTTTMTTTCCTCTACAGCGTTTGATGATGTCTTTGCC 

CTTGAGGAGCTGGATCTGTCCTATAATAATCTAGAAACCATTCCTTGGGATGCTGTTGAG 

AAGATGGTTAGCTTGCATACCCTTAGTTTGGATCACAATATGATTGATAACATTCCTAAG 

GGGACCTTCTCCCATTTGCACAAGATGACTCGGTTAGATGTGACATCAAATAAATTGCAG 

AAGCTACCACCTGACCCTCTCTTTCAGCGAGCTCAGGTACTAGCAACCTCAGGAATCATA 

AGCCCATCTACTTTTGCATTAAGTTTT 

Sequence 977 

NCTCCMCMTTATGGCTCATCCTTCCTTTTACTCTGTCTCACCTCCTTTAGGTGAGTAC 

TTCCTTAMTAAGTGCTAAACATACATANACGGAACTNGAMGCTTTGGTTAGCCTTGCC 

TTAGGTAATCAGCCTAGTTTACACTGTTTCCAGGGAGTAGTTGAATTACTATAAACCATT 

AGCCACTTGTCTCTGCACCATTTATCACACCAGGACAGGGTCTCTCAACCTGGGCGCTAC 

TGTCATTTGGGGCCAGGTGATTCTTCCTTGCAGGGGCTGTCCTGTACCTTGTAGGACAGC 

AGCCCTGTCCTAGAAGGTATGTTTAGCAGCATTCCTGGCCTCTAGCTACCCGATGCCAGA 

GCATGCTCCCCCCGCAGTCATGACAATCAAAAAATGTCTCCAGACATTGTCAAATGCCTC 

CTGGGGGGCAGTATTTCTCMGCACTTTTAAGCAMGGTMGTATTCATACAAGAAATTT 

AGGGGGAAAAAACATTGGTTAAATAAMGCTATGTGTTCCTATTCAACAATATTTTT 

Sequence 978 

CCCCGCGTCCGGGTCCCCGCGACTCCCGGACTGGAGAAAACGGCTCTTGCGATGGGGCGA 
AGTCCGAGCTGCGGCGGGCGTTGGTCCGTGCAGGGAAGTGGGAATCGTTAGGTTCGTTCT 
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GGACCCGCCGCCCCATGGCCCAGGCGTCTCGCTCAGGTAGCCTGCCTCCACTCGTTATCG 

TGCCCCCGCTGAGGGCGCAACCCGGGGGCACTGGGGAGGAGCAGTGGGAGAGAAGTCGAA 

CGGNCGGNCTTCCGCTGGGCAGAGCTCAGCAGTACTTGGCAGCATGGGACCCAGCTTCCT 

TCCTGCTCCTGATCCAAAAGGACTTACCTNCTCTGTTGCATGAGGCAGAAGCTTTGTATA 

GCCTGGCCTCAGAGGAAAGCTTAGCTCTGGAAGTGGAGCAGCAGCTGGGCCTGGAGATCC 

AGAANCTGACTGCACAGATCCAGC 

Sequence 979 

AGGCTGNTACGAAGCGAGCTTGGGAGGAGCAGCTGGCCTGCGGGGCAGAGGAGCATCCCG 

TCTACCANGTCCCAAGCGGTGTGGCCCGCGGGTCATGGNCAAAGGAGAAGGCNCCGANAG 

CGGCTCCNCGGCGGGGCTGNTACCCACCAGCATCCTCCAAAGCACTGAACGCCCGGCCCA 

GGTGAAGAMGAACCGAAAAAGMGAMCAACAGTTGTCTGTTTGCAACAAGCTTTGCTA 

TGCACTTGGGGGAGCCCCCTACCAGGTGACGGGCTGTGCCCTGGGTTTCTTCCTTCANAT 

CTACCTATTGGATGTGGCTCAGGTGGGCCCTTTCTCTGCCTTCATCATCCTGNTTGTGGG 

CCGANCCTGGGATGCCATCACAGACCCCCTGGTGGGCCTCTGCATCAGCAAATNCCC 

Sequence 980 

ACCCCGCGTCCGGAAAAGAAGAGTGGCCNGTTCCAGGGGTAGCTCCAAAAGAGACTGCAG 

AGCTGTCCGAGACCCTGACAAGGGAGGCCCAAGGCAACAGTTCCGCAGGAGTGGAGGCAG 

CAGAGCAGAGGCCTGTGGAAGATGGCGAGAGGGGCATGAAGCCAACAGAAGGGTGGAAAT 

GGACCCTGAACTCCGGGAAGGCTCGAGAATGGACACCCAGGGACATAGAGGCTCAAACTC 

AGAAACCAGAACCTCCAGAGTCAGCAGAGAAGCTTCTGGAATCTCCCGGTGTGGAGGCTG 

GAGAAGGGGAGGCTGAGAAGGAGGAGGCGGGGGCTCAGGGCAGGCCTCTGAGAGCCCTGC 

AGAACTGCTGCTCTGTGCCCTCCCCCCTCCCACCAGAGGACGCTGGGACTGGAGGCCTGA 

GACAGCAGGAAGAGGAAGCAGTGGAGCTTTCAAGCCCCCACCACCAGCCCCTCTGTCTCC 

CCCACCCCCAGCCCCAACTGCCCCCCAACCTTCTGG 

Sequence 981 

GCCCCGCGTCCGAAMGAATGGGTGAACCAATCGGCCTTTGTGMTTTATTCAGTGCCTT 
CTCTGTACCAAGCACTGGGTAAGGCACTTTTGTGGAGCATTAGACAGTAACCCTCAAGGA 
GCTAGAGAACCGGATGGGAGACATGAGCGGTAATTAACTCACTTGTTCCCCAGAGTTTCT 
ATTTGTTTTGATTITCTTTTTCTGTGACT^ 

TTCACTATGGCCCAACTAATATAAACACCTGGAAATTACAAGGAAAAAAAATTCTTCCTC 

TAATMCTTTCCAAATTTGTGGAATATTTATTTGTAATAGCAGTTATCAGTTATGCTTAT 

ATAGCATTAAAAATTCTCCTCCTTTGACTACACACACAACCACAGTGTGGTTCTAATCAT 

GGAGATATCAGTMTTTTTAGTMCTGMTTTTGAGGACATTTCTNTGTTTAGCATGTAT 

GCAAACTG ATATGTAATCTG AGGTTCCAAAGTCA A I I ) I I I I CI U 1 1 I 

Sequence 982 

TNGGGAGTCGACCCCGCGTCCGGTTTTTGTGAGGCAGTGAGACCTAAGGTAACCTTTATC 

AAAAGGATGGAGTTGGGAAAAGGAAAACTACTCAGGACTGGACTGAATGCGTTGCATCAA 

GCAGTGCATCCGATCCATGGCCTTGCCTGGACCGATGGGAATCAAGTTGTCCTAACTGAT 

TTGCGGCTTCACAGTGGAGAGGTCMGTTTGGGGACTCCAAAGTCATTGGACAGTTTGAA 

TGTGTCTGTGGGTTGTCCTGGGCCCCACCTGTTGCAGATGATACACCTGTTCTACTCGCT 

GTCCAGCATGAGAAGCATGTCACTGTGTGGCAGCTGTGTCCCAGCCCTATGGAGTCAAGC 

AAATGGCTTGACGTCTCAGACTTGTGAGATTAGGAGGGATCACTACCTATCCTTCCCCAG 

GGCTGTGTGTGGCACCCAAA 

Sequence 983 

GTGTCGACCCCGCGTCCGCGCCCTGCCTGCAGTTGAGATTCAGATGCCTTCTGACAGAGT 

TCAGCCTCTTGGAGAGTCTTGGGGATTGTTGGCACCTAAACAGAATCAGNGACCCGGGTG 

CTTTGTGGCCAGCAGCACAGAATCAAACCCGCATCCCAGCATTGGGCCACCCATCTGAGG 

GAGGCCAAAATCATCACAGATGCTGCTGTGCTGCAGACAGATACATGCTAGTCCAGAGAG 

CCGCCCCTGAGATGGCTGTGAGAACCATGTGTCTAAGGCGTAAGATAAGGATGGAAGGCT 

GTCCAAGTTATTTGGAAGGCCTCGGCAGCTTGGGATTAGCTTGGGAGCGCAGCGCTGCAA 

AGTGGAAAATATGAAAAGACCACACAGGCCCAAGCAGTCCAGAAACTGGGCAAAAATATT 
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■ CTGCAGTGGGGATTTA I 1 1 1 1 1 
Sequence 984 

CACG CGTCCG GAGTACGGAGTTGTTCCTTTACTGGCTGAAAGATATATTCGAATTGTAAA 

GATGCTT TTTCTCATGCATTGAMTTATACATTATTTGTAGGGAATTGCATGCTTTTTTT 

1 1 I 1 1 1 I CTCCCGAGACAGGGTCTTGCTCTGGCGCCCAGGCTGGAGTACAGNGGCATGAT 

CTTGGCTCACTTCAGCCTTGACTTGGGCTCAAGTGATCCTCCTACCTGAGCCTTCTGAGT 

MCTGGGACTACAGGTGTGCACTCCTCGCCTGGCTAATTTTTTATTTTTTGTACAGGCAG 

GATCTTGCCACCTTGCCCAGGCTGGTCTTGAACTCCTGAGCTCATGCCATCTGCCTGCCT 

TAGTCTCCCAAAATGCTGGGATTACAGGAGTGAGCCACCATGCCCGGCTGGCAGTTGCAT 

GGMGAGMCACCTNTTTATGGCTTACCCTCTAGAATTTCTAATTTATGNGNNCTGTTGA 

AAI 1 1 1 IGGI I I I 1 1 IACCT 

Sequence 985 

GTCGACCACGCGTCCGCTCGGCTTCCTGCTGATGGTCAGGGTTTTGGCAACTCCCCGGTG 

TGAGAGGGGTAGGGAGTGCTCCCGGCGGCGACGGGGCCGAG7TCACCAGCCGCCGGGGCA 

GTAGTCGAAGGCCCGGCGCGGCATGTCCTGGGTGCCGCGGTGCGGGCAGTGAACGCGCGC 

CGGGCGGGATGGGCCGGCGCCGGGCGCCAGAGCTGTACCGGGCTCCGTTCCCGTTGTACG 

CGCTTCAGGTCGACCCCAGCACTGGGCTGCTCATCGCTGCGGGCGGAGGAGGCGCCGCCA 

AGACAGGCATAAAGAATGGCGTGCACTTTCTGCAGCTAGAGCTGATTAATGGGCGCTTGA 

GTGCCTCCTTGCTGCACTCCCATGACACAGAGACACGGGCCACCATGAACTTGGCACTGG 

CTGGTGACATCCTTGCTGCAGGGGCAGGATGCCCACTGTCAGCTTTCTGCGCTTTCAGGC 
ACATTAACAGCA 

Sequence 986 

CGCCACGCGTCCGCGTACGCGTGGGCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTG 

GGGCGACAAGCTGCCGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTG 

GGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCA 

CAGAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACC 

ATTGATAGATTTAATAACTACAGGCTTTTCCCAAGACTACAAAAACTTCTTGAAAGTGAC 

TACTTTAGGTATTACAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAG 

CCAGTGTGGAAGAAGGGACT 

Sequence 987 

GGTCGCCCCGCGTCCGTAGCAGTTACATCTACGAGGCTATTATGGATTGGAGGATGAGAA 

GGGAACT GCATGT ACCTCAACAAGGCGTCGGTCAACACCGCGAAGTTTGGCAGGCTTGAC 

AAGTGGAGTTTTTGAATCTATAATGGTTCAAGTTTTGAGACAGGAAGAACAGCTGAGAGC 

AAAAGAAGAAAAAAGGCTTCGGGAGCAGGAAAGAAAAGAAGCAGAAGAAGCTAGTCAAAA 

GGAAATAGAAGAATGGGAAAGAAAACTTCTAGCTCAAGCAGCTCCAACTTGTATGGAGAC 

CATGTGGGAAATT CCAG CTATTGGGCATTTCCTTTGTTTAGCTCAGCAAATTCTAAATTT 

GCCAGAAATAGTCTTTTACCGAACTGGAACCGTTGTCTTCTGATGCCTCAGTGTAATGCT 

TTTCTATCGAAAATAATGACTTCTTATTTAAGTCCTCCCCATCGCAGA 

Sequence 988 

NCCCCGCGTCCGAGTCCCCTGTCTGTGGCACCAGACACTCCCGACTGTGCGCTGACTCTC 

CCCGCCCAGCCAGCAGCCTTTTCCAGAGAGGCTGTGGTCCATAGCCTCTGTTCGTTTTCA 

CTGCAGGACCAGGCACGAMGTTMMCAMATGMGATTTTTTCTGAATCTCATAAAAC 

AGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATGTCGAGTT 

TGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATATCTAAATC 

ATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATGATATATT 

TCCTTTCAAAAAGCTGGTGAATTTTATTGTGAGTGACTCTGGAGCACATGTTTTAAATTC 

TTGGACTCAAGAAGACCAAAATTTACAGGAGCTAATGGCAGCATTAGCCCGCTGTTGGGC 

CTCCTAATCCTCGGGC 

Sequence 989 

GTCGCCC ACGCG TCCGTTCGTTGTCTGATGGACCTGCTTGCAAAAGGCCAGCTCTGTTGC 
ATTCCCAATTTTTGACACCACCTCAAACACCAACGCCCGGGGAGAGCATGGAAGATGTTC 
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. ATCTCAATGAACCCAAACAGGAGAGCAGTGCTGATCTGCTTCAGAACATTATCAACATTA 
AGAATGAATGCAGCCCCGTTTCCCTGAACACAGTTCAAGTTAGCTGGCTGAACCCCGTGG 
TGGTCCCTCAGAGCTCCCCCGCAGAGCAGTGTCAGGACTTCCATGGAGGGCAGGTCTTTT 
CTCCACCTCAGAAATGCCAACCATTCCAAGTCAGGGGCTCCCAACAAATGATAGACCAGG 
CTTCCCTGTACCAGTATTCTCCACAGAACCAGCATGTANAGCAGCAGCCACACTACACCC 
ACAAACCAACTCTGGAATACAGTCCTTTTCCCATACCTCCCCAGTCCCCCGCTT 
Sequence 990 

GTCCGGCTGGGACCTCCTCCTGTTGGGGTCCCCATGAACCCTTCCCAGTTCAACCTTTCA 

GGACGGAACCCCCAGAAACAGGCCCGGACCTCCTNCTCTACCACCCCCAATCGAAAGACA 

ATGCCTGTGGAAGACAAGTCAGACCCCCCAGAGGGGTCTGAGGAAGCCGCAGAGCCCCGG 

ATGGACACACCAGAAGACCAAGATTTACCGCCCTGCCCAGAGGACATCGCCAAGGAAAAA 

CGCACTCCAGCACCTGAGCCTGAGCCTTGTGAGGCGTCCGAGCTGCCAGCAAAGAGATTG 

AGGAGCTCAGAAGAGCCCACAGAGAAGGAACCTCCAGGGCAGTTACAGGTGAAGGCCCAG 

CCGCAGGCC 

Sequence 991 

NCGCGTCCG CTTAM TGACTCGTTATCATTTTGCAATGAATGGAAAATCATTCTCAGTGA 

TACTGGAGCATTTTCMGACCTTGTTCCTMGTTGATGTTGCATGGCACCGTGTTTGCCC 

GTATGGCACCTGATCAGMGACACAGTTGATAGAAGCATTGCAAMTGTTGATTATTTTG 

TTGGGATGTGTGGTGATGGCGCAAATGATTGTGGTGCTTTGAAGAGGGCACACGGAGGCA 

TTTCCTTATCGGAGCTCGAAGCTTCAGTGGCATCTCCCTTTACCTCTAAGACTCCTAGTA 

TTTCCTGTGTGCCAAACCTTATCAGGGMGGCCGTGCTGCTTTAATAACTTCCTTCTGGT 

GTGTTTAAATTCATGGCATTGTACAAGCATTATCCAGTCTTCCAAGTGTTACTCTGCTGT 

ATTCTATCTTTMGTAACCTAGGAGACTTTCCAGTTTCTCTTAATTTGATCTGGCAATCT 

TTTGGGTAAGTGGGTATTTANAAT 

Sequence 992 

TTITCACTGCAGGACCAGGCACGAMGTTAAAACAAAATTGAAGATTTTTTCTGAATCTC 

ATAAAACAGTGTTTGTTGTGGATCACTGCCCTTATATGGCAGAATCTTGCAGGCAGCATG 

TCGAGTTTGATATGCTGGTGAAGAATAGAACCCAAGGAATCATTCCTTTGGCCCCCATAT 

CTAAATCATTGTGGACTTGCTCAGTAGAATCTTCCATGGAATATTGTAGAATAATGTATG 

ATATATTTCCTTTCMAMGCTGGTGMTTTTATTGTGAGTGACTCTGGAGCACATGTTT 

TAAATTCTTGGACTCAAGAAGACCAAAATTTACAGGGAGCTAATGGCAGCATTAGCCGCT 

GTTGGGCCTCCTAATCC 

Sequence 993 

CGCGTCCGGGCAGGAGCACCACTCAAGGAGCTACACCCCTTGCATCGGCTTGACCGCCTT 

ACCTCAGGGGTGCTTATGTTTGCCAAGACAGCTGCAGTCTNTGAGAGAATTCACGAGCAG 

GTTCGGGACCGGCAGCTGGAGAAGGAGTACGTGTGCCGGGTGGAAGGGGAGTTCCCCACT 

GAGGAAGTGACCTGTAAAGAACCCATCTTAGTGGTGTCTTACAAAGTAGGGGTGTGCCGT 

GTAGATCCCCGGGGCAAGCCCTGTGAGACAGTGTTCCAGAGGCTAAGCTACAATGGCCAG 

TCCAGTGTGGTACGGTGCCGGCCACTCACAGGCCGCACACACCAGATTCGAGTCCACCTT 

CAGTTCTTGGGCCATCCCATTCTCAACGACCCCATCTACAACTCAGTTGCCTTGGGGTCC 

TTCTCGAGGCCGGGGCGGCTACATTCCCAAGACAAACGAGGAGTTGCTACGGGACCTGG 

Sequence 994 

ACGCGTCCGCGACCGCTGGGCATGCGGGTGTTGGCGCGGTATCCCCGCCCTGCCCAGCAT 

CTGCCCCACGTTTCTTCAGGCTAAACTACCGGGATCCCGGGCTTCTTCCTAAAGTAAAAC 

TCGCTCCGGAAAGGCCAACAGTCCAGCGGCCAGACGGGCACCTGGGAACGCGGGCCTAAC 

GCGTACTGGAGACGGAGTGGCGCCCGGCACTGCGCGCCTCCTCCCCGCCGGGAGACTGCG 

TGCTAAGCTCAGCAAAGCCCCGCTGTGGAGACGGAGCCATGTCGCCCATTACCTAATGAA 

ACTGAGAAGGGAGACTCAGTCTCTCTTCTAGCCCCGAGCGCAAGCTCTGCTGGACTTGGC 

ATCGTCCGCCCTCCACGATCCCACACTCCGGGTTTTCCCCATTCCCAGCTCGGCTGCAAC 

CGAGAGACAGACGGAAGAAAC 

Sequence 995 
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TCCTCCTGGCCCTGTTAATGTCGGGGCCNGGCCGGGGGAGGATGGCGCCCTAGAACCCGG 

CCTTGCTGGGGTAGGGGCGGGAGGGGACGGGGTGGGGACCGGCCATGTCGGAGGTGACCC 

GGAGTCTGCTGCAGCGCTGGGGCGCCAGTTNTAGGAGAGGCGCCNNACTTCGACTCTTGG 

GGCCAGCTGGTGGAGGCGATAGACGAGTATCAGATATTAGCAAGACATCTACAAAAGGAG 

GCCCAAGCTCAACACAATAATTCTGAATTCACAGAAGAACAAAAGAAAACCATAGGCAAA 

ATTGCAACATGCTTGGAAT 

Sequence 996 

CGCGTCCGGCCTGAGCCGGCGGGTCCCCTGTGTCCGCCGCGGCTGTCGTCCCCCGCTCCC 

GCCACTTCCGGGGTCGCAGTCCCGGGCATGGAGCCGCGACCGTGAGGCGCCGCTGGACCC 

GGGACGACCTGCCCAGTCCGGCCGCCGCCCCACGTCCCGGTCTGTGTCCCACGCCTGCAG 

CTGGAATGGAGGCTCTCTGGACCCTTTAGAAGGCACCCCTGCCCTCCTGAGGTCAGCTGA 

GCGGTTAATGCGGAAGGTTAAGAAACTGCGCCTGGACAAGGAGAACACCGGAAGTTGGAG 

AAGCTTCTCGCTGAATTCCGAGGGGGCTGAGAGGATGGCCACCACCGGGACCCCAACGGC 

CGACCGAGGCGACGCAGCCGCCACAGATGACCCGGCCGCCCGCTTTCAGGTGCAGAAGCA 

CTCGTGGGACGGGCTCCGGAGCATCATCCACGGCAGCCGCAAGTACTCGGGCCTTATTGT 

CAACAAGGCGCCCCACGACTTTCAAG 

Sequence 997 

GTCCGGCCAGGAGCCAGGCCGAGCGGGAGCTGACCANGGCTTGACTCGGGTACAGAACGA 

GGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCAGTGGATGGAGGAGGCCCAGCCCTGA 

GGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCCTACAGGGTGGGTAGGCGGGCGTGCC 

GCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCCGAGGGGCCTACGCTGCGGCCCGGCA 

ACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCCCACCCGATCGGAAGGCGGATCCTTT 

ACCAGGGCCATAGGCCAGTGACTGGGCGGGCCCCTTNGGGCCTCCCATTCGGGGCCCGGA 

CTANGGAACNAGGCCCGGNNGAGGCCCCTTGGCCTACCAGACCCTTTTNTNANGCCGACA 

GCCGNCANGGAAAGAT 

Sequence 998 

CGTCCGGCCCAGAGCCCGCAGCACGCCGCCGCCGCAGCCTAGGTCACCTCCAGCATCTAG 

CACAACGTCTGCAATGGAACAGGCGAGCTGTGAATATTTGTGGAATGCATGGGTGGACTA - 

AAGACCTATCACCTCACTCTAGAATGCCCAGCATGTTGGAGCATGAGGACCAAGAACCAT 

GGTGTTCCTCCACTCATCAGAGCCGTATCATTTTGATGCATGCGCCAAGAAAGAAAATTC 

AATCATCAGACTGAAGCAATCAAACTCCAAATGGTGCTGTAAACTGAACCACACATAGAC 

ATGCCATTCTTCTAAGGACCCTTAAGATCCACCCCAGGAGGAGCGCTAGCTGCTGTTCCC 

CATTCGATGCCCCTTTTCGGCCCGGAAGTAGCCGGAAAGATTCGCCCGCCAAAATTCCCC 

TAACCAGCAAGTTAGGTGTGGCATCTTCCACAAGCANGGAGCCGTTGTAGGAAAAAGGNG 

GTCTTGGGGAAGGTTTTTCG 

Sequence 999 

CCCGCGCCGGCAGTTTCNATGGTGTGTACTAATTTGAGAAAATGAATGTGTATACATACA 

AGAGTMGTCAGATTGTTAGACTCATCCCTCAGTATTCCATATGTTTTGTGACTGATTTT 

ACAGTTCTCTCTACCTTTCTCATTTACAAAMMAAAMGAAAAGAAAATTTGATTCAGC 

AATTCCTAAMGTATTGTATTCAGTGACATCTTTGGAMCACCAGTTTCTGTTATCAACT 

TCAAATAATAGTCMGTTTTATGTATGATCTAMGGGAAAACAAGTTTGTTTTTCAATCC 

TGTGATAA 1 I I I I CTTTTAGAATGAGGTGTTGCAAGAAATGGAAAATTAAAAAAACCACT 

CTGTMCMTTrTGCTGTGCTTCTTTGATTTTTCTCTGTT^ 

TATTTGTACCTTTACATATTGMTTCATGAGGAGAGGTTATGCACAGCCTAGTTATTTGA 

CATTCCAGGGGGTTTAAAAAAAA 

Sequence 1000 

CCCGCGTCCGGCGGTGGCGGTGGTGGCGGTGGCGGCGGTGGCGGCGGCGGCGAAGGGGGC 

GGAGAGGAAGGAGCGCGGCGGGACCGGGCCGGGACAGCGCGTACTTTGGGCTCCGGGAGT 

CGCTCCGCGCCCGCGGTTGTAGCAGCTGCCGCTGCAGCCATAGCAGCAGGTCAGTCATTG 

GCACCATGAACTGGAATAAAGGTGGTCCTGGCACTAAGCGAGGATTTGGCTTTGGAGGTT 

TTGCCATCAGTGCTGGGAAAAAGGAGGAACCCAAACTCCCACAGCAGTCCCACAGTGCCT 
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TCTACAAMTTGGATCTAAGCGGGCCAACTTTGATGMGAAMTGCCTATTTTGAAQATG 
AGGMGMGATTCTAGCAACGTTTGATTTACCTTACATTCCTGCT 
Sequence 1001 

CCGGCCGCCGCCGCCGCCCCGCCGCCACCGCCTGGGGGTTGGTTGAGGCGGACGGCGGGG 

TCCGGGCCGGAGTACGTCGTTCCCGCTGCGCTAGGGGAAGCGGGCAGTCAGAAAAATGGG 

TAAGAAGAGTCGAGTAAAAACTCAGAAATCTGGCACTGGTGCTACAGCAACTGTGTCACC 

AAAGGAAATCTTGAACCTGACCAGTGAGCTGCTGCAGAAATGCAGCAGTCCGGCGCCTGG 

CCAGGAAGAGTGGGAAGAGTATGTGCAGATCCGGACTCTGGTTGAGAAAATACGAAAAAG 

CAAAMGGTCTTGTCCGTTACTTTTGATGGAAAAAGAGMGATTACTTTCCTGATCTAAT 

GAAATGGGCCTCTGAAAATGGGGCTTCTGTCGAGGGTTTTTGAAATGGGTTAACTTCAAA 

GAAGAGGGGCTTTTG 

Sequence 1002 

GTCGACCACGCGTCCGACGCACCAAAGGGCAAATACTCGGTAGCGACTCAGAGGGAAAGT 

GGGGTCTCTCCTGGGAGAGCAGGAGGCTGCCAGAAAAGAACTCAGGTCAGGGGTGCATAG 

GCGGCTGAGGAGTGCGGGACGGGCTGAGAGTTGGGGTGCCTCCCCGCCCGCAGGTGGGTC 

CGCAGATCCCGCGGGCCGCATTGGCCCGGCTGCTGCGGGATGCCGAGGGGCTGCAGGAGC 

TGGCACTGGCGCCGTGTCACGAATGGCTGTCAGACGAGGACCTGGTGCCGGTGCTGGCGC 

GGAATCCGCAGCTGCGGGAGTGTTGGCGTTGGGCGGCTGCGGGCAACTGAGTCGCCGGGC 

GGTTGGGGCTTTGGCCGAGGGCTTGCCCACGCCTGCAGCGCCTGT 

Sequence 1003 

CGCGTCCGCTTTNCCTTCTTGGTTCCACCTCAMCATCCCTTCCGAAGTGAGGCTTTCCC 

TGACTGGGGAGCATAAAGTAGCATCTCTCACATNCCATACACCCCTACAACGAATCTATG 

CAATGGCCCTGCTCTGCCATCGCCACCTGAAACCATCTCAATAAACACATTTTGGATAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGG 

Sequence 1004 

ACGCGTCCGTTGGCTGCGAGGAGCGCCCGAAAGGTCAGAGGAAGGAGCTGTGGGAAGCTC 

GCAGCAGGTATCGGAGCTTAAGCCAGTGGATTTGGGGGCCCTGGGCTCCCTAGCCGGCTG 

CGGTGTGAGAATGGAGTGGGCAGGAAAGCAGCGGGACTTTCAGGGCTCTGGGTTTGCAGA 

GAGCCGAAATGACCATGACTGCCAACAAGAATTCCAGCATCACCCACGGAGCTGGTGGCA 

CTAAAGCCCCTCGGGGGACTCTGAGCAGGTCTCAGTCAGTCTCTCCACCTCCAGTTCTCT 

CCCCACCAAGGAGTCCCATCTACCCGCTCAGTGATAGTGAAACCTCAGCCTGGAGGTACC 

CCAGCCACTCCAGCTCCCGGGGTGCTCCTTAAGGGACCGGCACCCCCCACT 

Sequence 1005 

NCCACGCGTCCGGCAGCGCTGCGACGGGACCGCGCGATTCCTCTCCCACGCATCTGGCCC 

GCGTTCCTGGGCCTCGGCACCGGATCCCGGCGGGGGTGTGGACCCAGGGCCCACTCTCCC 

CGGCGCGGCCAGGGCCCCCCAGCGTGCGAGCGCCTAGGGGATGCCGAGCTGCTCAAGATG 

AGGAGGTGCGCGGGGCCGGGGCGGAGCAGTCGCAGTTCCCCGCGTGTGAGCCCCCACCCA 

TCCCTGGCGCCAGCGCTTTCCCGACCACTCGGGTTCGGCTATGCGGGAGCCGNGAGGAGG 

AGGCTTGCAGTCGTGGACCTGAGACCTCGGGAGGTCATGCTGTCTTGCTTTTAAGTGGCT 

TNGGGGAAAGTGAAAGAAAACNCCNAAAATTGGAGGACTTTGCTACCAGGGACCTAACGG 

CACCAGTGG 

Sequence 1006 

ACCACGCGTCCGGGAAAAGCCCGGAAGTGCCACGGGACTTCCTGTCTAAGGAAGAGCCTC 

GTGAAGCTCCTCCACTGGGGAGTCAGTGGCCTTCGTTGTATCTGCCCCGCTTGTCCACCT 

CCTAGAGTGAATCCCCGCCTGGAGGCTGGGACACTAACCAAGAAGTGGCACATGGCATAT 

CACGGGAGCAATGTTGCCGCTGTACGGAGAGTGCTGGACCGAGGGGAGCTGGGAGCAGGT 

ACTGCCTCCATCTGANGCCGTCCTTTGAAGGGAGAACCTGGGGTAGGGTTCGAGGAGCCN 

GCGAGAACTGTGCACCTCCTCGGGAGGAGCAGCCCCCTCCTGTGCTGCTTTCCCCCTCCC 

TTCAATATGCTGGGGGCGGAGACCCTGGCCTCCAAAGTGCAATTCCGGGACCCCAAATCC 

CAGCGGACGCACCAGGCTTTAGGTGGGCGTCAAGTTGNTGTGTGCCCCCTGGCTTCTACA 
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■ CCCCGGGACCCCCTTCCG 
Sequence 1007 

TCGCCACGCGTCCGGAAAMTTTATGCCTTTTTATTCATAACCCAGCTGTGGACCACTGC 

CTGAAAGGT TTGT ACAGATGCATGCCACAGTAGATGTCCACATAATAAAATTCATAGTTA 

CCMTGCAGTTTTGATATATCATTGGATTCTGTCTTTGAGTTGTAGGTTATTTCTTAGCT 

GCATGTTTTAMCTGAATTTGCATAGAGTTGTATGTTAATGTTTCAGTTAAGAGAAAAAC 

TTAAGATACATGAGTCATTACATAATGGGTATGAAATCTTTATAATCACCCTTCCACCCT 

CTATGGTGTCAGTACACATCACGTGTCATAGATACTTAAAATGTAAATGTTAACACTTTT 

CCTTCCTGCTGAGGATGTTTAGAGCCTAGTGCCAGACCCATTCATTTCCTTTTGATT 

Sequence 1008 

GCGTCCGGGCNGCGGAGTTTTGTCCATAACGTGGGCAACCGCGCAGCTGGAGGATGGCCT 

CACTCGGGCCTGCCGCAGCTGGGGAGCAGGCGTCGGGGGCTGAGGCGGAGCCGGGCCCCG 

CGGGGCCGCCGCCGCCGCCCTCACCGTCCTCTCTGGGGCCCCTGCTCCCCCTGCAGCGGG 

AACCTCTCTACAACTGGCAGGCGACCAAGGCGTCGCTGAAGGAGCGCTTCGCCTTCCTCT 

TCAACTCGGACTGCTGCGATGTGCGCTTCGTACTGGGCAAGTTGGCGGNGCCGCCGCCCG 

CTGGGGGCCCGCAGCGCATCCCCGCCCACCGCTTCGTGCTGGCGGCCGGCAGACGCCGTC 
TTTG 

Sequence 1009 

GCNCCCCGCGTCCGTTAGAGCTCAGGAAGTTATTAGGTGCAGCCTCTGGAGCCATACTCA 

CGCTGCAGTGCATAATGGGAAAATTAGGAGCATTAATAAGAAATTTCAGTAGTGTTTGTA 

AGGAAAATAAGCTACTTACTGAGATCTGTTTCTTCTATTGCATGTTTGCTTTTGAGGGAC 

AGCTT CTGTCA AAAGTGAAATCATCACCAGAACTGGGCCTGTTAGGAAGAATAQGGTTTT 

ATTTACI I I 1 1 ATGTCAATTAACTTCAACAAAAAGGCCACGCTGGCTGCTGTCATGCCAT 

CTGGGTATGCATTAAACATTAATGATGATCAGCCTTGAGGTTCTATTTATCTTGATTTGG 

CTTTATAAAGTTTGTCAGAATGGTGCTGGAGGNCCAGAAGTGCTAAGGAGAAAGAAGCTA 

TGGGCCAAGTTAAAGAATTTGAATGCAAAGGCCAGGNATGGGAGTTTTCATAA 

Sequence 1010 

CGCCNCGCGTCCGGTGAGCCCCAGCAAGGAGATCAAGATCGTGTCTGCCTGAGGAAGCAG 

AGCCATGACAATCGGAAATCTACCAGCTCAATGTCCTGCATGTAGACTACCGGACCGTGA 

GCAATCTGATTCTGACGGGCCCACGGACGATTGTCATGGAAGTCATGGAGGAGTTAGAGT 

GCTGAGCTCCTGGGCCTCCCAGCCCTCCAGTGGCCTGTGGGTGAGGGAAGCCAGAATGAC 

ACAAAGCAATGCAAAGACAAGATTGCCATGCAAATGGATGGTTTTGGACATACGAGTCTT 

CTCCGC ACATAC ATGTCTAAAGTTGAGTTTTATACACTGGAATGTGGAAGAACCCGGGTA 

TCATATCTTTTTTAAAAAATGTCCAGTGTAGAAAACATTTGGGAAAC 

Sequence 1 01 1 

ATTmCTMCATGGGTTTGMCGCTTATAACCAGTTTTATAAACCCCTTGAACACTGCA 

GTGAGTTATCAAAGCCACTGCCTGCAAAGTGGATGATTTAAGATTTTACACGCATGAAAA 

TGAGTGTGCCA TCTCC TGACCAGTGCCTTTTGACTTAGGTACCCAGATGCCACTTGTCAG 

CAGCAGGATACTTTTTACAACACGAAAGCATAATTATTTTAGAAGAAGAGAGTAGAAGGG 

CAGAATAGAATTCAACTTAC AGAA GCACCGGAGTAGTGTTGTGGTTGGCTGTTATCTGTC 

CCCCTGGGAGGAGGGACTGTTTTGCTCCCTTGTTTTNGATGTTAAACAGTAGCTTAAAGG 

CTTTCCCCCCCATAC CAACTT ACAGNCAAATGACAAAGAACCGGTGGNGGT7TTCAACAG 

ATTCTACAAACATGCATTTTTCCCTTCCCACTAAATGGG 

Sequence 1012 

GTCACCNCGCGTCCGCTCGTCCTCCGTGGGCACTGATGTCACCGAGGGCCCTGCTCACCC 

AGCCCCCCACACTAGGCTGTTCCATGCAAATGAGGAGGAGGAGCCAGAGAAGAAGGAGGT 

ATCGGAGCTGCGCTCTGAGCTATGGGAGAAGGAAATGAAGCTTACAGACATCCGCTTGGA 

GGCCCTCAACTCTGCCCACCAACTGGATCAGCTTCGGGAGACCATGCACAACATGCAGTT 

GGAGGTGGACCTGCTGAAAGCAGAGAATGACCGACTGAAGGTAGCCCCAGGCCCCTCATC 

AGGCTCCACTCCAGGGCAGGTCCCTGGATCATCTGCATTATCTTCCCCACGCCGCTCCCT 

AGGCCTGGCACTCACCCATTCCTTCGGCCCCAGTC7TGCAGACACAGACCTGTCACCCAT 
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TGGATGGCATCAGTACTTTGTGGGTCCAAAGAGGGGAAGTGACCTTCGGGTGGGTGGTGA 
AGG 

Sequence 1013 

CGCGTCCGAAGAAAATGGGATCCATGAAGAACAAGACCAAGAGCCACAGGATCTCTTTGC 
AGGGGATGGTATGAATGCATATGTAGCCTACAAAGTTACAACACAGACAAGCTTACCATT 
GTCTCAGMGCAAACAGTTTTGCAGGTAMAAAGNAAAGATNTTAGTGACTTTCTTGGGT 
CTTTATTGAGMGCTTTCCNGAGNAAGCCACTCTCAGAAATTGGCTTCATATGTTCCCTC 
CCGCCCCCGGAGTAAGNAGCCCTCATAGNGGGATTGACATAAAAGTTGAAAAGNTTGNGG 
MCGGAAAGMTTCNTTCTTTCTGGCAGNAATmCTTTGAAAAAMCCNGAGGGGCCCG 
CTTTAGAAAMGGGTACCCTTTCAGGAGGGATTTGTTAAAATTCANTCCTACCCCATGGT 
TTMCAAGGGACTCCTTGACNGTTCAGTMGTAGGTTTCTTTGNNAAAAAAGGAAAGGAA 
GCTGCCCACCGTGCCCG TNGG GGATACCCCNAAGACAATTGANGTTGGGTGGCTTGGNTC 
CTTTNCTCAMGGMTGGTTTTCAAANCAAAAAGCCCACCAAGATTGCCCNGTCCANGCC 



Sequence 1014 

GTCGCCCCGCGTCCGCGGNCGCGTGGGGTGCTNGTCACCAGACTGCACCCTTGCCAGCAG 

CTTCGCAGCTCTCGAAGTAANTTATCGCANGATGGCCGGCGCCTCACCTAGGAGAACCAG 

GMGGCAGGCCNCGCTAGMCGACGGNATTGAATTTTACTATTGNCAAAACAATCACATT 

CAAATTCATTCCACTTAAAACCTGAAAACATTGGACCACACAA 

Sequence 1015 

AGTCGACCACGCGTCCGGGCGGAGGGAGCGTGACTGCGCTGCGCAGGGCGCTAGGAGGO 
TTGTCGCCGCTCAGGCCCTTTTGTGAGAAGCAGACCAGCCTGGGGGCTGGCGGCAGGACA 
CCTGTGTCTGCATGCTGAAGAAGATGGGTGAGGCCGTGGCCAGAGTAGCAAGGAAGGTCA 
' ACGAGACGGTGGAGAGCGGCTCTGACACTCTGGACCTGGCCGAGTGCAAGCTGGTCTCCT 
TTCCCATTGGCATCTACAAGGTCCTGCGGAATGTCTCTGGCCAGATCCACCTCATCACCC 
TGGCTAACAACGAGCTTAAGTCCCTCACCAGCAAGTTCATGACCACATTCAGTCAGCTCC 
GAGAGCTCCACCTGGAGGGGAACTTCCTACACCGCCTCCCCAGCGAGGGTCAGTGCCCTG 
CAGCACCTCAAGGCCATTGACCTGTCCCGGAAACCAAGTTCCAAGGACTTTCCT 
Sequence 1016 

CGCGTCCGCTTTCAGTGAAGAAAAGGGAATTACACATNGAATCGACACATCAGTAATACC 

GATA CAGTGAA ATGGGCCTCTMTAAGMTTTNAGCGNGTTTTCTGATGTGCCATTTTTT 

TTGTCTTTTTTAAAMTATACCATANTTATAAAANTGGNAMTANNTTTTTGNACACCAT 

TTAAATTGACCCCCTTA NAGNACN CTTGCCGTNATGNTGAAANGCTAGACCTATNGAAGC 

TGNCTTGANGATATNTGTTTTTTTAAAAAMTTTTTTACM^ 

TAATATGCACTATAAAATATGATCNTATATCCTATTATCTATNATCTAAAAACACTTCCT 
T GGACNC ATTTANACGTAAAATTAAAAATGGGTCTTTAANGAAGANTAATGGGGAGGCCC 
Ci I I I I i AAMCCTATGGNNCMTCTTTTTATGNCAMGGGGNGGACCATTTTATTAAAA 

Sequence 1017 

GCGTNCGCTGCGCCCGTGGGACCGGTGAAGTTCTGGCGACCCGGTACAGAGGGGCCAGGT 

GTAAGCATCTCTGAAGAGAGACAAAGTCTGGCTGAAAACTCTGGGACAACGGTTGTTTAC 

AACCCTTATG CTGC CCTTTCCATAGAGCAGCAGAGGCAGAAGCTGCCGGTATTCAAGCTT 

AGGMTCATATTTTATACTTGATAGAAAATTATCAGACAGTGGTGATTGTTGGTGAAACA 

GGATGTGGGAAGAGCACACAGATTCCTCAGTACCTTGCAGAAGCCGGCTGGACAGCTGAA 

GGAAGAGTGGTAGGAGTGACCCAGCCTCGAAGAGTGGCTGCTGTTACACATGATCTTTCT 

TNCCAAAGGTTGCAGGGAGAGTAGCTGAAGAAAGGGGTGCAGTGCTGGGCCACCAGGTGG 
G CTACTG C ATCCG CT 
Sequence 1018 

AGTCGCCCCGCGTCCGGTGGGAATCTTTCNACTTCTTGATCCATCTGGGAGAGAAGACGT 

ACCATTATGTGCCCGAATTCCGAAAAGTGTCCATAGCAGCTACCATCATCTATGCCTATG 

CCTGGCTGGTTCCTCTTGCACTCTGGGGTTTCCTCATGTGGAGAAACAGCAAAGTTATGA 



AAAAAAT 
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ACATCGTCTCCTATTCATTTCTGGAGATTGTGTGTGTCTATGGATATTCCCTCTTCATTT 
ATATCCCCACCGCAATACTGTGGATTATCCCCCAGAAAGCTGTTCGTTGGATTCTAGT 
Sequence 1019 

GGAGTCGCCACGCGTCCGGTGGCACGATCTTGGCCCACTGCAAGCTCCGCCTCCCAGGTT 

CACGCTATTCTCCCGCCTCGGGAGCTGGGACAACAGGTGCCCGCCACCACGCTCGGCTAA 

TTTTTTGTATTTTTAGTACAGACGGAGTTTCACCGTGTTGGCCAGGATGGTCTCGATCTC 

CTGACCTCGTGATGCACCTGCCTTGACCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCA 

CTGCGCCCGGCCMTAATTTTTTTAGTTTMGTTCATTTTTGTCCCCGCTGATGAAA 

ACAGCAGCTGCCCAGTGCCTCTGTCCACACCCACCTCCCTGGTGTACTTGCCCCCTACAG 

CAGCAGCCCAGATCCTCCCCTGGATTAAAATTGCAACTGGTGCCCTAACCCAAAACTTGA 

GAAAAAATTCTCTATCACATCCACTCTTCTGGCATTTGTAGAATCTC 

Sequence 1020 

GTCCGGTNGATATATATATTTACCTCCTTAGTAATGCAAGAAGTGTTTGTGGGAAGCAGA 

GAAGCAAGCAACTGTATTTCTTGTTCTCACCTAAGCATTACTGGAGGATAAGCCACATCA 

GTCTACAAAGAGGTTTTCATACAAACATAATAAGATGTAAAATGGACCAAAAGTTGAAAG 

CACATTCTTGCMGTAAGCACCTGTTACTCTCCMGCMCCATGGGTTTACCATATTTGG 

GGATTTTTTGAACACTTAGNCACTTTCTTGCTCCCNMGGGGACNTTTACAAAAAGTGNA 

NACATTTTTGTANTGTNNCCCGTTATTAAAAMGCTAACNTTTTGTMCCTNCTTGTT^ 

CAAAAGGGCTTGGTNTTTTTGGACAAATTCAAAMTGGAAAAATGGATT1TCCACCGTTT 

TMGCCTCAATTTTAAGCCCCAAGGTTCCCAAAMTTTTTTMNGAAAAAAGTTAT^ 

GTATTAGGTNGGNCCTGGGTTAAAAAACCAAGAANAAAACCATTNAAAACAAAGCCCATT 

CAAAAATTCTTNTGGAAAAAAA 

Sequence 1021 

CAGGGAGTCGACCCCGCGTCCGAGCATCTTGGGGMTTTATATTCCTTTGTGAGAAATGT 

TTTGATCATAAGCCTAGAATGATAAGTAAAAGAAATAAAGATAATTCTACTGCTTGTTCT 

CACCCGGTTACAAAGCATGAGTTTGAAGACMTMGTGCCTTGTCCACATTTTGCGAGAG 

ACMCAGTAAAATACTCCAAAATACGTTCTTTTCATGGTCAGTGTCAGCTTGATTTATGT 

CGACATGMGTTCGGTATGGCTGTTTAAGGGMGATGAGTGCTTTTATGCCCATAGTCTT 

GTGGAACTGAAAGTCTGGAT AATGCAAAATGAAACAGGTATCTCACATGATGCTATTGCT 

CAAGAGTCTAAACGATATTGGCAGAATTTGGAAGCAAATGTACCTGGAGCGCAGGTCTTG 

GTAATCAAATAATGC 

Sequence 1022 

CNCGCGTCCGCAAAGCAAACGACAGCTCAGGGGGCTCCAAAGACCTCATTCATAGCAGCA 

AAAGGAACTCAGGTAGTCAAAATATCAGTACACATGGGACGTGTCAGTTTAAAACAGGAG 

CCCCGGAAGAGTCATAGTCCCAGCAGTGACACATCAAAACTAGCAGCTGAAAGGGACTTG 

AATGTGACCATCAGTCTTAGTACTGATAGACCAAAGCAGCGATCACAGGCAGTAGCAAAC 

GAGAGGGCACACCCTGCCAGCACAGCAGTGNCGAAGTCTGGGGAAGCCATGGCCTTAAAC 

AAAACTAAGACTCAGAGCAAAGAAGTCAATGCAAATAAACACAAAGCCAATACGAGTCTT 

CCTTTTCCTMGTTCACTGTCAATTCAAATCGCTTAAGGAAGCAATCTATTAATGAGACA 

CCTTTGGGAAGTTTGTCAAAGGATGATGGAGCTAGAGGGGCTCATGGGG 

Sequence 1023 

CNCGCGTCCGGCCAACCGCCGAGGAGCAGTGCCCTATTCAGCACAGTGCGCCAGGGCCAC 

TGGCAGATTGTTGATCTTTTACTCACCCATGGAGCTGATGTCAACATGGCAGACAAGCAG 

GGCCGCACTCCCCTGATGATGGCTGCTTCCGAAGGCCATCTAGGAACCGTGGACT7TCTG 

CTTGCACAAGGTGCCTCCATTGCTCTTATGGACAAAGAAGGATTGACAGCCCTCAGCTGG 

GCTTGTTTGAAGGGCCATCTCTCAGTAGTACGTTCTCTGGTGGATAACGGAGCTGCCACA 

GACCATGCTGACAAGAATGGCCGTACCCCACTGGATCTGGCAGCTTTCTATGGCGATGCT 

GAGGTGGTCCAGTTCCTGGTAGATCATGGGGCCATGATCGAGCACGTTGACTACAGTGGA 

ATGCGCCCTTTGGATAGGGCAGTGGGGTG 

Sequence 1024 

GTCGCCCCGCGTCCGAGAAGTCAGGGAGTGGAGGTTCTATAAGGAATTAACAGCTGAGGA 
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CGGM GGGTTTGTTTCCCGTTTGAACCTAAACGCAAGTGGAAAAGAATACTCAGAATGTA 

TTTTTCTACTTTACATCTGCTGGGGAAGGAAATGTGTCAGGAAGCCGCTGCATCTGGTCA 

TTTCATCGCATCAGAATCACAGCAGACGTGGAAGATTCCATGTGGTGGGGAATAAAGAAA 

TAACTTTATGCTCTCCTGAAAAACAGCGGGAGCCTATGTGTGTGTGCGACACTGTAATCT 

CAA GGAGA TTCACTCAGAGCTGTCTCAGTCCAACTCCTGCATGACCAGATCTTCCCTTAG 

CATCTTTTCTGTGATGAAATATTATCTTGTGTTAGAGTTAGGAATAGGAACTAACCTGTA 

GGAGCATGTCCCCAAATGGACATTTGAATGGACTAACAAAAACAACTGGAAAGACTGAAT 

TTCCGACACAAAGGAATGATGGGATCAAAAAGAAAGC 

Sequence 1025 

GGAGTCGACCCACGCGTCCGGTCGACAGCCTCCGCCACATCCTCCACCTCTCTTGGTCCA 

GCGAGCGTTGCCGGGCCAGGGTCAAGCGGAGGGCTCCGACGGCGCGGACGGAGCGAAGCG 

CCGAGCCATGGCGCACCAAACGGGCATNCACGCCACGGGAAGAAGCTGAAGGAATTCTTT 

GCCAAGGCACGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTNGAGGACCGAGCAGT 

NTCGTGCCTGGGTGCNCTTCGCAAGGGAGCCAGNTAAGGCNCGCTNGGGGATCAGGGACT 

ATTGAACAGGNGGCCCGTGGCTTGCNCACCTGCNTGGGACCGCCCCAGGCAGGCCCTGCN 

TACCTGGCTCTACCGCGCTTCGACTNACAAGAAATGGCTCAGGGGNCTTTCGAAATGGGG 

CTTCTTTCCTTCGCCCTTGGTTCCGCNCTNGAATAAACCTCCCCCCCGTGGCGGCTTGAA 

AGANTGCCTGTACCGCCGGGNCAATGCNGGGCCCCACAAGTGGAAAAAAGGGAAG 

Sequence 1026 

AGGGAGTCGACCCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGG 

GCCGCCCGACCCCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGGNGATTCTGCTGC 

T7TTCCTGACGGAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATTAACCGC 

GGAGATCTGTTCTCCTGCCCCTAGGACTACGGACCCTGCCGGGCCCTACNTTCTCCGTTT 

ACNTACTACGACAGGGTACACCGCAGTAGCNTGCTCGCCAGTNTCCTGTTACNGGNGGGC 

CTGCNGAGGGGGCMCCGCCCMCCMTTTTCTTACACCCTGGGGNAGGGCTTGCCGAAC 

GMTGCCTTGCTTGGGAGGGATTAGNAAAMMGGTTTCCCCAAAAGTTTGCCCGGCTGG 

CAAGG ATGGAAGTGT TGGGACCGAACCCAGGATGTGAAGGGGGGGTTCCACCAGAAAAAA 

AGGTTAi I I I I CTTTTAATCTTAAAGTTTCCANTGGMCATGGTNGAMAAAAATTTCTT 

TTTTNCGGGTGGGGGTGGTTCACCCGGGAAC 

Sequence 1027, 

CGTCCGTAGTCTCTCTCGTGGCCCGGAAAAAAAAMGAAAGAAGGTTGGGGCCAGTCACC 

CCCACATCCCTTTATGGAGGCTTCCAGATCATGGATCCTGTCACGCGCATCCCGGTGAAG 

AGAGTCCACCAACAGGCTTTGTATGGGGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTG 

GTTCGATATGGCTCACTGCAGCCTCAGCCTCCCTGGGATCAAGTGATCCTNTCACCTCAG 

CCTCCCAAGTGGCAGGGACCGCAGGAAGGCCTGGACGACGGCCCGGACTTCCTCTCAGAA 

GAGGACCGCGGACTTAAAGCAATAAA 

Sequence 1028 

CGCGTCCAACCCCTTCCTCAGCCTGTCGGGAGCAGAGGGCAGTGGCGGTGGCCCCAAAGG 

AGGGACCGCTGACAAAGGAGCCTCAGCCAACCAGGAAAAAGGCTAAATCCACCCTTACCC 

CTCCTGACCCCCCCAAGTGGAGGGAACAGATCCTGGCCTGAGGGGTCCTAGCCTGGAGCA 

GGCGCCTGCGCCCAGACCCTGGAGAGCCTTGACCCAGAGCCTGTGCTGAGGTCCAGGGAG 

TGTGGAGAGCTCCTGGTGTCGAGGACTGAGACTGACAGGGGAGCCCCCTCCATCTGGCCC 

CCTTCCCTTTCCGCACTGTCCGCTTTGTGAGGCTCAGAGGAAGGACAGTCTGCAAGCCCG 

CCTAGGAGGTCCATCCCCAGCAAATGTTTTGGAGGTCCCCCCAGAGAGCAAAGTGGGCCA 

TGGCAAGAAGTAGGGGGTTGGTTGGACCTGTCACATGAAATGGATCAACACTTGAATGGG 
GA 

Sequence 1029 

CGTCCGGAGAAGATGGGCCTCCCGGGCTCAGACTCACAGAAAGAGCTGGCCTGACCACCA 
GGCACCTCACTGGCACTGCTGACCCATCCCAGAAACACAATCTCAGGGACCCGAGCAGCT 
CCAAGGACGAGAGGATACAGCAGACACAACCTAATAGAGAGGGCGCCTGCAGCCTTAACC 
TCCACGGCCTTCGATACTTATGCAAGCCTGGTGTTGCTCCTGTCCTCAGAGTCATCCTGC 
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GCTCATGCCTTTTCCCGAATGGGTTCACCTCTGGCAGTTGCCGCTTCAGTCTTGGCCTTA 
GCCTCATCTTGAAGTGGGTAGCTGGCGGGAGAGGGTGGGCTGCGCCCCCTGCTGGCCCTG 
AGGCTGCAGAGTTGGGAGCAGGACACCTCACCTGAGTTTCAI 1 1 1 I I I I CATGTCCAAAC 
CATGCACATACTATAGTCCAGAATCAAAGCACTTTTGAAAA 
Sequence 1030 

CCCCCCCGCTTTTTTNNANNCNCGGGGGGTGNGGCAAAACTTTTTCCTAGAATTNGCGCC 

ATGTTGGAAACNNTNTCNCAGCANCCCGGCTGCACAGGGNGTAGCCCANCCTCCAGGTCC 

ACGGAGTGGTGTGGACCTCCCACCTCACAGCTGCCTCTGGCAGCCAAGCCTCTTTTCGCC 

CGGCCCCAGCCCCTCTGGTTGATAAACGGGTGGGCCTCCTCAGCAGCGTGGCTGCCTTTC 

ACCTTGATTTCCCCAGGGCTCTCGGCAACATCGATAAACCAAGCCTCGCCCACCAGCTGG 

GCCCTCCCCCACCCAGTCTGCCAGGCTGGGAGCTGGAGCTTGCTGAGTCTTGAATGCCCT 

TCTAGATGGCTTCTCTAGAGGCTCTCCTGGCAAGAGAGGGTCCCAAGGGGAGCCCTGCAA 

AGCAAAGGCTCCTTGTCTGGGGCGGGATAGAGAATCTCGCCTCTGTCTGGTGTGTTACCT 

ACTGGGGGCACAGGAACAATTTCCTCAAGGAGACAGTGGCATGGAGCTTTGAAAGACGAA 

GTANGTGTTAGCAAGGAAATAAGGAGGAACGGGGGTTACGGGCAGAGGAGAAAGCACATG 

CCMGTCAGCAAAGAAAAGTAGAATTCGAAAAACTTTTTTA 

Sequence 1031 

GGCCAGAGCTACTACGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGC 

CGCAGCCCGGGAGCGGAGCAGCCGGTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGC 

CGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTT 

CCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCA 

GGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTT 

TGTGGACTATGCTTCCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCG 

AGATTATATTCAGTCCA CTATG GAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGC 

CAAGACAATTGGGCAGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCT 

CACCACCTCCAACATGCTNCTGAAACCCCCCTGGAACAAGCTGAACATTGTGCTTATAGA 
CT 

Sequence 1032 

TCG CCCCGC GTCCGCMTTTCTTTTGAATTTCGATCACTTCTACATTCCCAGCTTGCCAC 

ACTCI I I I I IGATGAAGTTGTGAAGCAGATGGTAGCTGCCTTTGAAAGAAGAGCATGTAA 

GCTGTATGGTCCAGAAACAAATATACCTCGGGAGTTAATGCTTCATGAAGTCCATCACAC 

ATAAAGGCAAAAAAGAACTCGTGCCACCTGCTTCTGACTTTAGTTTGTTCACTTTTAGGA 

AGTATTTTCATGACATGTTTTCAGAAGCCAGAAAGCATTTGTTAAACGCAGCTTTGGTTA 

TAAACCTGCACCATTGAAAATTTGCACATAGAATATAGACTCACTTGTACATAGAATTAT 

TTCTTCAAGTATAATTCAAAATAATATGGACATTATCATGTTCTGCATTACAATAATGGG 

ATGTCATCACCATTGCTAGAATACTGGCATGATTCTTCTGAGCAGAAGTTGAAACTGTAA 

ATTTAAACCTTTTAATTATCACCTTACCT 

Sequence 1033 

NCGCGTCCGGCCTTTTGTTCAGCTTGCCGGGCAACCGGCCCCTGCTGGGGACTACAAGTC 

CCGTAAGCCTCCGCGGCGGCACGTCCTACCCTACACTGTCCAGCCGGCTCCCTTTTTCCC 

CCTCCCCGGGGGCCAAGGGCTCCGGCTGCTGCCTGGCGGCCAACGGGCCAGGTAGGATTT 

CCGGGAGAGGCTGCTGTGGAGGCTGAGGAGGCGGCGGCGGAGATCTGGAATGAATTTTCT 

ATCTGTGAAATTGTGAAGACGAAAAAAGAAAATTGTGCTAGTTTTGCTGTTGTACTCCAC 

ATTGCAGAAGTTACAATCACAGTGCACAAAACAGTATCTCACCTCCCTAAACTGGTTAAT 

AGTGGCATGGAAGATCCATTTGAGGAAGCAGACCAGCCCACTACAGAGCCAGGCATGGTC 

CTGGACAGTGTGGAAGCAGGAGACACAACACCTTCTACCAAAAGGAAGAGCAAGTTCTNA 

GGCTTTGGCAAGATCTTTAAGCCCTGGAAATGGAGGAAAAAAAAAAGTAGTGATAAATTT 
A 

Sequence 1034 

CGTTTTNNTTCTTCCCTTGGATNAATATTTTCTACTGAAAAGAAATGAAATCTCAGTTCC 
ACTATGATAAAAGAAACGAATCACCAGTGTCTGTCAGTGCCCCCACTGCTCCTCCTCATA 
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CCAGCATTGACACTGGTAGCTGAGGAAGGGGACTGCTCAATTTCTAATGTGATCTATTCA 

AGAAGCCACATATAAAAAGGCATTGAGGGCTCACTGTCCAGAGAATTGCTTTTGTAAATG 

CTCCACAACGTATGGGCAAAAGTTTAAGGACTACTGCCTGATGTACCAGGAATCCTGAGT 

TCTGTGAAAATCTGTTCATTCCAMTCTGTTAAGCATTTCCAAACATCCAGAGAATGGTT 

TCACTGTTGAGGGTGCAT GTGG CAGAATCTGTCTCTCTCTGCACCTGTCTTCTGTTACCA 

TCCCTGGACAGTGACAGATTTTMGCCAGCCACCAGAATTCTTTTCGAGTTAACCATTTC 

TTAGTTTCTGTAAA 

Sequence 1035 

GTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCCTCCCTCCCGGGGGGTG 

CACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTC 

CTGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGCGCGCC 

GGCAGCACACAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCCGCCGTCG 

Sequence 1036 

GTC CGCC CCGCGTCCGG GAAAA TGCCGCAGTTTGTTGCCTTGAAACCTAAGAGCAATCCT 

TGGTTTTGTTGCTACATTATTTTTCCAGACCAACACATCTACCAAGTNGMTTTTATNNA 

CTTTMTTTCATAATAAAGTTAGTAGAGTCACTCMCTTACAAACTTTATTTATGNTGGC 

TTT GGGC AAAAAAATCACTTATAAGGCAGCTCTAAATTTGCCTTGATAAGCTAAATAAAT 

TACTTTTATMCTTACTAAAGCAGAACAMCAGTGAAACTTTCTAAAATATTCTATNCTG 

GAAATAGNG GACAGG GG GATCTTTT ATTTATAATNCTCATCAAGATGAGTGAGGTTGTTC 

ACCAGGATATTTTTTATGG I I I II I 1 1 I MTTTTTCTCCAAAGNAATTATTTTTAATAGG 

AATTCCCAAAAGNMTCMGAAATTAGTTTTTCAAAAATAMTTTTTTCCAGNTNGATAA 

MAGGAAGGTNGTTTGTAAA ATTAA ATCCATTATTTTACCACCTTAAAAAATTTGGGGGG 

MTACCATTCTAMGGGMCCTTTTAATTCTTTACCTAATTCANGNTAGGGGNGTCTTGG 
C AATTTN G AAT ANTTTTCNTTTT 
Sequence 1037 

GCGTCCGAAAATATTTAGGTAATGTCATAAAATTTATTTTACCTTTCTCATTT^ 

AMTAAATGAAAAAMCCCTAGATATTGCTTTATTACCAACAGTGTGTAGGTTTTTGTAC 

ATATGGAAATTTGACACAMAAAATAGGGMTTTGTATAGAGAAGTTTCCCTCTTATAAA 

AGGACTCCCATTTGATTGTTCGAMCTATAAMTGCACTTTTACTTTACCATATCTTGAA 

ATGACAAAATATCGCCCTTTGGMMCCTGACTCTTTGCACCGTTGTAAATTCCCAGAGT 

CTACCT CAGTT AACC AG G CCTT AG TTTT AG G C AAG G AATG AATTG AATTAAAATTTC AGT 

TCMTCATTTrATGCCAGMTTTTGTTTTTCTTTTTAAGGCACCATCCTTCCCTCC 

CTGGNTGGCCCTTCCCTCCCCATTTMCTmCTTTTTTTMTNCTTTG 

TTAAAAATATTTCCMTCCTT1TCCTTTTCCTCTAGCCAAAGTNGGTTTTGTNATTTNCN 

AAATAAAAGGGCCCTCTGGTGGAAAANGGGNCTGGAAATTAACCTT 

Sequence 1038 

CGCCNCGCGTCCGCAAGACTTTGAAAAATNNGATCATGGTTCTTCTCAAAATACCAGCAT 

GTCTAGCATCTATCAGMTTGTGCMTGGAGGTTTTGATGTCCAGTTGTTCACAGTGTAG 

AGCTTGTGGAGCTTTAGTTTATGATGAAGAAATTATGGCTGGATGGACAGCAGATGACTC 

AAATTTGAATACAGCTTGTCCATTCTGTAAAAGCAACTTCTTGCCTCTTCTCAATATAGA 

ATTCMAGATTTGAGAGGTTCTGCAAGCTTTTTCCTGAAACCAAGTACCTCTGGTGACAG 

TTTACAAAGTGGAAGCATTCCATTGGCAAATGAATCCTTGGAGCACAAACCTGTATCCAG 

TTTAGCAGAACCTGACTTGATCAACTTTATGGACTTCCCAAAACATAACCAGGATCATAA 

CTGAAGAAACAGGCTCTTGCAGTTGACCCAAGTGATGAAATAAAGAGAGCCAGTGGGAGA 

TGTCCAAACT ATTG A AAATTTCATCTGTG N CTTAATAG GTTTATC 

Sequence 1039 

TGANATTAGCATCACTTCGTCTACTMGAATCTTMTAGATGTAAAAATATCTTTTAAAA 

CATATGGTAGGATGGGTAAAATTTGGCAATACTATCCAGGAAGTCACTAAGTACAGATGA 

ACTGATTTAGTCCTAATTCCAAGAAGTGTGATTCCACCTACTTGACTAGAAATTTATACC 



• 
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TGGTAATAACTCCTTGTCCTTGMGATTTTCMCTMGGAAMCTGTTTTTCAGCAGGAC 
CTGATTATGCA CTGCTA TCTAGGTAGGGTCACTTATGGTTTTATMTATATTTAATTGGA 
TTATAATATTCC I I I I I I CTTGTCTCTTG G AC AA AATCCT AG CTTTACTGT AATTTAAAA 

AGATGAGTTTAAMTTTCAGGCTTTAAAAACATACCAAACATTGATAAAAATGAAATCTA 
GATAAAAGTATTTTATCAATGTTCAGTTGCCTGGATTCAATAACTGTATTATGGTTAtGT 
AAGGATAATATCTTAGGAAATCACATTATGGTATTAA 
Sequence 1040 

GTCGACCCCGCGTCCGGTAGCTTAGTTGAGTAGATAATCTTTTGTTGTTTCCTCCTTGTA 

ATATACAAGCCTTGGCTTCTGTGACATCATACTCTCCTAGATTTCCCCCTGTCACTGTGG 

CTTCTTCTCAGTCTCTGTCCATCCCTGGTGCTCCTGAAGGTTCTGTTCTCAGCCTTACAC 

ACATTACCTGGGTGATCTCATTCTCTGCCATGACTTCACTTGCCATATATGTGCTGATTT 

TCCCCAMTTCCTATTTCTCCCGACCTTTACATCTATTTTATTTGCAGGTCATATATCTA 

ATAAGGAATTGATATCCAGTGTACATGTAGAACTCCTGTAATTCAATACAGAAACCAAAC 

AGTCCAATTAATAMTGGAG MGA GATTTGMTGMCATTTTTCTAAAGAACATCTCAAG 

CTCMGATTTCCCAGATAACTTTTCTTTCTTCAAAATCTGCTTCTGTGTTTCCTCATCTG 

TAGGTGGCACAGCATACATCTGATTTCCCAAGCCAGAAACCTCATAGTTATTCTTGACTC 

CAGGAAGAAATATTATTGAGTTTTTAAAAACTC 

Sequence 1041 

CGACCCCGCGTCCGTGCTGMCTGAGCTCAGGTGTGTTTTTCTTCCAAGCTTTCTAGCAA 

GGTTTCTA CTTAA MTCACCTGTGTGCAAGCCCAMGGACATTTCATCTATTCTAAGCAG 

AAAGGCTGTTTTGTTCATTACAGTGAGTGCTGTTCATCTCATGGAGTGGGAGGAGCACTA 

AACCAGGAGACAGAGGACATGGATTTGGTTTCCAGCTTAACCAGTTAGGACTCTGTCCTC 

TGCATTCTGGAACCATGAT GCCTG CCTGCCTGCCTCACAGGGCTGTTGTGAGGACCAGAT 

GAGATGATGTATGTTCATACTTTTGGMTCTCTMTTTAMGTCTTMTATTTTGTCTTC 

TGAGTGTGAGGGGATAAACCTGGATGTAGACTATTAAGCAGCATAGGAGAAAAGAACAAT 

AGAATCTAATGGACTGGGTTTGCAATCTCTCTCTAAATGCACTGCTTCAGACAAAGTGAA 

ATCCAAAGGTGTGAAAAAGTATAGCTGCAAATTGGAAAAATGTGTTTCAAGAGT 

Sequence 1042 

AGTCGACCACGCGTCCGCTCTGACCGCCGCCGGCTTTCACCACCTGCCCGGCTCATCACC 

TCTGACTCCTGCGGGCCTTCCAGCCGGCGCTATCTCGCAGCCCCCGACCAGCTCGGCCTG 

CTGTCGGGCATCTCTGCTGTCCTCCGGCTGTAGGGCTACCTGCTCCCTCCCGGGGGGTGC 

ACTGCGAGTCCGACTCTAGTGGGGGCAGCGCCTAGCGGACTCCCAAACCCGCTCAGGTCC 

TGCCGGCCTCACCGGCACCGCCCACCCTCCAGCCGGCCGCGGCTCTGCGCTTGGGCGCCG 

GCAGCACACAAGTGTTAGGGGCGCGCCCTCTGCTCTGGACATGCGCGCCGTCGCGAGCGT 

CTCTGGGACCGGAAGTGCGGGCGAGCGCGGNTCCCCGGGTCTGACAGGAGCAAGCTGTGG 

GCACCGNGGCGGTAGTTGGAGGCGGNAGAGGGTNCGTAGCCGNGCCGNCTGCCCGNCATG 
GGCCTNC 

Sequence 1043 

AGTCGACCCCGCGTCCGCAGGGGCGTGTTGGCCCCGCACAGATTGAGCCGAGTTGTCGCC 

CCGCTGGGAGMGTGACCCTCCTGCGCCTGAAAAGAMTTTCATTTCAATATAGGTGACT 

ATGCAGCCTGCAATTCAAGTATGGTTTGGAGAAGATCTGCCTCTAAGTCCTCGGAGTCCT 

CTGACTCCCAGACACGGACCAGGATTGGCTAATGTTTGTCAGTACGATGAGTGGATAGCT 

GTGAGGCATGAAGCCACTTTGTTGCCCATGCAAGAAGATCTGTCAATCTGGTTATCTGGT 

TTATTAGGTATTAMGTTAAGGCAGAAAAATTATTGGAAGAACTTGATAATGGAGTACTA 

TTATG TCAAC TGATTGATGTTCTTCAAAACATGGTGAAAACATGCAACTCTGAAGAATCA 

GGGMTTTTCCMTGAGMMGTGCCCTGTMGAMGATGCTGCATCAGGTTCATTCTTT 

GCTCGGGACAATACCCGCAAACTTCCTTCACTGGTGTAGGGACATTGGGGTTGATGAAAC 
TTA 

Sequence 1044 

ACGCGTCCGCCCACGCGTCCGGAGTCCTCCTCCCCGGGTGCCTGCCCGCAGCCCGCTCGG 
CCCAGAGGGTGGGCGCGGGGCTGCCTCTACCGGCTGGCGGCTGTAACTCAGCGACCTTGG 
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CCCGAAGGCTCTAGCAAGGACCCACCGACCCCAGCCGCGGCGGCCGGACTTTGCCCGGTG 

TGTGGGGCGGAGCGGACTGCGTGTCCGCGGACGGGCAGCGAAGATGTTAGCCTTCGCTGC 

AGGACCGTGGTGAAGCCTCTGGGCTTCCTGAAGCCCTTCTCCTTGATGAAGGCTTCCAGC 

CCGCTTCAAGGCACACCAGGATGCACTTGCCACGGNTTGCCGTGCCCCCTNTTCAGCAGT 

CCCT 

Sequence 1045 

GTCCGCAGAATTGACNAATTCAGGAGGTGTAAAAATAAACAGTGTTCTCTTCTCTACCCC 

AAAGCCACTACTGACCAAGGTCTCTTCAGTGCACTCGCTCCCTCTCTGGCTAAGGCATGC 

ATTAGCCACTACACAAGTCATTAGTGAAAGTGGTCTTTTATGCCTCCCAGCAGACAGACA 

TCAAGGATGAGTTAACCAGGAGACTACTCCTGTNACTGTGGAGCTCTGGAAGGCTTGGTG 

GGAGTGMTTTGCCCACACCTTACMTTGTGGCAGGATCCAGMGAGCCTGTCTTTTTAT 

ATCCATTCCTTGATGTCATTGGCCTTTNCCACCGATTTCATTACGGTGCCACGCANTCAT 

G 

Sequence 1046 

ACCACGCGTCCGCCCACGCGTCCGGGCGGGGGCATGGACTACTGACCCATGCGGGGCAGC 

GTCCCTGTGACCTGGCCGATGAGGAAGTACTGAGCCTGTTGGAGGAACTGGCCCGGAAAC 

AGGAGGACCTTCGGAACCAAAAAGAAGCTTCCCAGAGCCGGGGCCAGAGCCCCAAGCGCC 

CTCTAGCAGCAAACACAGAAGGAGCTCTGTGTGTCGTCTGAGCAGTCGCGAGAAGATTTC 

CCTCCAGGACTTGTCCAAGGAGCGCCGGCCTGGTGGGGCTGGGGGGCCCCCCATCCAGGA 

CGAGGATGAGGGGGAAGAAGGTCCCACCGAACCACCCCGTGCAGAACCCAGAACCCTCAA 

TGGCGTCTCCTCCCCGCCGCACCCCAGCCCTAAGAGTCCCGTGCAGCTTGAAGAGGCCCC 

CTTCTCCAGGCGCTTTGGCCTCCTGAAGACAGGGAGTTCTGGTGCCCTGGGTCCCCCTGA 

AAGGCGGACAGCGGAGGGAGCCCCTGGGGCTGGGCTTGCAACGCTCGGCTTTCTTCTCCT 

GGCTGGAAGGGACCTTCACTTANGCCAAGGAGCTTCGTNTTGGCAGAATTACCCCGACCC 

CCTTCCCGMGCTTGCCGGAGCCCTTNTGTCCTTGTCTGAGGGTCACCAAGCCTTCTTCC 

TTTGCTTGGGAGAACTTCCTTGNCTTNCCTTCAGGAATTCCNGAGCCTGGATTCCCAGCG 

AANCCNAACGTTCCCACAGGCTTTCACGGGGC 

Sequence 1047 

TGTCGACCCACGCGTCCGCTCCCCGCCGAGGCCTCCTGCACCACCTAGAGCCCCACCCCC 

GACCCCACCCCGGGAGGGCAGAGCCAGAAGAAGGCTCATTAGACCTGGGGGACCCAAAGG 

GTCTGGCCTCTTTGGGCAGCCCCAGAGATGAGGGGTCAGCAGAGGAGAGCTCTGGGGTTG 

GGGATGGGTTAGGGACGCAAGCTTGAGTTCTAGCCCTTGCTCTCATTCAGCTGTTGTGTG 

ACCCTGGGTMGACCCTTCCTTGTTTGACCCTCAGCTTTCCCATCTGTTTAATGGTGGCT 

TTGGCCAAGGCAATCCACAAACGTCAAAATTCCCCTTCCCATCAGTACACACACCGATGC 

ACACACACTCTCTCTTTCTCTCTCTCTCTCTCTCTCTCACACACACACACACACACACAC 

ACACACACACACACTAGTTAGTGCCTTGGATGAGGCGGGGCAGTGTGTATATGGACCCCT 

GGACTTGCTACCTTCAGGGTTCCATACTCGTCCCTCCCCTCCTGGCTCTGCTGTCTGGAG 

TCTGGCAAGCGGG 

Sequence 1048 

CGCGTCCGCCACGCGTCCGCCGNCCGCCGCCTGCCTGGGCCGG6CCCGAGGATGCGGCGC 

AGCGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCTAACTTCCCCCGGTACGT 

CCCCGTTCGCCCGCCGGGCCGCCCCGTCTCQCCGCGCCCTACGGTCGGGTCCTCCAGGAG 

CGCCAGGCGCTGNCGCCGTGTGCCNTCCGCCGNTCGCCCGCGCGCCCGCGCTNCCCGCCT 

GCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCATGCTGCCCCTCTGCCTCG 

TGGCCGACCTGCTGNTGGCCGCCGGGCCCGGGCCGAGCCTGGG 

Sequence 1049 

NCCACGCGTCCGCAAGGCGCTACGfTTATTGCCTCGTCTTATTCACTGACCTTTGTAATG 

ATACACAGTGMTTCTTTTTGACAAAGAGAAATGCAGTGTAGTATGCAGAGCTGCTGTTT 

TAATGC CTATGCA TTTACTCTTTCCTGATTTAGGCAGAGGTGGCATTTTCTTTATTGCAT 

TTCTCTAT1TTTTTMTGTACCCTACCTTCAGTATTCTCTTTGTAAGTTGGTGACTT 

TCTGTGGCCTTGMTATTTTATTATCACATGTGGCATAACAGTATCCACACTTTTTAGTT 
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CTTTAI I I I I I ! I I I I I I ATTTTGAGCAATTCTCCTGCCTCAGCCTCCCAAATAGCTGGG 

ATTACAGGGTGCATGCCACCACACCCAGCTAATT 

Sequence 1050 

CGCCCCGCGTCCGGGATGGACMGACAAATCTCTTGTAAAAAAMTTACMGTMTTTTT 

ATAGAAGCTCTGCCCTGAGGGAGGGGGAGCGTGACTTCTCACTCCTTCAGTGTGGGCTGC 

ACAAGTGACTTCCTTCCATATGGGATCGTTATAACAAAAGACTGTAACAAGGGCTATGGG 

AGTTATAAGACAGGAATTGTGGACAAAAACCAGTGTATATCATAACATCACACCTTGTAA 

TGTTGGCAGTACAGTCACTGACCTTTGATAAATGTTGATGACATGTTGAGTAAAGGAATG 

AGAGAAAGAGGATTGTTTATCTCTGTTTTTATCCTTCTCAGAGAAACTTAGAGTAACAAG 

GTGTGTTATCAGCCATGCTGATGCCTTTGGTAAACTATTGTGTGANATNGGGTGGTTTGA 

ATTGGTCAAGTAGAA ACTG GGGCTGCCAGGCGCAGCCGGTAAGCATTTNCATGTGAGCCT 

TAGGGAANMGTGCATTTTGGTAGGAGCCATCAAMATAGCTTCTTGATATTTCATTAAA 

AG 

Sequence 1051 

GACCACGCGTCCGGGGCTCCTGGGTGTGCCCGCGGCCTCTGGCGCGCAGCGACTCAGAGA 

ACGTCTACGAGGTCATCCAGGACTTGCACGTCCCGCCGCGGGAGGAGAGCGCAGAGCAGG 

TGGACGACCCACCGGAGCCCGTGTACGCGAACATAGAGAGGCAGCCCCGGGCCACTTCAC 

CGGGCGCCGCTGCAGCCCCCCTTCCCAGCCCGGTGTGGGAGACGCACACGGACGCGGGCA 

CCGGGCGCCCCTACTACTACAACCCAGACACGGGAGTTACCACCTGGGAGTCGCCCTTTG 

AGGCTGCCGAGGGTGCCGTCAGCCCAGCCACCTCCCCTGCCTCGGTGGACAGCCACGTGA 

GCCTTGAGACCGAGTGGGGCCAGTACTGGGATGAGGAGAGCCGCAGGGTGTTCTTCTACA 

ACCCGCTGACGGGCGAGACGGCCTGGGAGGACGAGGCCGAGAACGAGCCCGAGGAGGAGT 

TGGAGATGCAGCCGGGCCTGAAGCCCTGGCAGCCCAGGGGGACCCGNGGNCCC 

Sequence 1052 

CGGCTTTGCCGCMCATGCTCMTTCCCGATCATCGCTCAMGTGCTAMTTTTCAGGAG 

TGAAAAGAAAAAGAGGAAGGAAGAAACCCCTCTCAGGCAATCATGTACAGCCACCCGAAA 

CAATGAAATGTAATACATTCATAAGACAAGTGAAAGAAGAGCATGGCAGACACACAGATG 

CMCTGTGAAAGTTCCTTTTCTTAAGAAATGCAAGGAAGCAGGACTTCTTAATTACTTAC 

TTGAAGAAATATTAGACAAAGTTCATTCAATTCCAGAAAAACTCATGGATGAGACTACTT 

CAGMTCAG ACTAT GMGAMTCGGGAGTGCACTTTTTGACTGTAGATTGTTCGAAGACA 

CATTTGTAAATTTTCAAGCAGCAATAGAGAAAAAAATTCATGCATCTCAACAAAGGTGGC 

AGCAGTTGAAGGMGAGATTGAGCTACTTCAGGACTTAAMCAMCCTTGTGCTCTTTrC 

AAGAAAATAGGAGATCTTATGTCAAGTTCTACATCAATATCATCCCTGTCTTATTAGGGA 

TTACCGTTTCCTAAGCCAAGAGTCATGTCAAATTGCAATCAGGC 

Sequence 1053 

GACCCCGCGTCCGGGAGGTTGAACGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAG 

AAGAAGCCAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCA 

GTCACTGACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATC 

ATGGTAGATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGC 

TGTCAGGACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACA 

AAACTTCGGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGC 

CTCAGCTCTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAG 

CAGCCCTCTAAGGAGGAGGAGGAAGAGGAGGAGGAGGAACAACTGAACCAGACCTTGGCA 

GAAATGAAGGCCCAGGANGTGGCGGAATTGAAGAGGA 

Sequence 1054 

GTCGACCCCGCGTCCGCAAGGACCATGTTGTACCACAGCCTCTGCTGAGCTGAGGGACAC 

ATGTCCTTGGTGAAGACCTGCACCCCTGGAACCTCCCACCATCATCACAACTGNAGTCTC 

ATTTGCAGTGGAGAAAAGAACCCGACGTCCCACAGCCAGATATACACCCAGCTCCATGCC 

AGCCCTTCATGTTTACCTTTTGCTTTGTTAATTACATGTCAGACTCCTAGAGGGCCTCCA 

GACTAATAGGAAGCATTTCTGTAACCAACCTGCCACCCACTGATTCAGAAATGGAAATCA 

CATTCCACAATCTATGGCTTCCACCAGCTAGCCCAGGAAATACTTGAAATCAGCATTCCT 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

171/467 



NAAAAAAAAAAAAAAAAAAAANAAAAANNNANAAAAANNAANAAANAAAANNAAAAANGG 

GGNnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGGTNCAAAT 

AAAG C AAT AG C NTCC AAATTTCC AAAT AAAG C A I I I I I I I I CACTGNATTCTAGTTGGGG 

GGTTGG 

Sequence 1055 

CCACGCGTCCGCTGTTCCCGACAGCATGGATTAGCTTCCGTGTTCTGMGTTGTTCTTTT 

CAT GGTG TCTGACACCGAGGGCCGTTGTTCGTCCATCAGGCGGGATTGGATGGAGTCTTG 

GTGTTTTGCCTTCTCAGGGACCAAAMTGTATCATTGACTCCTTAACAGTGACCTTCCTC 

CCAAGGACATATCCGTGTTCATTTTTCATAGGTTTTACTCATATTCATAGGTAGATTCTG 

TTMTGTGAGTTGGAAAGAAAAGACCMTTTGTACACCAGTCACACCACAAGACAGTTTA 

TCATATAAAATACCTCAATTTTTTGTATTCCTCATTTCCACCTCACAATTGTACTGGTGA 

TGAATTTTAAGGGTCTG 

Sequence 1056 

TCGCCNCGCGTCCGGGCAAGGAAGGCTCTGTTMTTTATAGCTGTTTAGAGGGGAAAGCA 

GTGCAGACCACTTATTAAGCCCGCTGAGGACTAGCTTTCTGTCTTTCATACATTTGGGAA 

AGATAGGAATGACTGTTTCAAAGAAGAGAGGTGCACATAATTTATGCAGGCAAGTATGAT 

ACTTATTTCATTGGTTTTGTGAGTCATATATATACATATATATATATACACATATATATA 

TGACTTGAAMTTAAGATTTMTACTTTMTGTTTTMGTGTGGGGGGTTTGGGAAGGAA 

GGMTGTAATATNTGGGATTTAGCCTTAGGCTTTAAGTTTTAGGCTGGCAAAAGAAATGT 

TATTCAGTGGGTTTGAGGTTTGGACTACTTTCTTCAMCTTAGAGAATTATAACAGGATG 

GTGTTTACCTTTGTTCACCTGGGATGTCCCCAAGACTCTAGCTTCTTCTATCAAGTGGTT 

GGGTCTGATAGAAGAGGTAAAAATTGCTCTTGAAAATNNTCTACAATTATGCAGGTTCTT 

TGATAAAATTTTCTGGTTAAATCC 

Sequence 1057 

CCNCGCGTCCGCCNCGCGTCCGCCCAGGCCTGGGCCGGGCCGTGACGGCGGCGCTAGGAC 
CCGGCGGGCCGCGGGTGCGGCGAGGCCTGGGCGGCCTGAGGAGCGCGGACCCCGGCGCTC 
GGCTCCCGGCGCCATGTGAGGGGGCTCGGGGGCCGCGGGGGGCCGGGCGCTCCCCGCCGG 
1 AGGTGTGAACCCACATCCCTGCCCCCAGGGCCACCTGCAGGACGCCGACACCTACCCCTC 
AGCAGACGCCGGAGAGAAATGAGTAGCAACAAAGAGCAGCGGTCAGCANTGTTCGTGATC 
CTNTTTGCCCTCATCACCATCCTCATCCTNTACAGCTCCAACAGTGCCAATGAGGTNTTN 
CATTACGGNTNCCTGCGGGGCCGTANCCGCCGNCCTGTCAACCTCAAGAAGTGGAGCAT 
Sequence 1058 

TGGGGCCGGGAGGAAGTCTAACCTTTGGGAGACTCCAAGACCGCAGCTCCGAGGTCGGCG 

GGGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAG 

GAGGACGAGGAGAGGGACTAGGTTGGGCCCGAGGGGGCGCTTGGCAAAGAAGCCCCTTCC 

AGCTGACCGCCGAGGACNGTGTATNACATCTNCTACCTG 

Sequence 1059 

TGGCGATCGCTGAGAGGCACGGAGGGCCGAGGCGGNCCTGGGAGGCGGCCCGGAGGTGGG 

GCGCCGCTGGGGCCGGCCCGCACGGGCTTCATNTGAGGGCGCACGGNCCGCGACCGAGCG 

TGCGGACTGGCCTCCCAAGCGTGGGGCNGACAAGCTGNCGGAGCTGCAATGGGCCGCGGC 

TGGGGATTCTTGNTTGGCCTCCTGGGCGCCGTGTGGCTGCTCAGCTCGGGCCACGGAGAG 

GAGCAGCCCCCGGAGACAGNGGCACANAGGTGCTTCTGCCAGGTTAGTGGTTACTTGGAT 

GATTGTACCTGTGATGTTTGAMCCATTGATAGATTTMTMCTACAGGCTTTTTCCCAA 

GACTACMAMCTTCTTGAAAGTGACTACTTTAGGTATTACAAGGTAAACCTGAAGAGGC 

CCGTGTCCTTTTTGGAATGACATCAGCCAGTGTGGAAGA 

Sequence 1060 

CCACGCGTCCGGGGTGTTGGTGCGGCGCTGTTGGGGTCCTCCGCTGGCTCATGGCGCCAG 
GCGTGGGAGGCCGAGTCCCCAGTGGCGAGCACTGGCCCGACTCGGCTGGGAGGACTGCCG 
GGACTCCAGAGTCCGCGAGGCGCTGCGGGCGCTGCACGCCGCCAGGGAAAACAAAGAAGA 
AGAGTTAATCGACAAACTGGAGGTGGTCACAATGCCTTCCCCATCACCAAAAGGACTGCC 
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• AGTGAAGCAATATGCTGTGCAGTCTCAGCTTCCCGTATATGAGTGGCCGGATGTGGGATC 

TGGAGAATATGATGTTGGAGTAGTGGCTTCGTTTGGCCGACTTTTGAATGAGGCTCTTAT 

TCTTAAATTTCCCTATGGCATATTGAATGTTTCATCCCAGTTGCCTCCCGAGATGGCGTG 

GCCCAGCCCCTGTAATCCATACAGNTGCTTCACGGAGACACAGTTACNTGGAGTAACAAT 
TTATGCAAATTAGACCTAA 

Sequence 1061 

GCCGGTTCTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCCCATGTTCCTGTTGCT 

GCCTTTTGATACGCCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCACCC 

TCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCAAAC 

TCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAGGAG 

CCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGGATC 

CCACTTCACTAGAAGAAGAGATCAAAGAGATTCGTCGAAGTGGTAGTAGTAAGGCTCTGG 

ACAACACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGGAGA 

CCATTATGGATGATGAGGTGACAAAGAGATTCTCAGCAGAAGAACTGGAGTCCTGGAACC 
TGCTGAGCAGAACCAATT 

Sequence 1062 

CCNCGCGTCCGCTTTGAATNCTTATCTTTGATTTMTTTACACGCCAGCATTTTGCCACG 

TTCTAAATAATATTTAGCTCAACTGATTCATACGTATTAATGACCATTCTAGCAAAGGCC 

TACAAGTGGTGTGGGAATCAGGGAAAGGCTGCCTCTTTGGTATCTCAACTGGTATTGATT 

ATTGCTATCMCTATTTGGGGAGAAAAAATCAAAATGAAGCCCTGTCAAATTTTAGAAGT 

ACTATCTTTGGTCCTTCAAACACTTTGTGATGACACCTTAAGAAAAATAAAGTTGAAGTT 

CAGGTCTTGCCATTGCCATTACAGACAAATTAGGAGACTTGGTTTACCTGGGAACAAATT 

T ACTTG AATATTCAGTACCTGAAACTATGCCAAACCAAAGAGCAGCTGCAGTACATTCGT 

TATTTTAAATGMCAAGGTTTACAMGNTTATTTTCATCTATACCGTAAGGNTGGATTTT 
TTTTNAA 

Sequence 1063 

GTCACCACGCGTCCGCCNCGCGTCCGGCGTGCATGGAGGAACGCTGGGCACGGGCCCGGC 

GCGGGTCGGGGGGCGCCCGAGGGGCCCGGGCCGAGCGGCGGCGCGCAGGGCGGCAGCATC 

CACTCGGGCCGCATCGCCGCGGTGCACAACGTGCCGCTGAGCGTGCTCATCCGGCCGCTG 

CCGTCCGTGTTGGACCCCGCCAAGGTGCAGAGCCTCGTGGACACGATCCGGGAGGACCCA 

NACAGCGTGCCCCCCATCGATGTCCTCTGGATCAAAGGGGCCCAGGGAGGTGACTACTTC 

TACTCCT7TGGGGGCTTGCCACCGTTACGCGGNCTTACCANAACTGCAGGCGAGAAGACC 
ATTCCCCGCCAAA 

Sequence 1064 

GTGCCACGCGTCCGCCACGCGTCCGCCTGCCCCTCGCCGCCCGCCGCCTGCCTGGGCCGG 

GCCGAGGATGCGGCGCAGCAGCCTCGGCGGCCAGGCTTGCTCCCCTCCGGCACGCCTGCT 

AACTTCCCCCGCTACGTCCCCGTTCGCCCGCCGGGCCGCCCCGTCTCCCCGCGCCCTCCG 

GGTCGGGTCCTCCAGGAGCGCCAGGCGCTGCCGCCGTGTGCCCTCCGCCGCTCGCCCGCG 

CGCCCGCGCTCCCCGCCTGCGCCCAGCGCCCCGCGCCCGCGCCCAGTCCTCGGGCGGTCA 

TGCTGCCCCTCTGCCTCGTGGCCGCCCTGCTGCTGGCCGCCGGGCCCGGGCCGAGCCTGG 

GCGACNAAGCCATCCACTGCCCGCCCTGCTCCGAAGAGAAGCTGGCGCGCTGCCGCCCCC 

CCGTGGGCTGCAAGGAGCTGGTGCGAGAGCCGGGCTGCGGCTGTTGCGCCACTTGCGCCC 
TGGGCTTGGGGA 

Sequence 1065 

CGCGTCCGAACGGCATCATCACGCCCGCCACCATCCCCAGCCTGGGCCCCTGGGGAGTCC 

TGCACTCAAACCCTATGGACTACGCCTGGGGGGCCAACGGCCTGGATGCCATCATCACAC 

AGCTCCTCAATCAGTTTGAAAACACAGGCCCCCCACCGGCAGATAAAGAGAAAATCCAGG 

CCCTCCCCACCGTCCCCGTCACTGAGGAGCACGTAGGCTCCGGGCTCGAGTGCCCTGTGT 

GCAAGGACGACTACGCGCTGGGTGAGCGTGTGCGGCAGCTGCCCTGCAACCACCTGTTCC 

ACGACGGCTGCATCGTGCCCTGGCTGGAGCAGCACGACAGCTGCCCCGTCTGCCGAAAAA 

GCCTNACGGGACAAGAACACGGCCACGAACCCCCCTGGCCTCACTGGGGTGAGCTTCTTC 
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' TTCTTGTCGTCATCGTCCTTCTTCAAGCTTGGCCAGCAACGAGAACGCCACAAGGAAACT 
NGTGAGCCCACGTTNGGCCGTCGGGAAAACACGGGGN 
Sequence 1066 

CGCGTCCGGCCTCCAGCACATCCTGCCTGCAGAGGGTCTGGCTAGCTGCCTTTTCAGCTC 

TCGAGGGATAGAGATTCTACAACCTCCCTCTGTCATCAGTTCCAGAGCCACTCCCCTTTG 

CACTAGAAGTTCTTGCTTTCAAAGAATGAGGGTGTGAGGGAGGGAGGGGTCAAGAAACAG 

AGTGACAGGGGAAACAGGCAGAACAAGTCAGGGCAAAGGACCCCAGCATGAATAGTTGTG 

GAGGTGGAGGTGGGGAAGCAGCCTCACATCTCACACTTCCTTCCTTCTCTTAAATGTGAG 

CAGCTGACTCCAAGCCTTGTGGAAACTCTAGAAGGTAGAACCAGCCATCTGGGGAAGCTG 

GCCTTACAGATGCCCCGTCTGGCATAGTGGGAGGTTCCTGTGCTCTGAGAACCCCAGTGT 

GMTCTAGACATTCCACTGCAGCCTGGGMGAAGCCTGTGTTTTTCTTTAAAAAGTCT 

Sequence 1067 

GCGTCCGGTTCTTAGGGAGGCAGGTGCTGGCCTGGCCTGGATCTTCCACCATGTTCCTGT 

TGCTGCCTTTTG ATAG CCTGATTGTCAACCTTCTGGGCATCTCCCTGACTGTCCTCTTCA 

CCCTCCTTCTCGTTTTCATCATAGTGCCAGCCATTTTTGGAGTCTCCTTTGGTATCCGCA 

AACTCTACATGAAAAGTCTGTTAAAAATCTTTGCGTGGGCTACCTTGAGAATGGAGCGAG 

GAGCCAAGGAGAAGAACCACCAGCTTTACAAGCCCTACACCAACGGAATCATTGCAAAGG 

ATCCCACTTCACTAGAAGANGAGATCAAAGAGATTNGTCGAAGGGGNCNNAGTAAGGCTC 

TGGACMCACTCCAGAGTTCGAGCTCTCTGACATTTTCTACTTTTGCCGGAAAGGAATGG 

A 

Sequence 1068 

TCGACCCCGCGTCCGGCTGGTTTTCCGTCTGGTGAGGGGTTACTTCCGGGTCGGACGGCG 

CTAGCTGCAGCATCGGAGTGTGGCAGTGCTGGGCTGGCCGGCGGGCTGGGCTGCGGCCCG 

CGCGCGGCCGGCGATGCANGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGA 

CCGGGGCTGGGGCTGTGGCTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACC 

CCGAATATGACGA ATCTG ATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAAC 

MGCCAACCTTCCCTTTTGCMGTTTGCAMCCMCTCTTGCTGGGTTTCTTTGCTGGAA 

GCACNTTGAGCCCACTGTGCATGAACCA 

Sequence 1069 

CCGTCCGGGAGGTTGAAGTTCAGGCTAAGACCGTTACTGAATTGGTTACTAAGAAGAAGC 

CAAAGGCTGAAGGCTATGCTGAGGGTGACCTCACTCTCTATCACCGTACCTCAGTCACTG 

ACTTCCTCCGAGCTGCCAACCCTGTTGACTTCCTCTCCAAGGCCAGCGAAATCATGGTAG 

ATGATGAAGAGTTGGCACAGCATCCAGCTACCACTGAGGACATACGGGTGTGCTGTCAGG 

ACATCAGAGTGTTGGGGCGCAAGGAGCTCAGGTCGCTACTAAACTGGAGAACAAAACTTC 

GGCGATATGTGGCCAAGAAGCTGAAAGAACAAGCAAAGGCACTGGACATCAGCCTCAGCT 

CTGGAGAGGAAGATGAAGGTGATGAGGAGGACTCAACAGCTGGAACCACAAAGCAGCCCT 

CTAAGGAGGAGGAGGGAAGAGGAGGAGGGAGGAACAACTGAACCAGACCTTGGCAGAAAT 

GAAGGCCCAGGAGGTGGCGGAATTGAAGAGGAAAGAAAAAGAAG 

Sequence 1070 

GCGTCCGGTGCTGGAGGAMATGTTTCTGGGGAAGATGACTCAGTCATTTTGTGGCGAGA 

CACCCTTTGGTAACTCCCACTGACCAGTCTTGGGAGCCTTCCTGGAATGATCGTGGGCTG 

AGCGGAGATGTTTTTTGCAAAATGAAACTGAAGCTGAAAGAAAGGAGAATTCGAGTGAAC 

CAAGAGAAATCCAAAGACCTGGGGAAGGAGGACTTAAGATGAAAGTGAAGCAAGAGAGGG 

AAGGGGAAATGAAGTGAAAATGGCGTGAGGGTGTGAGAGAGGTTTGGGTTAGGAAACATG 

TTTTTAGTGCTATTTNCAACCAGGG 

Sequence 1071 

CACGCGTCCGGGACTGATCTCNAGGACCAGCACTCTTCTCCCAGCCCTTAGGGTCCTGCT 

CGGCCAAGGCCTTCCCTGCCATGCGACCTGTCAGTGTCTGGCAGTGGAGCCCCTGGGGGC 

TGCTGCTGTGCCTGCTGTGCAGTTCGTGCTTGGGGTCTCCGTCCCCTTCCACGGGCCCTG 

AGAAGAAGGCCGGGAGCCAGGGGCTTCGGTTCCGGCTGGCTGGCTTCCCAGGAAGCCCTA 

CGAGGGCCGCGTGGAGATACAGCGAGCTGGTGAATGGGGCACCATCTGCGATGATGACTT 
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1 CACGCTGCAGGCTGCCCACATCCTCTGCCGGGAGCTGGGCTTCACAGAGGCCACAGGCTG 
GACCCACAGTGCCAAATATGGCCCTGGAACAGGCCGCATCTGGCTGGACAACTTGAGCTG 
CAGTGGGACCGAGCAGAGTGTGACTGAATGTGCCTCCCGGGGC 
Sequence 1072 

CCCGCGTCCGCGCGACGGCCGCGGCGCGGGACCTTAGGACCCGCGGGCTCCAGGGCTACT 

GTCCGTCCGCCACTGCGCGCCAGCAGGTCCTGGTCTCCGCTCTCCAACAGCTGAAAGGCC 

GGCGCAGTGAACACAGAAACGAAAACCAAGAAATGCCTTATTCCACAAACMAGAGTTGA 

TACTTGGCATCATGGTGGGCACTGCTGGAATCAGCTTGCTGCTCTTGTGGTACCACAAGG 

TCCGTAMCCAGGGATAGCMTGMGTTACCTGMTTTCTTTCTCTGGGTMTACATTTA 

ATTCMTMCTTTGCAAGATGAAATACATGATGACCAAGGAACAACAGTAATCTTTCAAG 

AAAGGCAACTTCAGATACTGGAGAAGTTAAACGAATTCTGACAAATATGGAAGAACTCAA 

AGAGGAAATCAGATTTCTTAAAGAAGCTATTCCAAAGCTGGAGGAATATTTACAAGGATG 

AACTTGGGAGG 

Sequence 1073 

CGCGTCCGCTGAGTTCNAGGATGGTTTTTTCTTGGGACCAGACATGAACAAAAGTTGACC 

TCATGAGCACTTCAACCTCTCCAGCTGCCATGCTCCTCCGGAGGCTGCGGCGACTCTCCT 

GGGGCAGCACTGCTGTCCAGCTCTTCATCCTAACAGTGGTGACAGTTTGGCCTGCTGGCC 

CCCCTGGCCTGTCACCGACTTCTACACTCTTACTTCTATCTGCGCCATTGGCATCTGAAC 

CAAATGAGCCAAGAGTTCCTGCAGCAAAGCTTGAAAGAGGGTGAGGCTGCCCTCCACTAT 

TTTGAGGAGCTTCCCTCTGCCAATGGCTCAGTGCCCATTGTCTGGCAGGCCACCCCCCGG 

CCCTGGCTGGTGATCACCATCATCACTGTGGACAGGCAGCCTG 

Sequence 1074 

CGTCCGTGAAMTCCAAAGATGTATCATTTTTATTTGAATCCATCATGCAGTGTACATTT 

CAGATAATTTCCTTCAGT CTCCAGATA GGAGTGTATCCAAACATCTAATTTTATGTGCAC 

TGTGTATC TTATA TGAATGTTTTATTTTATATACCACATGCAAAAATGNCCATATGCACT 

A TTTAA ATGTTTTAAATMTATATTCCTTCTTTATAATGCTAAATCTATATGAGTACCAT 

ATTTTTATAAGTCAGTGGTCTGACNGGN^^ 

G GTAT TCAANGNGAAAAGGGTGGNTGTGAGGAGAANATGTGAAAGNGGNNTGGGNGNNCT 

CTTTTGCCTTTGGGCCAGGAATTNGGGGGGCNAAAATNNACCCANAACCTGGNNNAAAAN 

TAGGNCCANTTGGGGGNGANAGGTTTCACTTTG 

TTTNTNTTTNCCAAAATANATTNTTTTTGGGA'I I I I I I I I I GTNCCCCCCCCGNATTAAA 
TGGGGANTTGGCTGGNGTCTTGGCNCCTNTCATTTGTGCCAGACCTTTTTTTTATTAAAA 
MGMCCTTGGGAAAGGTCTTAAGTNCATTTGGGAAAAAAAAAAAAAAAAAA 
Sequence 1075 

GAGCCGNCCCACGCGTCCNCGGNCGCGTGGGCTACCTTGGAAGCAGTCATCTCTCAGTCT 

TACATTTGGAGAATGTGGATGGCATGACATCAGMTTCCTTTATATMTTTAACTTCAGA 

ATAGTCTGAGATCATCGAAGCACGATGGTCAAGGGAATTCCGTTTTTGTTTTAGAGCAAA 

TATGTTTGCTGTTTGTCTTTCATCACAMCATCAGTGGAGTTTCAGCACCTTACAGAGCT 

CAGTGAACCCCCTGGTCACCATCAAAGTTAGCACACAACAAAGCCAACCACGTGTCCCCC 

TCACAGATGACAATGGCTGAACTCTGAGTGAAACCACCTGTATGGCCGGGCACAGTGGCT 

CA 

Sequence 1076 

GCCCCGCGTCCGCTTTTTGAGAATCTCTGCTCTGTTCCTAGGTTCAGTGCTGGGTCCTGG 

GAATACAGCAGGACAGACCTCAGCTTATCTCTTCATAGAAATTATACAAAGAGAATTGGG 

GAGACAGCTAAGAAGAAAACMAGAMTAMGCAGTTACAAATTGTGATAMGTGCTTTT 

GAAGGAAAGAAGGGGTCTGAGACAACAACAGGGAAGGGGCCTCTCTTGAAACAGTAGTTG 

GGAAGGAGGCAGACATGCACCAGTGATGTGGTGACAGGTGCTCTGAAGGAGGTCACCAGG 

ACCTGACCTCTTTGAAGGATCAGAAAATACTTCCCTGAAGGACTGACATTTGAGCCTAGA 

CCTGAAGGGTGAGCCATCAAGCTAAGACAATTGGGGAAGAGCATTCCANGGAGAGGGAG 
Sequence 1077 

CGCGTCCGATCTTTGTCTGCTTTCCTATAACTCAGTACTGTAACTCAGTACTCTGAAATA 
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■ GTTTCCT7TGTTAATAGAGTCACTTTTATAGTACTGNGCTTGAGGNNATATACAGAGTAT 
TGTGTCCAAATTTATCATTGCACAAAGTGTrTTTGGAAATTCTTGGTTACTCCTTAGTAA 
ATTACCTGTAATT GGGT TAAATGCTGGTAGGGTTTAAAATCTGATTGCTAAAGTGAATTC 
TCTA TAAA GTGAGTTTTGATACATAGAAACTTTNCATATAATTCTTAAACTCATGTGTCA 
TGTATTTTCATTTATAGTTTTTCATATTCATTAACATATGTTGTTCCTTACCATTTACAG 
CTCANMTTCTGCANATGCAGATTTTTGCAAACTTTGATGCATTTGGACAGTCTAGTGGT 
TCGAGTAATTTTGGAGGTTT 
Sequence 1078 

TNCGGGCGGGTGCGGCGGGCGGGCAGGCGGGCAGGCCGGCAGGCGGGTGCGCGGAGGGCT 

GGTACCCCTNAGCAGGTGGGCGGGGTGCGGTTGGNGGCGGCGGCTGGGCCGGGGGCTGCC 

CGGCTGCGCTCGGGCCGTGCGCGGNGGCCGTGCGGGCACGCCATGGACTTCAACATGAAG 

AAGCTGGCCGTCGGACNGCGGGCATCTNTNTTCACCCGGGCCGGTGCANTTCACGGAGGA 

GAAATTTGGCCAGGCTGAGAAGACTTGAGCTTGATGCCCACTTTGAAAACCCTTCTGGCC 

CGGGCAGACAGCACCAAGAACTGGACAAGAAGAAGATCTTGAGGCAAGACAAGAGGTGCC 

TGCTGCAGCCCCAACCCCAGTGCCCCGAGTGGAGGGAATTANCTGTATGAAGAAGCTTG 
Sequence 1079 

CACGCGTCCGTGCTGAGCGGTCCCGAGGGGAGGGGCCTGAGGCCGAGTTCCAGTCGCTGA 

CCCCTAGCCAGATCAAGTCCATGGAGAAGGGGGAAAAGGTCTTGCCTCCCTGCTACCGGC 

AGGAACCTGCCCCGAAGGACAGGGAGGCCAAGGTGGAAAGGCCCAGCACCCTCCGTCAGG 

AGCAGCGTCCTCTTCCCAACGTGAGCACCGAACGTGAGAGACCCCAGCCTGTCCAGGCCT 

TCAGCAGTGCACTGCACGAGGCTGCCCCCTCCCAGCTCGAGGGGAAGCTGCCATCTCCTG 

ATGTCAGGCAGGACGATGGGGAAGACACCCTGTTCTCGGAACCCAAGTTTGCACAGGTCA 

AGCTCMGTAATGTCGTCTTGAAGACGGGATTTGATTTTCTGGACAATTGGTAAAATGTA 

TTAGAAAAATACAAT GAAAG AACCCTAAAATGTTTTCCAAAATGGTGTGGTGGAGGAGGA 

TAAAAMGGGCCACCTTTTCCTATGTATTTTACTGGTTTCTTGACACTCTTTTCTTAATC 

ATTTGGAAACTGGTCAATACTGNCAGATTTTTT 

Sequence 1080 

GTAAGCCAGGTGCTCCCCCTTCACTTCCTGTGTGCGGAGCACGCTCGCCCTGGGAGTTTC 

ACTAGAAAGAAGGTTGCCATGGGCCAGTGGGACAGCTTGGATCTCAAGTGCACGCGGATG 

CCCCAGAATCCCAGGATCTCAGCTGAGCTGTTTGTGGATTATTAGATCTGACTTAAAAGA 

ATATTATCCAGCAATGCAAATGAACAAACTATAACTACACACAGCTGCATGGATAAATGT 

CAGAAACATGACGTTGAAGTGTGAGAAGCCAGATGCAAACCGAGGACTCACTGTGCAATT 

CTGTGCATGTACAGTGGCCAGGAGAAGGGAGCACTGGCTTTTGCTTTCATCAGGCCAAAG 

ATGCCTTTCTTTGGGAATACGTTCAAGTCCCAAGAAAGACACCTCCTCGGAAGGTCGGCA 

TCTTTCTCCAACCTGCATTCTTTGGATCGATCAACCCGGGAGGTGGAGCTGGGCTTTGAA 

TACCGATCCCCCGACTATGAACCTGGCAGGGCAAAAGCCTGAAAGTTTGAAAAT 

Sequence 1081 

TGCTCTCTCACCTGTTAGCTGTGTAGCATTGGGCAGGTTACTTAACCTGTCTGTGCTTCA 

TCTGTGAAACAGGAATAACAGCATTATTAAGGATTGTTTTAGGATTGGATGGGTTAATAG 

ATGTAAAGTCTTAGAACTATATTGAGCATCCCATCAAGGCATTGTATTATATTGAAACAA 

TGGGGTTThrrTTCTCTTTATNCTTTTTTMACTATATMTGMCACTTTTGATCTTAAGT 
ATTNCTAA 

Sequence 1082 

CCCGCGTCCGGTGAATGTTAGTATTGGGTGTGGGATGCATCAGGGACACAGGTTTGTAAA 

CCATGACAATTCAATTGTAAAACTAAAGCCNAGTGCCCCCTGTAGTCCCAACTGCTGGGG 

AGGGTCACTTGANCCCTAGGGGGGGAAAATGCAGTGGACTGGATTTTGNGCCCTTTGCAC 

TNCAAACTGGGTGACATAGTGAGGCCCTTGTCTTCTACCANAAAAAAAAAANNANNANNN 

NNANGGTGCCGGGCCGCTTAGAACTAGGTCTTAGAAGAAAAAAACCCTCCCACAACCTTC 

C CCCT GAACCCTGGAAAACATNAAAATGGAATGCCAATTNGTTTGGTTGGTTAACCTTTG 

GTTTTATTTGCAAGCTTTATAAATNGGGTTTACCAAAATTAAAGGCCAATTANGCCATTC 

ACCAAAAATTTTCACAAAAATAAAAGGCCA I I I I I I I I I I CACCTGGCATTTCTTAGTTT 
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NNTGGGNTmGGTCCCAAAACCTCATCNAATGGGTANTCTTAATCATGTCCTNGGGATC 
CCCCCCGGGTTACCCGGAGCCTTCGGAMTTAMTTTCCTTCTTTCCGCTTTTCCTTCGC 
TTCACTTGACCTCCGCTTNGNCTCGGGGTCCGTTTCCGGGCTTG 
Sequence 1083 

TCGACCNCGCGTCCGTGGAGGGCCCATCTGCCAGAGCCTGGAGTCTGCGAAGGCCGGGAC 

CCGGTTCCCCGGCCCACAGTGGGGGTGTGCAAACCCGAGAGAACTGGGAAGTGCCGTCAG 

AAGCGATAACTGACGACGTCTAATGTCTATCTGACCGCAGTCGCTGAAACCTCTACAACT 

TAGTTGACCGTMCTGCCAGAGCCCTGCCCTGMTTCCTGTCCTTACTCCCTCTTTAAGA 

TTGCGTACCCACTGCAGAGTGCTGAAGACGGGGTAGCCACCGAGGTTGCAAATTCGTGAA 

GAATCAGCATCAT GTTTG GCAGCTGAGTATTGGAGCCAGGAGCCTGCCATGAGGTTTTGA 

GAACAGAGTGCTGT1TTAGAGCTGGCAGCAGCATCTCAGCCCAAGAGAAGGTTATATTCC 

CAGAGGATGTCAGTCCCAAGGACCAAGTAGCTGCCATCAGTTTGGATTCTGAAAACTAAC 

TGGCATCAAACTGGGTGTAGAAACATG 

Sequence 1084 

CGCGTCCGGACTGTCGCTCTAAAAGAATGAAGGAAATAATAAAGTGATAGACAGGGAAGG 

ATAGAAAAGACTTAACMTATACATATGTTCCGTCTTTGCTGTTTTGGAGAATGATGGAT 

MGTANGTGTTTCCTGATTCTGAAGCATAGCTGAACMTTTMTTGTGGTTTACCATCTT 

TTTGGTTCCCTCTTCAGTAATTAACCTATCGAAAATCTGTCCTAAATGTTTGGACTGGGG 

CACAGTTCCCTCCATCGCTTTGGGAGAAAATCATTAATATGGCATACTGCAGATTGGAGG 

GCAGGACCACTGAGGGTGTCATAGACATTAGCTCTATGGAATTCTGCTAGCAATTTCCAA 

GTGACAGTGAGGAATTATGGATATATGTTTGAGGTCATTCAAGCTTCCTGAGTACCACAT 

TCCCCAGCTACTTAGACACCGGGTTAAAATATTAAAGATGTCCTAGTTCAACAGCTTGAA 

TTCCATTGATTGGAT 

Sequence 1085 

GACCCCGCGTCCGGCTTCCTGGGTTCGAAAGAACCCAGTTCAGGAGTTTCTGTTTTAGTT 

TGAGATCTTATAGGCCTGTCTCATCAGGTTGGTGTCAGCCCAGCTAGGATTAGGCAGAAT 

TGGG TGGGGGCTGTAGTGCATCTTTGGCACAGCATGTACCTGTCTGACTAATTCTCTGTC 

TTTTCTTTCCTGTTGCMTTCATGGGTCTTAGCATCTTCTGMTGGTGTTTAGTAGGTCA 

TCCTGTTGATTTCCTGCTAGGGAGTAGCATACTCTGGCTCTGTACCATTGGCCAAGGGAC 

TTMGGATAGGTGAAGGGCTGCAGTTTTGTTAAATGGAACAATATGAAGAGATGGCATTG 

TAAAAAAACTTNTGNCAACTNAA 

Sequence 1086 

T GTCGC CCCGCGTCCGATCAAATCTTGATGMGGATTGTAGATTTT^ 

GTTTTTAAAACTTATTCCAATTGCTAMTTGGTAGTTTTTCAGTCTTTATA^ 

TTAAAAATATATATACAGTTATATGAAATGTTTA77TTCTATGTGTGTGCATATAGTTCA 

ATATTATGCAATAAATTTGGTGTTTTAACTTAAMCTATTTCTTATTGTACTTGCAGAAT 

GGATAGCTTGCTTTTAGTAGAAGCATTAGGTCGTATACTCAGATAATCTAATAGAAGGTC 

AGATTTGATTCCTGCATAAGAAAGTAGAGCCCAAGTGCTGCAGAAATGGAGAAGAAAGCA 

GGGGCAAGGGAGCAGATGGCATTMGGMGMTGAAGTTTTTGAAGGTTGGGGATGGACG 

AAMGGGTGTTTCTCATGGAGAGGGGATGCTTTAGCAAAGGCTCAAACATTGGGGCATAT 

TTAGGCAAGAGCCAAGAAACAGTTTGAAGGGGAACATCAGAGGAAATAGGCCAAATTAAT 

AGTAAATN 

Sequence 1087 

GGNGTCGACCNCGCGTCCGGAAATACTCAAATAATGCAACATTACTTCCCAGAAATGAAA 

ATACATTGCATCTCTTATTGAAAGAGCCAACAGAGGCAATGAAAAATAAATGTACACATA 

TGTGCACCCTTMTTCTTTATAATGAACCTTGAGAMTCAAGGCTTTAGAAAAATGCTGA 

GAGACAAGAAGCTTCTGAGCAGCAACACTGGATGCAGGAAAAAAAATGGAGTAATAAGTG 

TAAAGTTCTGAGGGGAAAAAAAATATCAGAACCTAGAATTCTAAATTTAACCAAACTGTC 

ATTAMGTAAATGAGAAMGTAMTTCTTTTGAGAAACGCAGAGCCTAGAAAGGATTACT 

AATCGGTAGGATCATCTTTTTTTGTTTGTTTGTTTGGNTTGGGGACAGACTCTCACCTCT 

GTCACCCAGGCCTAGGTTTGCAAGTGAGCCCAGGATTGTGCCACTGCCCTCCAGCCTTGG 
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GGGTGACAGGAGCAAGGACTTCATCTCAAAAAAAAAAAA 
Sequence 1088 

TCCCCCTGCCCCCCCAAGCTGAGGGAATAAACTCTGCCTGTCGGGTGCCCCACCAAGGGA 

AATAAACATGGCCTAGCTGCCAAGTCATGCCTGTAGGGTGCTCTCTACTGGCAGTTTCTG 

GGTGGTGCATACATCTAGTCTCCCTAGAAGAGCACAGTCCAGAATTGAGGAGGTACAGCA 

GAAACAGACTGCAGGCAGAGAGAGGTCTCATAGAGCTTGGACAGGGCTAGGCACAGAAAG 

AACAGGCAGCGTATCCAGAAGGGGGCAGGGAATGGGTGAAGAGGTTGTGCTCTAGGGCAG 

AGCTGAGCTCTGATCTAGAAAGGACAGCAAAGATACCTGGAAGGCCTCCCGATTCTTGCG 

TTGTTGGCGTCGCTCCCGAAGCCGGG 

Sequence 1089 

CGNTCCTTGGGTAAAAGCGTCCCAGAGACGGGAAGAAATATGGTATAAGCGAGAAGGCCT 

CATAAATCTGGGCTGTTAAAAATCTAAGTTAAAAATATGTTTTAAGTCAGAAAAAAAAAA 
AAAAAAAAAAA 

Sequence 1090 

CGCGCCTGGTAAAATTATATAAGCTTAAAAAAACAAAAACAAAAACACTTGCTTTGAAAA 

GAGTCTCTCAGCAGCAATTTTGTCCTTGCCCCTACTTCCACAGTTCCTTTTCTTACCATT 

TCACATCTGGATTACTACATTGGCCTCTTTGCTTAGACTCCCAATATTCATTTGCTTTCC' 

TTCACCCCATTTTATGGAGGACTGTCAGATCAATCTTTTAAAGATAAATTTTATAATGTT 

ACTACTGTTGC CTATT GGATTAGAGCCCTAGGGGTGCTTTTTGTAGTCTCACTGACAGCT 

GACATTAGTGATTTTTCACCCTCTTCTTATTGCTACCCTGTGTTGATGGCCAGTTTCCAG 

GTGGGCACCTGCTCCACTTGCTTTCAT 

Sequence 1091 

GGGGTATGTGTGGTTCTTCCAGGAAAGTGCTGAAAATATCACCCAGGCCTCTGCGCCACG 

CCCTGGGAGAGTACACTCCTGGGCTCACGCCTCTGCATTCCAAGGCTGACAGCTAGAAAT 

ATACTTTGTAAAATACCAACAACTTATTCACAAATATTCCAACTATCTACCAGCTCCAAT 

GAGCTTGCTGAGGATGGGTATGACCCCAGTCTAAGGGGAAAGAATCTAAAACACAAGTAA 

ACCTGTTTAAAGGCCAGATCTCCAGATGGAGATCCAAGCAGATGGCGCCTAAGGTTTGCC 

CTTGAAAACTACCAAGGAAGCCACAGAGAGGGATCTTTGGACCTTCTGGAAAATGGTAAG 

GCCCCAGGTAGATTATGGCTCCTCTGCCCTGGAGGCTGAGCCGCCCTCTGGTTACCTCAC 

ATCTTCTGGTTTCTTCTGAGTGGGACTTGATCTCATTTCTGCATTCACAGCAAGGNGGAA 

CTGTCTGGCAAGAGCTTAAAATTAGGACCTGNTGGTGGGGACCTTTAATAGCAGGTGGAG 

GGTTTGAGATCCCNTGAGATGCCCNAGATTAATTCAATAGGGGGANGAAAGATTTGGCCC 

AATTCAAAAGNGCTTAAAAAAAAATTTT 

Sequence 1092 

CGCGTCCGGTTCTTTGGTTGAGCTTCTTTGTATCAGTAACAAAAGGAAGCATCATTCAGT 

CTTTCmGTMCTTMTGTMGTCTCTTTTGTACATCCTATTACTTCAAATCATTGAAG 

TGAACTCC ATTTT CACATCTGTTGGGAACAATCATCTAGCTTCTTAAATGACTCATCTTA 

AAATATGAATTTTAGACTGCCTAAACATTCTGAGGGAGTACAGTGTGATATAGCAGAAAC 

AACCGGGGCTTAAGAAGGATCAAAATGAAAGGTTTTGTGAAGGATGTGCCAGAGACTGCT 

CTCTGTTCACCAAAACCACATTCTTCTCTTCTTCCTGGCATAGAGCCAGACTAGATTTCC 

AAGCTTTCCTTGCAATTGAGCTCTCAGAGTTCTCGTCATTAGAATGTGAATGATAGGCCG 

GGCGCAGTGGCTCATGCCTGAGATCCCAACACTTTNAGAGGCCAAGGTGGGTGGATCACT 

TGAGGTCCAGAG7TCNAGACCAGTCTGGCCAACGTGGTGAA 

Sequence 1093 

CGCCCC GCGTCC GATAAAACTGGATTTGATTTCTTTTTATGAAANGTTTCATATGAATGT 

AACTTGA i i i i i lACTATTATAATCTAGATAATATGATATAAGAGGGCTAAGAATTTTTA 

MTTGAATCATATATATGATATAATTTGATCCTTCTTGTATCTTGAAGTTTTGTACTTGG 

GATTTCTGGACTGATAAATGAATCATCACATTCTTCTGGTAAATATTTTCTTGGAGCTCT 

GTGTCAACTTTGATCCTTTGTCTCCCAGGAAGGTGTGACCTCTCCTTTGCCTGCATACCT 

CAAGGCCAGGGGAATATGCCTCAGTGATGCATTTATCTTTGTATATCAGGCCGCATGATT 

CCCMCTTTCTGCCACACTTAMTTACGTTCCTCCATTTCAGTTTTGTCTTTTCTGTCTA 



WO 01/070979 



PCT/US01/09126 



TABLE 1 

178/467 

AAGT TCAGTCAAAGAGTATCAAAAAATTATGTTTCAGCTAGACTGGTGTAATGTATAAGT 

TTTTGTATCTTGTATTAGAGGATTTCGTAGCTTTTATTAGAGG 

Sequence 1094 

GCCCCGCGTCCGATCCCTAGATGACATAACAGCCTTACAAAAGGACAGGGAGGAGTGTCT 

GTTCCTACTCTCACATAGCGGAGGAAAGTTAGAGCCTCTCAGTCTCTGTTTATGAGGACT 

CATTMTCTCAAATMTTGATGCATTTTTCATACATTAGGGTCTCTGTCCATGTGTCTTC 

CTGATATTGTTATAGAAATGGCTTCAGGCTGCTGGTAACAGATGCTGCGGAAAAAGAATG 

CCTTAAACAAAGCCAGGCGCGGTGACTCACGCCTGTGATCCCAGCACTTTGGGAGGCTGG 

GGTGGGAAGGATCACTTGAGCCTAGGAGTTAGACACCTACCCAGCCTGGGCAACACGGTG 

AGACCTCGTCTCTACAAGAAACAAATAATTGGCTAGATGTCGTGGCGCACAAGCTTGTGG 

TCTCGGCTACTTAGGGGGCTGAGGCGGGAGGATTATTTGAGCCTGGGGAGGTCAAAACTG 

CGGTGGGCTGGGATTGCGCTACTGCACTCCGGGCTGGGAGACCGAGTANGACCCTGCCTT 

AAAAAAAAAAAAAAAAAAAA 

Sequence 1095 

AGTCGCCCCGCGTCCGCTCATACCCAGTGAATCTTCAACAGAATCTCTTAAAGATCTCCA 

GGMGTATMGCTCTCATTMTGTTTGAGTTAGAAGMCTTATTCTGGGCCTTTAATTTG 

TTGCATGTGCTGTACTTAAAGCATCCCAGATMTTTTAGCTTATATTTTCATAGTGTTTA 

TACAGAGCTTGAATTGGAA TGGT CCTTTCCTTCCTTGCCTCAGTACTTCCTTCCATAATC 

TTTCCTGCCATAACCAT TTATTTT GCCACCATTTCTTAAACACTTATGTGGCAGGCATTA 

TGCTAGACTGTAATATGTTTTTTTTAATCCCAGTTGAAGTGGATGTGGGAAGGTATTAGA 

AAGTAGAAGAAAGTATAGTCTAAAATAGAGAGGAAAGAAAGGAAGAGAAAAGTGGGATAT 

TTCAAAACCATTTGCGCAGAGGTAGAATGAAATTCGCCAGAATGGGAATCTCCGTATTTC 
TTTACAAT 

Sequence 1096 

GTNGCCCCGCGTCCGAGTNAACAGTGGTAGTNAAATTCAGGGTTGGTAAGTTTTTCCATA 

GAAGGCCAGATGGTAAATATTGTAGGCTTGCAGACCATGTGGGCTCCACGACTCAACTCT 

GCCACAGTAGTTTGAAAGCAGCCACAAACAGCCTTGGTGTGACTTTGTTCCAGTAAACTT 

TCTTTATAGAATGGGAGAAAATATTTGCAAACAATGCATTCAACAATGGCCTGATGTCCA 

GAATTCATGAGGAACTTAAAA AACTC AACAACAAAAATCACCAAATAACATTTAAAAGTG 

GGCAAAAGATATGAATAGTCATTTTTCAAAAGAAGACATACCGAATGGCCMCAAGCATA 

TGAAAAAATACTCAACATCTCTAGGCTTTCAGAGGCATGCNAANTAAAACCNCCATTNGA 

TATTATNTTACNNGANCCCNAAATGGG I I 1 1 I I I I A AAAGGCCAAA 

Sequence 1097 

CCCCNCGCGTCCGTTNAAGTGTGCTTTGGAAAAGGGAAAAAGTCCTAAGTAGATATAAAA 

CCCTAACTAAGGA AGAA AGCAGGTAGCAGTGGTGGTCCAAGAGACCGTGTAGTGGATGCA 

AGGACCGCTCGTATTTTACACGCTATATTTCAGCAAAGGGTGGCCCATCTGGCAGGAAGA 

TGGGGACATATGTCACATATAGAGCAGTTAAGGAACTAGGGAAAGTGGAAGACTCAGAAG 

ACCTGTCTTTGACCTGGTATGTTCTATCTCTACAGAACCf AATATGGCTTATACATACTG 

CCACAGAAAGGACTGAGGTAGACAGTGGCAAAGACTTCCTAGGAGTTGAACCCCTGAAAT 

TACATAAGGAGTAGGACCCCACCAGAATTCTGTCTTTGTAGGCTGCTGACTGCAGAAGAA 

ACGG TGTAG CGGAGGCAGGGGGAAGAGGAGTCAGNANAGTACACTGGGAAGGAAGAAACG 

GGTTCTTTTCTCTT 

Sequence 1098 

TCGCCCCTGGGCCCTCCTAACCMCCAGGGGAGGGGAGAAGGACCCAATTCTTTTCCTTT 

GGTGCACGTAGCCTGGACCCGTTATGGACAGAGGCCAAAGGAAGATAACAGTGTGGTGTC 

CAGAGATGAAACCAAGTGGTTGATGGGCAGTTCTTTGAGCAACCTTGTTTATGAGCCTAT 

TGATATGCAGATATAGAGGCATCCAATACTATTGACTAATTTAAAATCTTATTCAGTGAG 

TCMCACTCTAAATMGCAATGGAGATGGTTCCATTCATTTTTTTGCAAGTATCATTTTT 

ATAMCATAMTTTCCTGAGATTTTTGTTTTCATCTTAGCCTCTGTGGAGCTGCTTCGTG 

GTTATGATAAGTGCTGTGTGATGCTCACCTTGGGAGGTCTGCGACATATATTGAAGTCAT 

CTCTAACCTGAAGTACTGACAGACTTTCTGGAAGAAAAGGCTTGTAGGAGGAAACTTCAG 
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' AATTCTATTAAATGGTGTAAATGATGAAATTATAGTTGATATATGCTAGAGCATCAGTGC 
TGGGTATTTTAGAAGGGATGGA 
Sequence 1099 

CGGGCCTGTTCCTAGAGCCTCATTGGAGACATTGACAATGCCATGAGGACCTTCCTCAAC 

TACTACACTGTATGGAAGCAGTTTGGGGGGCTCCCGGAATTCTACAACATTCCTCAGGGA 

TACACAGTGGAGAAGCGAGAGGGCTACCCACTTCGGCCAGAACTTATTGAAAGCGCAATG 

TACCTCTACCGTGCCACGGGGGATCCCACCCTCCTAGAACTCGGAAGAGATGCTGTGGAA 

TCCATTGAAAAAATCAGCAAGGTGGAGTGCGGATTTGCAACAAAAGATCGCTTTGGCTGC 

TTTGTGAAGAATAGATTGAAAGGGTCAAAGGTGAGAGCCATCTCACATCCATGCAGGAAC 

CAAGCAGGCAAGATATAAATATGAAAGTAGAAGAAAATAGTCTGGAAGAAAATCCC 

Sequence 1100 

CGCGTCCGGGAGTGCACCCCAAGATCTAACAGCTGTTTCAGAGCTGCTCATTTTAGAGTG 

ATTGGTAGGGAGTTGGTGGCTCAGAGGTCCTAATCAGAATGTGTCCTGGGTTCTGAATGA 

CTAGCAGACTATCATTAACCAAATAMTTATGGGATTTTGTCTTAATTATATACATATAC 

ATATACACACACATACACATACACATACATGTGTATATATTCCCTAAAACTTAATAAAGC 

TCAMTMTAAAATCAGATTTCTTMGTATTCCMTTCCCTTTAAMTGTAMTCAGATT 

TTATA ATTCTTTTGTTCAAAACTGTCCATTGGCTCCCATTTCACTTAAATCAAAAGCTAG 

TTTTTACMTMGCTAAGATAGCAAACATTATTATCTATTTACTTATGAGTTACTTATGT 

AACTCAAGCATCCAATAACACTGTAGGGTGCTCAATAAAATAGTTGCTGAATGGATAACT 

TTC 

Sequence 1101 

TGTTTTACGACAGAGCTTAGTGCAGCCNGTTCTTGATGGCTGTGCMTGCTTTCCTTTTA 

AGAGTGGAGTTAGCCTCGTCATAAAGCGTGTTTTJGAGTCTGTTCGAACGGGTCAACAAC 

GMGGGAAGTTTCAGGCAGATCTTGTATGCCTGGCCCTGGTGGCTGCmCATTTCCTTC 

CAGTATCAGTGCTAAACAGGAATGAACATGTTCAAGCCCCGTCTCACCCACCTCTGGCAT 

CTTCGCCCTAACTCTGCCCTAGAAGACCTTTCCTTCCGTATCGTCAAGAAAACTGAAGTT 

GCTGTTTCACTCCTTCTCCCACCCAGAAACTTCGCTGCATCTTCCTGGATCCCTAGCTCC 

TTGCACCCATGATCCTGTCTCCTTCCTCAGCCCGGCTTCTGGCTGAGCAGCCTGCACTTG 

CTGTCTTCACTCCTACACGCTGCCCCCACTCCTACACGCTGCCCCTGCGTGCTTNTCACT 

TCTCTACCCTTCC 

Sequence 1102 

GTCCGTATCCTAT CTTCA AATTTTTTAAATATGTTCAAAATATCTGGAGGGTGAGAAGTT 

ACCAAGTTTGTATGTTTTGTTTGACTCACCATCTTTATTTTCTGTATATGTAGTAGCTGG 

CMTTGCATATATTTTCTTGATTAACATATTAGAGACTGCTTCCATCATCTTATGTAAAC 

CTGGAMCAAGCTGAAACTAGTCTTTTCTGAAGAACCGTGTATCAGTGTTAGATGTGCAT 

CCCGTTTTGTCATTCCCTCAGACTTTGAATACAGTCATTACTCTCTGGAAGAGAAATGTA 

AGTATATTTTTTGTTATCTGCAGTATGGTTTAACATGTATTAATAATACACATATGCAGA 

CTCACTAAAGTATCCCCAGTAATTAGTAAATTCCAAAT 

Sequence 1103 

ANTTGGGTACCCCCGGCCNGGCCAGNTGCGCGCGGCGGGGCATGCTGCTCGTCCCCCCGC 

GCCCCCGGCCCGGACACTTGGCGGGTGCCACGAGGACCCGCAGCAGCACGTGCGGTCCCC 

CGGCGTTCCTGGGCGTGTTCGGCCGCCGTGCGCGGACCTNNGCGGGAGTTGGGGCNTGGG 

GGGCGGCNGCCGTTGGTNCGGACAGNCNGGGTGCGCACTTGGGCCCCCNTGNCCATGGCN 

GCAAAGGTGGACCTGAGCACCTCCACCGACTGGAAGGAGGCGAAATCCTTTCTGAAGGGC 

CTGAGTGACAAGCAGCGGGAGGAACATJACTTNTGCAAGGACTTTNTCAGGCTGAANAAG 

ATCCCNACATGGAAGGANATGGCGAAAGGGGTGGCTGT 

Sequence 1104 

TGCCNCGCGTCCGAGCATCTCAGGTAACAATTTGAGCATAACTTTAACCATAACTTATGA 
TAGCATAATAACATTCATTAGTAATTCAGTAGCCGTATGTGCCAGGCTGTGTTAGGTGCT 
TTATATATTGTTTAATTTTTAAAAACTTGTGGAGTGTACAGATTGGTAAGGTGACA7TGT 
ATCACAAAGCTAGTCTTTGAGTCCMAGTTTrGTGGTTTTATGTTATGATATACTTTTAT 




WO 01/070979 



PCMJS01/09126 



TABLE 1 

180/467 



' CATGGMTTGTCTTATTAMTGTTTTGCCAGTGGTTCTTAMGTGTGTTTCTGACACCAG 
TAGCATTGACTTCACTTAGAMCCTGTTAGAAATACAAATTATTTGGCCCCACCCAACAC 
TTGAGTCACAMCTTTGCAGATGGGGCTCMTCTGTTTTMCAAGCGCTTCATGTMTTT 
TGATGCAG GCCTA AGTTTTTGAGCCCTGCAGTATGCATTTCTATTTTTAAGCAAAGATCT 
TGGTCTTTCTTTTTGGACATTGTAGAMTAACATGMCCTC 
NTTTGGTTTGGT 
Sequence 1105 

ACGCGTCCGCTCTGGTCAAGCAGGCGGTACTTCTCCTTGGATGTCTCAGCCACAGTGCCT 

ATCAGGGTACTGAGGGAGAGCACACATGGCCCGAGGCCCTNGGAGCCCTCGGAGGCTGAG 

TCAAMGAGTCTCCCTCGMTTGGTGGGCCTTTAGAAGACTTGGCTTCTTCACTGGAGAG 

CTATAMGTAMCACCACACTGAGGGCCCTCGTCCCAGGMGGCCTTCAGAGCATTTTCA 

TTTCTGAACACGTCCCCTCATCTTTCMGATTTTCTGGTCCTCTAAAGCTGAGAAACTAC 

AAGCACTGAMTGAGATGAGTTTTGATAAGGATGGTAATGAAGCACAAAAGCGTTATTCA 

CATTACTCACTGACTTTAATATMTTTTGAAATATTTCATACTTTTGAAAAACAAAATAG 

CCTGGGCGACAAGANTGAGACTCCATCTCAAAGGTAAAANAAATTTAANCTGGGTGCCNG 

CCGCTTGACTATGTCTAGAGAAAAAAACTTCCACA 

Sequence 1106 

GACCCANAGAAMGGNGCCAMGGGCATGTCMGCAATTGAAGTTAAGCTCATGTTTTTA 

AAGATCCGTTTATTGAGATGATTTTGAMTGCTCCTTTACCATTCATAATTTAAAAATAA 

AGTTTMCAATGGTTTAAATTCANAATGGATTAAAATGGAGTTGGGGGTGGAAAGTAGAG 

CCATTCTTAGTAAATATAAATAACTGAAAAGTTCTTCTGAGGAGACTATGTACCGAAGTT 

ATCATTGCATCTTTCAGTATAGGCAGATCTCTCCCTCATATMCCGGATGTTTCTTGGCG 

CTTGGAATATCAGATAMGGTAMGTTTAAGMACTTCTCTAGCGGGGGATTTAGGGAAC 

TTCTTAAAACCTAGAGTTAAMGCTGTTGCGTGTTTGTTGTGTTATTTTAGACCAATCAA 

CTTCATAGGCTAGACTAGTCTAGA 

Sequence 1107 

ACGCGTCCGAAAATTCACAGGGTGTGTTGGCACACGCCTGTAATGCCAGCTACTCAGGTG 

GCTGAGGCATAAGAATTGCTTGAGCCTGGGAGGCAGAGATTGCACTGAGCCGAGATCGCG 

CCACTATACTCCAGCCTGGGCAACANACATCCTGTCTCAAATAAATTAAATTACATTAAA 

TGTTTAAGAAGAAGTCTAAATAAGATTCATATGCTGCCCTCCCTCAGATAATGAGGGAAC 

CTGGGGTACTTCTGGGCTACTCTGGGGGACAAAGTATAACTATTCAAATGGCAAGTTGAA 

TTAGTACAGTCTAGGAGCCTTGGAGATGGCTTCTTGAAAGAGGTAGAACCTGAAATTCTC 

CTTCCTTGAGGGACGGNCAGGATTTGGCCAGATGGAAAGGCAAGTGGAAGGCTTTGCAGG 

GACAAGCAATGTAANCAGANCCTAGAAATGG 

Sequence 1108 

TCGACCCCGCGTCCGGNGTMTTCTAGGGGAMTWATATTTCTGAACAACAATGTTGGTT 

TGTGCAGGAAAATCACCAAAGAACATGACTAGAAAGTGTATAGCTACAGTTTCCCTCTTT 

TAAATGGGAATAGCAAAACATATAAAGAATATTGATAGGCCGGGTGCGGTGGCTCACGCC 

TGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGCAGATCAAGAGGTCAGGAGATCGAGAC 

CATCATGGCTAACACGGTGAAACCCCGTCTCTACTAAAAAATACAAAAAATTAGCCGGGC 

GTGGTGGTGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATGGCGTGA 

ATCCGGGAGGTGGAGCTTGCAGTGAGCCGAGATCGCGCCACTGCACTCCAGCCTGGGTGA 

CAGAGCGAGAGACTCTGCCTCAAACAAAAAAAAAAGNNAAAANAAAAAA 

Sequence 1109 

CCGTCTCTGGCTTGGCCAGGTTTAAATTAATAAAAATGAAGATGAAAATAAGTTGTCAGA 

TTTAGGATGTATTTTAGAAACCCAACTGATAATTTGCCAACTAATTGGATGCAGAGAGTA 

AGAGGGAGACTCMGMCACCTCTMGATTTTTACCCTGATCAATGGGATAGGTGAAAGT 

ACATTAATGGAGATTGAGAATCCTGGTGGAGGTACAAGTTTAGGGGTACTGAAGAGTGCT 

TTTGGACATGTGAATTCTTAGAAGCCTACTAGATTCTCCAAATGGAGACATAAAACATAA 

TTGAATACAAAAGTCAGGAGTTCAGGAGAGGGCTGAGCTAAAGATACAAATTTGATAGAC 

ATGAGCATTTAAAAAAAACTGCATGAAAATACTAAAGATAGGCTGTCCTGCCTATGGAAT 
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' AGCCATTCTTTGATCCCTTTACTTTCTTMTAMCTTGGTTTCACCTTACTCTATGGACT 
TCCCCCAMTTCTTTCTTGTGTGAGGTCCAAAAACTCTCTGTTGGGGTCTAGATCAGACC 
CTTTTCAAGTACATCTTNCTGATGAACCACAAANGGATTATACTAAAGAGACCCCCCACC 

Sequence 1110 

CCGGNAATTTGCTTATTCTAATTGAGACACANTGGTGGGGAGTGGGGGTCGGGGACTACA 

CAGGTGCATTTTCTGAACATTTATAAAATGAAAAAGATGGAGGCTTGGCTAGAATGGTTA 

ATCCCCTTTTCATTCCTCTAATTCTATGACMTTTTTTTAAAAAACCAAACACMCCAM 

ATAAGAGTGGACAGTTGAGAATTACCTTTAGGTTCCCATGACCCTGAAGACTGTATTTGG 

CCTTG G ATCC ATT AAAAAAA AA AAAAAAAA AAAAAA 

Sequence 1111 

CCCACGCGTCCGCGGCCATTTCTGTATCCCCCTGCCTGGGTTTGCTGCCCTTTATGCTCC 

TACCTCACCAGGTACAAGGAACATGAAGATGGCTATATGCGGCTGCAGCTGGTTCGCTAC 

GAGAGTGTAGAGCTGACACAGCAACTGCTGCGGCAACCACAAGAGGGATCGGGCCTGGGA 

ACGTCGCTGAACGAGAGCAGCCTGCAGGGCATTATTCTAGAAACAGTGCCAGGGGAGCCA 

GGACGTAAGGAAGAGGAAGAGGAGGGCAAGGGTAGCGAAGGGACAGCCCTCTCAGCCTCT 

CAGGACAACCCCAGTTCTGTCATCCACGTGGTGAATCAGACCAATGCCCAAGGCCAGCAA 

GAGATTGTCTACTATGTGCTGTCTGAAGCCCCAGGGGAGCCTCCCCCAGCCCCTGAGCCA 

CCTTCAGGGGGCATCATGGAAAAGCTTCAAGGAATAGCTGAGGAGCCAGAGATCCAGATG 

GTTTGAAGGCCGCAGAGCCAGACCATTTCTTCCCAGGTCTGAMGTTTGAGCCAGGCAAG 

TGGCAGTGCCCCTAGTGGGCAGCCGTTGCCAATGGATGCC 

Sequence 1112 

CCCCCGCGTCCGTAATTTTAAAGMCCTTGTTATTAGAAAATCTCAGCCTAATACAATCT 
GMGTTMGAGTTTTAGCAGCATrGTTITTC^ 

GGCCAAACMGGMGAGTATATGCCCTTGTAMTGAGTCTTGTTTTGTTATTTAAATAGT 

CAGTCAAAACGTAGAAATCAGTATACGTAAAATAAAATGCATGAGACTATTAAATCTTTT 

CATATACTCTACAAATAAAATGAAATCTGTGTGTGGTCCTGGTTGACTGGGCATCTAAAG 

GGMTC AGA AAAG AGATTGTGAAMGTTATATATATATCCTCTTCCTTATTTTAGTTTTG 

CTTTTTCCTATTTTCCATMTrAAGTGCCGTITACMAGTGGCATCAAAAAATTGAAGCA 

GGCCAGGCATGGTGGCTCATGCCTGTGGTCCCAGCAGTTTGGGAGGCTGAGGGCAGGTGG 

ATCACTTGAGATCGGGGGTTCGTGACCAGCCTGGCCAACATGGTGAAAGCCCATCTCTAC 

TGGGAATATAAAAATTAGCCCGGCGTGGTGGCATGTGCCTGTGGTCGCAGCTACTTGGGA 

GGCTAAGACAGGAGAATTGCTTGGGCCCTGGGAGGGGGGAGTTCANNNGNCCTGGANCGN 

CCCCTGNNCTCNANCCCTGGCAACCANTGNNGACACCNNCTTAAAAAAAAAA 

Sequence 1113 

TCGACCCCGCGTCCGGTTTTTTGTCCCAGCAGTGGCATTTAMTTACTGTACTTTAAGAC 

ATGGAATTGCTGGAGGCTTGGAAACTTGAGTGCAATTTCCCTAGTACGACCTCCAAGGAG 

MTAGAGCAAMCAGTGGTAGGAAAAACTCTCAMATTTTACCCMTTGTATGTTTTCTA 

CATTGTCAGTATCTAGTTTTATATAGTTMTATGTACTTCTAAMTTTCTGACAGTG^ 

GGTGTATAAAACAGACCAAGCTCAAGATGTAAAGAAGATTGAGAAATTCCACANTCAACT 

AATGCGACTTATGGTAGCCAAGGAAGCCCGCAATGTTACCATGGAAACTGAGTGAATGGT 

TTGAMTGMGACTTTGTCGTGTACTTAGGAAGTAMTATCTTTTGAATTAGAGAAAGTG 

TTGGGACAGAAAGTACTTrATGTMCTMGTGGGCTGTTCAGAAGCTTAGAGGTCATTTT 

TTGTAATTTTNTTTTTAATTACTTTAGMGAGCTAGGGATGCAAATGT^ 

AAG CCTTTATTT ACTTTTTG G G A AA 

Sequence 1114 

TCGACCCCGCGTCCGATTCCTTCTTCATATATTATGTCAGAAGAGTTTGAGAAGAAATGG 

TATTAATTCTTCTTTMATGTTAGGTTGACTCACCAGTTAATGCAGCTATTTGGTCATAA 

ATGTTTCTTTGTTMTCACTTTCGATTACTAATTCAATCTGCTAGGTTATAGGTCTATTC 

AGATTTTCTCmCTTCTTGAGCCACTTTGGTAGTrTGTGTCmCTAGTGATTCGTC 

TTTCATCCAGGACAGCTAATTTGTTGTTAGACAGTTGTTCACAGTATACTCCTGTAATCC 
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' TTTTGTATTTC TGTAM GTTGGTAGTMTGGCTCTGCTTTCATTTATTATTTTAA 
AGTCTTCCATCTTTTGCTCAGTCMTATAGTGAAAGGCTTGATCTTTCAAATAATCTATG 
TTTATTCATTCTACTGCTCTCCAAACTTCTATTTTATTGATTTATGCTCTAATTATGCTC 
TCTATTATTTCTTTCATACTGCTAGCTTTGGATTAGGCTTATTTTGNCTTCTTC 
Sequence 1115 

GCCCCGCGTC CGGG ATGACTAATGAAAGCAATNAGCTTGAACATTTAGAAAAAATTCATA 

TATGATCTAMTTTTTATATTATCATTTCTGTGCCTTCTMTTCCTGCATCCTGTTCAAA 

ACATCTTTCCAGACATTAACTTACACATTGTATAAAACCGACCAAAATGATTTCCTAAAG 

TTCATGCAAAAAAAAAAAAAAACAACCTAATTTTCTGTTAATATAAAAGAAACTTCAGTT 

TACTGACCGTGAAACAGACTATGTACTGACATCCAGGGTAAAGTAAAAGACTTTTAAATA 

TTGGTCATTAAAGGACAGGAGCTMGCTAGCAAAGCAAAACATCTTTAGCACTTTGCAGA 

TCTC MGCAGTTMCCAGGCTCTGATTCCCTTCCACTGTTTTATGMTTAATTCCAGTTC 

TTTTCA TGTATC TTTGMCCTAAGATTATGAAGTMTTTCCCTATTAGGGACTAGAATGA 

CTTCAGTTTTTTCATTTGATAAAAATCAGMCTGCTACCTTTCCCTTTTT^ 

AAATGTAGATGAGTGCATTAAGGGTTGTMGGATCTTTATCATTTTATGNCATTCATTGA 

MATTGAAATGTTCATTCTTTTTAATGGTT 

Sequence 1116 

CNGCTTTCGTCTCTTCCTTTNAAGTTGATACCCTTCTTTTTCTTGTCATTTTG 

TGGGACCTCCAGM TAATG TTTCATGAAGTAGCATGTATCCATATCTGGTTCTTGACTTT 

TTCATCATTATAATTGTTTTCTATGGGTTACTTATCAGTTTMGAATGCTTMTTCCT^ 

ATGAAC TAAGAG TGTTTATTACATGTTGAGATTTATGGTATGCTTTTTCTTCCTCAAGAT 

AATGCATTTTTTGTA TTATCTGT TAATGTGATAGGTTATCCATTTGTGTATTTT^ 

TTGMCAACCC TTGAT TTTTTTGGATAAACTCTATTTGGTCATTATGCATCATTCTAT^ 

ACCCTGCTGAA TTTTTCATTT GCCAACATCTTATTTCAGATTCTTTTAATCTGTGTCCAA 

CMTGAGATTTGTTTTCTTTTGCAAT^ 

CCTTATMTGGAAAATACATATTTCTCAMCTTTTACACTGATATATTCATAGTATTTTT 

TTATMTTTGAAAMTCTTGTCAGTATCTGTATTAAGGCCTNCATTTCAGTTCTGCTATT 

TCATATTGCCTTMGGTGGCTATTTGGCTCTTTMGGACCCCGATTTTGATTTTGTCATT 

TTTAAAATAAACCCCATTTATGCTATAAAAAAAAA 

Sequence 1117 

GCCTTTTATGGTGATGGAATATGTCTCAGGAGGAGAGCTATTTGATTATATCTGTAAGAA 

TGGAAGGGTAAGCTGTTCTGCTtTMTTCTGTATGTATTTTGTNNCTNGNCCTTTATCCT 

TTACTAGCATCMAATGTCAGCAACCAATTTTAAGAGGTCTATTTAATAACCAGTTCCCT 

TAGTCATATATTTGTTTAGAATCATAMCTATGTAGAACTAGAMGGATCTTAAAGATTA 

TCTCCTTAGCCTGTTTATACAGATGTGGATACTGAGCCTCGCGGCTTATATGATTGCTCA 

CAGTMCGTGATTTATTAATGACGGAATTGGCTTGAGCCCCCAGAACTCATAATCCTCAG 

ACTTATGCTTCCAGGGTATACAAATACTTTGAATATGTATCTTAATGTAATTAATCGTAC 

CAAATATATTATTATTACT 

Sequence 1118 

GCGTCCGTTGTCATCTATTT ACTTT ACATATGTCATAAACCTAACACTACATGGTCATTT 

TTGTTTAMCAGTCAATTACCTTTTAMGGGATTTGAATAATAAGTCAAAATCTAATACA 

TTMCTGTGTAGTTAGCATTTCTGGTGCTCTTCTTTCTTTTCTGTAGATCCATACTTCCA 

TCTGGCATTATTTTCCTACTGCCAGAAGGACTTCCTTTAACATTTCTTGTAGTGTAGATC 

TGCTGGTGATGMTTCTTTCAGCTTTTGTMTTCTTTTGTCTTTGAAAGGTATT^ 

GAGTATAGGTTMTAGCTTTTTCCTTTCAGTACTCTAAAGATGTTGCTCCAGGCCAGGCG 

CGGTGGCTCACTCCTGTAATCCCAGCACTTTGGGAGTCTTGAGGTGGGCAGAACACTTGA 

GGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGTGAAACCCCCGTGCTTCTAAACATAT 

TAAAGAAAAAAAAGA 

Sequence 1119 

NCGTGACATGCTGGCTGCTAGTNAGCTCCCCCATGATTGTCAGCTTCCGAGCCCTCACTA 
GAAGCAGATACCACCCCACCACCATGTTTCCTTTAAAGCCTGCAGAACTGNGAACCAATT 
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' MMCTCTTTTCTTTATAMTTATCCAGCCTCMGTATTTATAGCAACACAATAATGGCC 
TAACACAACCTACAACTCTCTATATGTATTTGTGTGTATTTAAAACATGCAGGAAATAAC 
ACAGAATCCAAGGCACCCAAAAGTCATTAAAAAATGGAATCAAGAATTCATATGCCATTA 
TGAMTTAGCCAGTCCTAAMTCTGACCTCTCTGCATTTTCACATTATTCTCCTCTCTCT 
ATCCCTGCCTTCCTCCCTCCCTTCCTCCAACTGTCAGAATTGTCCTGTAATCAAACATGT 
TCACATCACAGCTTTTCATTTTCTATTTCCAATCAATTGACCAGTCTAGCCAAGTAGCAT 
CCTGGATCCCGTATTACATATTCCTAGGACAGGMGCCAGATTTTT 
Sequence 1120 

AGCTCTTTAGCAGGAGACAATTCTTAAACTTAAATTAAACTGAAAAAGCCACAGAAAAAA 

GGGTTTGACACCCTAAAGCCAGTGTCCAAATGAACGCTACGGTTGNCCTCATAGGTGTGT 

TATGAATGTTTACCCTGACCTCCTAGAAGAAAAGGAAAAAGAAAGGAAAGAGAGAGGGAG 

AAGGTGAGGGAAGGGGGGAAGAAAAGAAAGGAGAGAAAAGAAGACAGAGAAAAAACAGGG 

AGGGAGGMGCTGGGGAAGGAAAMGACCATTTGCTGACTCCGTTGTTTTATTTCCCAGA 

ATGATTCMTACCTCMGMGAT7TCACTCCAGMGTGTACAGAGTTTTCCTCAACAACC 

TTTGCCCTCGACCTGAAATTGATMCATCTTTTCAGMTTGTAAGAGTACACATTTTAAG 

CCATATCTTTTTCAGCTTGCATTGATTCTCAGGTGGCTAGAGCAGGACTTGGAGTGGTAA 

TTGGAGATGGAAGAACATCATACACTGTGTCTAAA 

Sequence 1121 

CGGCCGAGGTACTATAATGGTCCCCATCTTMTTTGAMGCGTTTGAGAATCTTTTAGGA 

CANGCACTGACGAAGGCACTNGAAGACTCCAGCTTCCTGAAAAGAAGTGGCAGGGACAGT 

GGCTACGGTGACATCTGGTGTCCTGAACGTGGAGAATTTCTTGCTCCTCCAAGGCACCAT 

MGAGAGAAGATTCCTTTGAAAGCTTGGACTCTTTGGGCTCGAGGTCATTGACAAGCTGC 

TCCTCTGATATCACGTTGAGAGGGGGGCGTGAAGGTTTTGAAAGTGACACAGATTCGGAA 

TTTACATTCAAGATGCAGGATTATAATAAAGATGATATGTCCGTATCGAAGGATTTCGGC 

TGTTGAGCCAAAGACTGCGTTACCCTTCAATCGTTTTTTACCCAACAAAAGTAGACAGCC 

ATCCTATGTACCTGCCCG 

Sequence 1122 

CCCTTTCGAGCGGCCNTNCGGGCTTNTACGCGGGGGCAGCGGGAAGCTCGCAGCAGCTGG 

GGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATACACCCAACTGGGAGAT 

GAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCATTTCGCGAAGGAGCA 

AGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGCAGCTTGAAGGAGCCA 

ACCATGGATTTGAGG CGTGT GAAGGAATATTTCTCCTGGCTCTACTATCAATACCAAATC 

ATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTTAACACCATCTTACTA 

ACCATTATTGCTATGGTGGTATACACTGCCTATGTCTTTATTCCAATCCACATTCGCCTG 
G CTTG G G AATTTTTCTC AAA AATAT 

Sequence 1123 

CCCTTTCGAGCGGCCGTNCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

ACTGCMAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1124 

CCCTTATTTTNGGCNTTNGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCA 

MAAATTGCCAAAATGCNACmCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGG 

TTTGGAGTGTGGAAGCCTNATCCCTTCANCATCAAGCTGGAATGGGGAATGAAGAATGGA 
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' NATGTGGTGCCCACTAGGCTACTGNTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTA 
TTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTA 
Sequence 1125 

CCCTTANCGTGNTCNCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TNTATAGAGGGTGGGAAAATAAACCANAAATCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCG^CTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCGTCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATTGTTAACACCTGAATTG 

Sequence 1126 

CCCTTTCGAG CGGCC GCCCGGGCAGGTA C I I I I 1 1 I I 1 I I IN I I 1 1 i I I NNGGACTTGCG 

ACAGTTCCACCTTTTACTCTCATTGGTAAAATCTCCTTTTMTTATTAAAMTATTGATA 

AATTTATTAAA TTAAGTC TTTNATTCTTTrGTAATCAGAAGAGGACATTMTGTTGCGTG 

TCTTGACTGT CI I I I I I GCTTTGTAGATTTATTTTGTGCTAAATGAGAACGATATGCATG 

TTTGTGNTTGATTTTTCCAGMGCAGTTACTTTMTTCTT^ 

TTTTGC TTTAG GCATTAGTTTCTTCTTCCTTTATAGNTTTTTCAAAATC 

TTATTTGTTTTGAMGMGTAAATTTGGGGTAATTTTTTCTTATCACGCCC 

AGTTAMAAATTACCAACTGATTGCATTTCTTTACTTAATTTTGCAAATCGATTTTAC 

CATCAAMAAAAGAATTTAANAATTAATTTACCTTGTTCAGGTCTTGGMTTTNCACGCC 

CTCTAAGACGMGAGCCACTTTTACTCTGCGTATCTMTAAAMTTCTTTGGCTTTTTTT 

GCTT 

Sequence 1127 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCAC 

TTTATAGAGGGTGTAAAAATAAACCAGAMTCAAGGGAGAAAGAAAAGATGAAAGACAAA 

CTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGMGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1128 

CCCTT AGCGTGGT CGCGGCCGAGGTACATGCTANAAAACATTANCACAGATACGACAGAG 

TGTGGi I I I N II I AG AAAT GGGTAATTTCTCTCTCCAGTATCCTTTCACTTGTATGAGA 

TATTTCTCCTCTCCTGTTTTCACAAACCAAGAAATCCCCAGGTAGGCCAATCCCAGAGGT 

GCCATTTAGCAGTATGCAGCAGCCCAGTTTCAGCATAACAAAACATGCCTTGGTAGTGGC 

TCTCTCATGCAAATAAAAGAAAGCTTAAGAAATTCTTGTTGTAGGTGGATTAGGCAAGGC 

TGCCATTCAGCTGGTATAAGCTAAAAGTAAAAAATCAAAACGCTCAAGAAAACGGACACA 

ATTTTGGMTGATTAMGATGTCTTTATAMGTTTTTTTCMGACTTCATTCTAMTA 

CAGAATAAAAAATGGGTGTCAGCTCACTTGTAAGACACCMCCAGATTTTCCTTATACTG 

TCTC AAAATTTAAAGATCAATTTCCCCAGAAGGTGTNCAATGCATCATAAAATGGCCCTT 

TTTTGAGGATGGGAGAGGAAGGGTTGGGCAGGATGGAATATTAAATTGTACATGGATAAA 

CATGCCAAGACTGTTATCCMTCTAGATMTTTATATACATTTTGATGACTTAAGGAAAA 

CAMGCMTCATTTGGTGACAGCCTAAAAAGCNTGACCNTATTTAACATACTTAGGAACT 
TTTTTNGG 

Sequence 1129 

CGTTCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAA 

AAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCA 

AAATGCGACTTTCTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTC 

AGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTG 

GAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCC 
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' CACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATG 
TAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTA 
TGTTAACACCTGAATTGCTGGGTCTTGAAGAGAGCCCAAGGAGTTCTGGGAAGAGGGACC 
AGATTGGGGGGTAGGGTCACGGGCTTGGGTGATAGAATTATTTCTCGMTGACTTTCTTG 
AGTGCCAATTTGMCTGTAACATTTGCTTANTCACCTTTAGNGGAGTAATCTCCTGGGCT 
TGGTTCTATATTTATATAAAAG 
Sequence 1130 

CGGGCAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTA 

CCAACTMCTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATT 

ACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGGTTCCAATAAGTTATCCC 

CAGTCTTAAANGTAGGTTAAGTACCTCNGGCCGGCCACCGGGNTGGAGCTCCAAATTNGC 

CCTATAAGTGAGGTCGGATTTACGCCCCGCCTCACTTGGCCCGNNGTTTTACAACCGTCC 

GTNGACTNGGGAAAACCCCTGGGCGTTTACCCCMNCTTTAATCCGCCTTGGCAGCACAA 

TCCCCCTTTTCGNCCAGGTTGGCGTNAATAANCGAAAAAGGCCCGGAACCGAATCGGCCC 

NTTTCCNAACANGTTNGCGCCAGTNCTGGAATNGGCNAAATGGGGACCCCNCCCCTTGTT 

AACCGGGGNGCATTTAMCCNCCGGCCGGNTGNTGGTTGGNTTACCCCCCAANNGGTGAC 

CCGNNTTCMCTTTGGCMGGGCCCCTAANGGCCGlvnTTCTTTTGN l MIM IC 

Sequence 1131 

CGANGTACGCGGGGCTTTCCAAACAGCATAAGAAGTGTTTGAGCCNTNAGTATACTGAAG 

GAAGGGCTCCCTCGAGTTCTGGTGTGAAGAGATAAATCACCAGAAAAATGATGGCATCAA 

AACAAAAAAAGAACTCATTGTGAATAAGAAAAAACATCTAGGCCCAGTCGAAGAATATCA 

GCTGCTGCTTCAGGTGACCTATAGAGATTCCAAGGAGAAMGAGATTTGAGAAATTTTCT 

GMGCTCTTGAAGCCTCCATTATTATGGTCACATGGGCTAATTAGAATTATCAAGAGCAA 

AGGCCACCACAGACTGCAACANGCCTGAATGGAGTCCTGCAGTGTACCTGCCCGGGCGGC 

CGCTCTTAGAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGAT 

ACCCGGCNGACCTNGAGGGGGGGGCCCCGGTACCCAACCTTTTGT 

Sequence 1132 

AGCTCACTCAAAGGGCGGTAATACGGGTTATCCACAAGAATCAAGGGGATAACCGCAGGA 

AAGAAACATGTGAGCAAAAGGGCCAAGCAAAAGGGCCAGGAACCGTAAAAAGGGCCCGCG 

TTGCTGGCNGTTTTTCCATAGGGCTCCCGCCCCCCTGGACCGAAGCATCACAAAAAATCG 

ACGCTCAAAGTTCAGAGGTGGGCGAAAACCCCGGACAAGG 

Sequence 1133 

CCCGGGAACAAAAGCNGCAACCGNGCCCCCCCTCCAGGTCNACGGNNTCGANAAGCNCGA 

AAACCGMTTTTTGNAGNTTTNGGGGACCCACTANTTNGNGAGCGGGGGCCGANNNAGNG 

CGGC 

Sequence 1134 

CCCCCCCNCGAGGNCGACGGNANCGANMGCAGATNNTNMTTTTTGCAGCCCGGGGGAT 

CCNGGAGNGNGAGAGCCCACCGNGGGGGNGCGCCAACACCCCCGANAGAGAGNCGAANNA 

CGCGCGCNCACNGGCCGNCGANAAACAACGGCGAGACNGNGAAAACCCGGGCGANACCCA 

ACANAAGCGCCAGGCAGCACANACCCCAANCGCCAGCNGGCGAAACANNnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGANACCGCGCAGCGNGACCGNAACACNAGC 

CAGCCGCCCNAGCGCCCGCNCCAAGCGCNNANCNNCCCANACCAACANCGCCACGNACGG 

CCGGGCAANCCCCGGCAAGCACAAAAAGCGGGGGGGCNCCCTGGAAGGGGNNCNCGAANA 

AAAGGCAAGACGGGNACCNCNGAANCCCCAAAAAACC 

Sequence 1135 

GGGAGTNNGGGAGCGGCCGAGGGCCAAANGANCCGCGAAGACAAGGCCATCCACCACTNN 

ANAGAGGGGGNAAAAACAAACCAGAAACCAAGGGAGAAAGAAAAGANGAAAGACAAACNG 

CAAAAAANNGCCAAAACGCGACNANCAAAAAANGGAGCAGANGCNGAGGCANNGCAAGNC 

ACGGCANACCGNCAGGAGCCGAANGAAAAAAAGCAACAAGCAAGANGAAGACNCTGAGAG 

GGGCNCGGAGGCCNGGAAACCNCATCCCTNCAACAACAAAGCNGGAANGGGGGAAANGAA 
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' GAANCAGAAGAAGGGGGCGCCCACNAGGCTACNGCNGAAAGGGAACCGAAAAACCCCNCA 
CCAANNAGGNATNCAAAAAANNGGAAACGGACCGGGNCANGGCAAAAAAAANGGAACCAA 
GACACCGGGCCCANACCACNGGACCANGGGGNAANGGAAACACCCCGGAAATGGCAGGGN 
CCCCGAANAGAACCCCAAGGGAGCCCAGGGAAGAGGGACCCAAGATGGGGGGGGAAAGGC 
CCACCGGGCCNGGGGNGAAANAACAAAGCCACCGAGGGCCAACCGNAAGNGGCCAANNGA 
ACCCGGAACNANAAGGCNNAANCCACCCTGAAGNGGGAGGAAAACCCCCAGGGCNGGGGG 
CCAAAN AN N AAAAA AAAGCNGCCCA AAACCCCCA A 
Sequence 1136 

NTTTAATTTTTTGCAGCCCGGGGGANCCAGGGGNAGGGNGAGCCCACCGCGGGGGAGCGC 

CAANCGCCCNACAGCGAGNCGNAANACGCGCNGCCCACNGGCCCGCCGNANAAACAACGN 

CGAGACGGGGAAAACCCCGGCGNCACCCAACNGAAACNGCCANGCAGCACAANCCNCAAN 

CGCCAGCGGGCGGAANAGCGAAGAGGCCNCGCACCGAACGCCCANCCCAACAGNGGCGCA 

ACAGAAAGGGCGAAA 

Sequence 1137 

TTTGCANTTTNGGGGACCCACNAANTGGNGAGCGGGCGANGAACCACACNNACGGCCCGA 

GNCCNNGNCNACNGGNCNAGAGNGANAANCACAGGGACACCGANCANCCCTNNCGGCANG 

GACCCACAGCAGCAGGNGGANNGCCATACCGAGANGAAGGAAGACNCAGACANCGCCAAC 

CATNCCCGAGNGGACCNGCCCGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnACGNNAAACAACCGGCGAGACNGNGAAAA 

CCCNGGCGNNACCCAACNNAANCGCCGNGCAGCACAACCCCCNACGCCAGCNGGCGNAAA 

AAGCGAAGAGGCCCCGCACCGAGCGCCCNTCCCAACAGGGGCGCAGCCCGAANGGCGAAA 

NGGGGACGCGCCCTGGAAGCGGCGCAANAAAGCGCCGGCGGGAGGGGGGGGANACNCCCA 

NNCGGGACCGCNACAACNANGCCAAGCGCCCNAANCGCCCGGCACCCANACGCTANCNAC 

CCNNCCNANCCACGCCCACGGGANCGCCGGCNANCCCCGGCAAGCCNNAAAAAACGGGGG 

GCNCCCCNNAAGGGGNCCCGACANANGAGGCCNNACCGGCACCCCGACCCCCANAAAACC 

AGGAAAAAGGGNGGAAGGGNCNACGGAAGGGGGCCC 

Sequence 1138 

CACGCGTCCGGAAAGGAGGGGAGGGGTGGGGCGGGCCGGATCTGCTCGCAGGCCGCACCC 

GCCTCCGGCTGGATGCTCAGAGTCTCGCCCTGATGGCCAGACTGGAGTGCAGNGGCGTGA 

TCTCGGCTCACTGCATCCTCCACCTCCTAGGTTCAAGTGATTCTTCCACCTCAGCCTCCC 

CTCCCAAGGAGCTGGCATCACAGGACAGGCACGGAGACTCACGCCTGAAATCCTAATACT 

TTGGGAGGCCGCGGCAGGAGGATCAGCTGAACCCAGGAGTTTGAGACCAGCCTGCCAACA 

TGATGTTGTTATTCATGAGGACCAATGGGTTGGCGAGACAGTACTACAATCAACATTTAG 

CAGTCAGTTATTAMTCTTGGGAGTTATTCATCTATTCAGCCTGAAGAATATTCCAGTGT 

AGTTAGTGATGTTGTACTTCAAGACTTACTGGCATATGTGTNCTCAAAACATTCC 

Sequence 1139 

TTACCCCGCGTCCGGGTAAACAAAACAAAGATCGTTTGTTCTGGAAACAGGTAAAATGGT 

AATCAAATAGATTGTGTTCCAGGAGTGCAAAGGTGGCTTAATATTCACAAATCAGTTGCT 

ATTGTACACCACCTGTAGAAAAGTAATCTGGCATGCAGAACATTCTTATGGTAAAGTTAA 

TGTTCATTTATGATCTTAGCAAATGATGGATTGAMGGGACTTCCTTAATTGCATAAACA 

GACTTCAACAAACAGTATGATGAAATAGTGAACATTTCTCCTAAGATTATAAAAATAAGA 

CMGGATATCTGCTGTCAATGATTTTATTCAGCATTGTTCAGAAGGACCTAACCAGAAAA 

CTAATGCAAGMACAGAAACAAAAGGCATAAAGATTANAAAAGAAGTAAAATTTTAAAAA 

AGAMMGAATATAMTCTCTATTTGCATATGCCATGAGTAMTTTGGTMGTTCCCTGC 

ATAAAAGTTATGCAAAAAGCATTTTTTTGATATACCAGCCAAAAATCAAGGGAAATGGAA 

AA 

Sequence 1140 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 
AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 
AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 
CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 
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AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTC 

TTGAGAGAGCCCAAGGAGTTCTGGGGAGAGGGACCAGATTGGGGG 

Sequence 1141 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGIXTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1142 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAG 

Sequence 1143 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGTGGATA 

GAGAAACTGAAGCTCTCTAAAGACCCTGCAGCTGGGAGGTGGCAGAGTCAATGGCAGCCC 

TCAGCCCTATCTGCCCTGACATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAA 

TAACAATGTGTGGGGTC CTTGG CTGAGAGACTTNCCTTTTGGGAATCAATCTGAATGTAT 

GATGACAAAGAMACAACTTTTGCTTTATACAACCTTCTGGTTAGATTCAGGCACCAAGC 

AGGACACTTCTTTGTGGCGCTCCAAGAATCTTTCAAATTCTTCATCACCAATAACAAATC 

TTTCTGCTTCTCTTAGAGCATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGA 

ACACTTTCCAGCGGACAGGGTTTAGTGCTTTGATCTGTTCCGTCATGTCCTCTTCCACGT 

TG AAACG ATTAATG ACAG AA I I I I I I I I GGAGGCGACTCTATTAATCCCTACACCACCTN 

CTCAGCTTTTGAAGGGTTTNCACATGGGTTCTTTT 

Sequence 1144 

GNAGCTCCCCGCGGTGGCGGCCGAGGTACGCCACCATGCCTGGCTMTTTTTGTATTTTT 

AGTGGAGACGGGGTTTCACCATGTTGGCCAGGCTGTTCTCGAACTCCTGACCTCGTGATC 

CACCCACATTGTCCTCCCAAGTGCTGGGATTACAGGCGTGAGCCACTGTGCCATGAGGAT 

TAGTAAAGTGCACTCATGGTAAGTAAAAAAATTTGTTTTATGTTNATGCTGATTATATGA 

AGGTCATCATAGCTTAGACACAATCAAAACCCATGGGGAACATCTTTAGAATTCATTTTT 

CTCTTTTCTTACAAAAAAAGTAAATAGGTAAAATGGAAAATAGAAGACAACCTATCCTAT 

CCTGGAT GAGACACACACATATTTAAATTGAATTATAGACTTAAAATTTAAGTAGGGANT 

i I I 1 1 1 1 iNTGNGNAACAAAAGTTTNCAAAAAACCCAAAACTTTTNAGAATCACCCAGTT 

NTTGGAAAATATGATTATGAAAGCAGACTTTTTGGATGGGNGCTTAATGACATTTAGGCG 

ACATTTAAAATGCCCCTAGGNGNGGGAACACTTGAAATTGCCANCTAAATTAAAATGACC 

CTTTTMTTTGCCTGGACAACAMAAANTTTCCATGATTTTGGCTTTTTTTGGAACAANN 
GANNAAG AAAA I 1 1 I I I 1 1 I IAG 
Sequence 1145 

CNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCANGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 
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ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 
Sequence 1146 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCACGGTACTGTGTCAACAT 

GCAAGGCCAGGTGGCCGTGAGGAATGTCCCAATGACAGGCTCTATCAGTCATGTCCTGGT 

GCAGCTCCCACATGTCTTCCAGAGGATGGAAGCTGAAAACCTAGCTTCAGTGATTGATGC 

CAGGTTTMCTTTTTTGTGMCAAGATTTTGCCACAGTATCGTGATGCAGTCATGTCTCA 

CACGCTCATCTATATCCCCTCCTACTTTGACTTCGTGCGTCTTCGAAATTACTTCAAGAA 

GGAGGAATTGAATTTTACCCACATCTGCGAGTACCT 

Sequence 1147 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTNTTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGGGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGTAGGTCC 

Sequence 1148 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTTCT 

GCGCAGCTTCCCGAGGCTCCGCACCAGCCGCGCTTCTGTCCGCCTGCAGGGCATTCCAGA 

AAGATGAGGATATTTGCTGTCTTTATATTCATGACCTACTGGCATTTGCTGMCGCATTT 

ACTGTCACGGTTCCCAAGGACCTATATGTGGTAGAGTATGGTAGCAATATGACAATTGAA 

TGCAAATTCCCAGTAGAAAAACAATTAGACCTGGCTGCACTAATTGTCTATTGGGAAATG 

GAGGATMGAACATTATTCAATTTGTGCATGGAGAGGAAGACCTGAAGGTTCAGCATAGT 

AGCTACAGACAGAGGGCCCGGCTGTTGAAGGACCAGCTCTCCCTGGGAAATGCTGCACTT 

CAGATCACAAGATGTGAAATTGCAGGATGCAGGGGGTGTACCTTGGCCCGCTCTAGAACT 

AGTG 

Sequence 1149 

TGGGCGAATTQGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGGGGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGG 

Sequence 1150 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 
TGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 
NAAAGAAAAGATGAANGACAAACTGCAAAAAATTGCCAAAATGCNACTTTCTAAAAATGG 
AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATTGAAAAAA 
Sequence 1151 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTG 

TGAGGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGA 

GAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCTAAAATGCGACTTTCTAAAAATGG 

AGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCA 

ACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATC 

AAGCTGGAATGGGGAACGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 
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Sequence 1152 

CCGCGGTGGCGGCCGCAGTTACCTGACGTATGACAACCCAGATATCTTGAAGAGGGTGTC 

AGGATCAAGATCAAATGTGTTCATGAGGTTTAAAACAACTGCCMGGATGGCCTTTTGCT 

GTGGAGGGGAGACAGCCCCATGAGACCCAACAGCGACTTCATTTCCTTGGGCCTTCGGGA 

TGGAGCCCTCGTGTTCAGCTATAACCTGGGCAGTGGTGTGGCATCCATCATGGTGAATGG 

CTCCTTCAACGATGGTCGGTGGCACCGAGTTAAGGCCGTTAGGGATGGCCAGTCAGGAAA 

GATAACCGTGGATGACTACGGAGCCAGAACAGGCAAATCCCCAGGCATGATGCGGCAGCT 

TAACATCAATGGAGCTCTGTATGTGGGTGGAAT 

Sequence 1153 

GCNMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGMCTAMTTTTTTAAACTTTA 

TTTGCTGTTAAATTCTGTGMGTTTCAGTTATCTAAAATAAATATACACAAATATGAAAT 

ATMTGTTTCAGATTGCAAGGTAATATGTAATAGTAGTGTTTGTAAGATACTCTTGTCTA 

ATATTMCTAGTAGTATTTTGATTTGTACAATGTCACCCTCCCAGCAACAAGAAGAACAA 

GCTACTGMTCAGTGTCCCTTTATTACTATGGCATCAMGATTTGGCTACTGTTTTCTTC 

TACATGCTAGTGGCGATAATTATTCATGCCGTAATTCAAGAGTATATGTTGGATAAAATT 

AACAGGCGAATGCACTTCTCCAAAACAAAACACAGCAAGTTTAATGAATCTGGTCAGCTT 

AGTGCGTTCTACCTTTTTGCCTGTGTTTGGGGCACA 

Sequence 1154 

GAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGGGGT 

Sequence 1155 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAATGTGTGAAGACAAGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAMCTGCMAAAATTGCCAMATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTT 

GCATGTCTTGGCATTTCCTTCAGGAGCTGAATGAAAAAATGCANCAAGCANATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCAC 

CAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGG 

CCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGG 

Sequence 1156 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTC 

ACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGA 

GGCTGGCTCAGGTTTGACTCTATCTTGGAAAAAMTAGGAAMCTTCATTTATGGAATAG 

TTTTGAATMCCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACG 

GGTAGAATGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTC 

TGG AATTG GTCTGACCCMTTAACACACGGCCTCTGATGGGAATAGATGTATTTTGGGGA 

CACATTTTAATCTGATAGCTGTAACCCCTT7TGAGTTGGCTTTTGTTCACTGGAATCCCT 

TTCCAGTCAATGAATTTCCGAGAAAAATTCAGAGGAAGAGCTGTCGGAGGCACCAGAGTG 

CTGATGTTTTCT 

Sequence 1157 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 
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TCATTTCGTGAAGGAGCAAGGCTGMGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCAATACCAMTCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATG 

TTTMCACCATCTTACTAACCATTATTGCTATGGGTGGGTATACACTGCCTATGTCTTTA 

TTCCAATCCACATTCGCCTGGGCTTGGGMTTTTTCTTCAAAAATATGTGGATATCACAG 

GNCCTCGGCCCGCTCTAGAACTAGTGGGATCCCCCCGGGCTTGCAGGGNAT 

Sequence 1158 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGA 

AGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATA 

CACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATC 

ATTTCGTGMGGAGCAAGGCTGAAGMCCAGACCTGGACTTTCTTAGGACAAACTTACTG 

CAGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTA 

CTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTATGT 

TTAACACCATCTTACTAACCATTATTGCT 

Sequence 1159 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGCAGTGGGAA 

GCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGATAC 

ACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGATCA 

TTTCGTGAAGGAGCAAGGCTGAAGAACCAGACCTGGACTTTCTTAGGACAAACTTACTGC 

AGCTTGAAGGAGCCAACCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTCTAC 

TATCMTACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCGATCTATGTTT 

MCACCATCTTACTMCCATTATTGCTATGGGTGGTATACACTGCCTATGTCTTTATTCC 

MTCCACATTCGCCTGGCTTGGGAATTTTTCTCA 

Sequence 1160 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCACAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAMATGGAGCANATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTNAGCATCAAGCTGGAATGGGGAATGAAGAATAGAN 

ATGTGGTGCCCA 

Sequence 1161 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCGGAAGGTGGGTGACGTGCGGA 

TCTTCTTCTTTTTGTGGCTGTGGACACCTTTCMCACTGCCTTCTTGGCCTTTAAAGCCT 

TCGCTTTGGCTTCAGCTTTAGGAGGGGCAGGAGCCCATCGCAAAACCACGCTGCGGAGAG 

AGGGGCGGGTAATGTAGCCCGGTTGAACATGAACCAGAAGGAAAATGGTTAAAGCTGAGG 

GCACTAATTCCTTACAGGCCCGGGGACATGGAGCTCCAACCAGTGGATGCATGTAGCTTC 

CCAGAACCGAATGTCTGCCCCGCGTACCT 

Sequence 1 162 

CCGCGGTGGCGGCCGAGGTACCACTCTATCCATCGNGGATAGAGAAACTGAAGCTCTCTA 

AAGACCCTGCANCTGGGAGGTGGCAGAGTCAATGGCAGCCCTCAGCCCTATCTGCCCTGA 

CATGGCATTCCTCCCATTTCTCACCACCGAACCCTCTAAAATAACAATGTGTGGGGTCCT 

TGGCTGAGAGACTTCCCTTTTGGGAATCMTCTGAATGTATGATGACAAAGAAAACAACT 

TTTGCTTTATACAACCTTNTGGTTAGATTCAGGCACCAAGCAGGACACTTCTTTGTGGCG 

CTCCAAGMTCTTTCAAATTCTTCATCACCAATMCAMTCTTTCTGCTTCTCTTAGAGC 

ATCTTCTCCACAATTCTCACCCTCAATTAAGAGGCACTGGAACACTTTCCAGCGGACAGG 

GTTTAGT 

Sequence 1163 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 
CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 
AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 
TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 
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AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGAC 

TAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCT 

Sequence 1164 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNGGCACGGTNCCAAATGAAC 

GTGTGAAGACAAGGCCATNCACCACTTTATAGAGGGTGTAAAAATAAACCAGGAAATCAA 

GGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAA 

ATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAA 

TGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAG 

CATCAANCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCTGAAATTCCTCCACCAAGTT 

Sequence 1165 

TTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCATTTTC7TTNTCCTGT 

CCATAAATCTTCTTCCACCACGTGGCTGTGTNCAAGACTCTCTGAACCTNCTCTGGGTCA 

GGAGGCTTNTAGATNTGTGAATTGTCTGCTCAGTNNAACTCCATTAAATTNAATNTGGCC 

MGMNTTTCTTCTTAACAGNGGTATTGATGACCATTAATCCTTCAACCTAAACCTGCTC 

ATTAA 

Sequence 1166 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTT 

Sequence 1167 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAACAAGCGTTTGT 

MTGTTTCCCAMTATTAGCTTTGAAATCCAAATGTCAAGCAGGTTAAAGTTCANAAAAC 

TATAMTGTMTTGTTAATAAAACAATGGATGAAAAAAGTCATTGAMTTTTTTTCTACT 

TGGATTAAGMCATAAATTAAAAGTGCAACTGCAAAAATAATATTAGTTTGATAAGTAAA 

ATAMACMACTAGATATATTTTGAMATAAAAAACAAATCBAAACAAAATAAAATTTAGG 

TAAAGAMATTCMCGTAATTTGTTGTAGCTATATTTTTTGTMTAATTACAAAAGTAGA 

AAT MTA GCTCATAMGCAAMACAAAATTTATTCTATGTTCTTTTTTTCAGTCAATTCA 

GAATTTT AGCTT CATATTTGMGCATTTTTTTCTMTTTTGTT^ 

TTGCGGATTTTGATTTGCCATAAAMTATTATATTNTATTTMTTATATTMTCTTCGTC 

AGCTTTAATTGCTCTTCTTTAAAATTTGATCTGAAAT 

Sequence 1168 

CCGCGGTGGCGGCCGAGGTACCCTTGTCCTCTTCTTCAGTGACTTAAMCAATTCCAGGA 
TCAGAAGAGAAGCCAACGTGACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCT 
GTTCACGMGTGGTTGAAAAACAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCAT 
CCAGGTGTTCACAAAAAATCAGAGAATCTTTTTNAGGGGGTGGNCGClsrrTMCCCTTTTN 
NGGTTNANTGAAAATCCCCCCCCTGNI I I < I I I I G AGAGGTTNAAAAA I 1 1 I 1 1 < I I NAA 
AAAAAAAACCCCCCCCCCNGGGNATTNTNAN I I I I I I I I II NNNNNCCCCNNCAAAAATT 
I I I I I I I AACCCCCCCCCNNAAAAAAAAAAGGG I I I 1 1 I I I I I I I I AAAAAACCCNCNGN 
TCNCCCNNNCCANAAAAAAAAAAANGGGGGTCCCCN I I I I I I I I I I NTTNNCCCCNGGGN 
I I I I I I I I I I I I I I I NTTCTNNNNNNGGGGGGGGGGAAAAAAAAANAAA I I 1 1 I I I I I I I 
TGGNGGGGGGGGNNGCNCCCNCCCCCCNCGGGGGGGGNGGNTTAAAAAAAAANGNCCCCC 
CCNGGGGGGGAAANNIANTTTTANTNNNNTCCCCCCCCCCCNGGGGGGGGGG 
Sequence 1169 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 
CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 
ACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 
TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 
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' GAGGGGTTTGGAGTCTGGAAGCCTCATCNCTTCAGCATCAACTGGAATGG 
Sequence 1170 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGA 

GAGGGGTTTG 

Sequence 1171 

GNGGCGGCCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGG 

GTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAA 

TTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCAT 

TCCTTCAGGAGCTGAATGAAAAAATGCAACA 

Sequence 1172 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTAGGTCAG 

TTTAAGAAGAGTCAGCTCAGAGAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATC 

AGGGAACAAAATCCTCTCCTTGTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTA 

TCTTATTGCCTAMAAAAATTTCTTATCATTGTTTCAAAAAAGCAAAATCATGGAAAATT 

TTTGTTGTCCAGGCAAATAAAAGGTCATTTTAATTTAGCTGCAATTTCAGTGTTCCTCAC 

TAGG TGGC ATTTAAATGTCGCCTGATGTCATTAAGCACCATCCAAAAAGTCTGCTTCATA 

ATCTATTTTCAAGACTTGGTGATTCTGAMGTTTTGGTTTTTGTO 

AAAAAAAATATTCCTACTTAAATTTTMGTCTATAATTCAATTTAAATATGTGTGTGTCT 

CATCCAGGATAGGATAGGTTGTCTTCTATTTTCCATTTTACCTATTTACTTTTT^ 

GAAAAGGAGAAAAATGAATTTCTAAAGATGGTCCCCATG 

Sequence 1173 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAG7TCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCTGCACACTGTGTTTATGACTTGTAC 
CTGCCCG 

Sequence 1174 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 

GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGCNTGCACACTGTGTTTATGACTTGTA 

CCTGCCCG 

Sequence 1175 

AGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTNNAGGTGTTCACAGCTGCT 

TCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTGCC 

CAACTGGGTTTCCCAAGCTATGTGAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGGG 

CAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGCA 

TTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCAG 

TGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTCC 

TTGCTCTCGCAGTGGCCCTGGCAGGCCAGCCTTCAGTTCCAGGGCTACCACCTGTGCGGG 
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' GGCTCTGTCATCACGCCCCTGTGGATCGTCACTGNTGCACACTGTGTTTATGACTTGTAC 
CTGCCCG 
Sequence 1176 

CCGGGCAGGTACAACAAGCGTTTGTMTGTTTCCCAAANATTAGCTTTGAAATCCAAATG 

TCMGCAATTAA AGTTCAA AAAACTATAAATGTAATTGTTAATAAAACAATGGATGAAAA 

MGTCATTGAAATTTTTTTCTACTTGGATTAAGAACATAAATTAAAAGTGCAACTGCAAA 

AATAATATTAGTTTGATMGTAAAATAAAACAAACTAGATATATTTTGAAAATAAAAAAC 

AAATGAAACAAMTAAMTTTAGGTAAAGAAAATTCMCGTAATTTGTTGTAGCTATATT 

TTTTGTAATAATTACAAAAGTAGAMTAATAGCTCATAAAGCAAAAACAAAATTTATTCT 

ATGTTCTTTTTTTCAGTCMTTCAGMTTTTAGCTTCATATT^ 

TTTTGTTTGTGAATTTGAATTTGTTTG 

TATTTAATTATATTAATCTTCGTCAGCTTTAATTGCTCTTTCTTTAACMTTTG 
MTTTGTTTTGGTGTTATTTCATAGTGATCAAATTGCATTTGATAAGTTCCACGACCTGA 
TGTCATAGACCTTAATTGTGTTGAGTATCCAAACATTTC 
Sequence 1177 

TAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACCTACGGAAATCCT 

AACTACCACTGGCAGGAAACTGCATATNTTCTGGTTTACATGAAGANGGAGGGCTAANGG 

AAATGCCCAAAACCTTCAGAGATTGACACCGCTGTCATTNTCCATNTCNGTTCCTGGAAT 

CTACCGGGAGTNTTNATAAGAAGANTTTTGCAAATNGAGGGAAGAAGCAATTGTTTTCAA 

ACTATATMCTGGAGNCCTTAATTTATMTTAGGGGATATTTAATCAAAAATATNGTAAA 

CCATGGAGGGCCCCCTCAGNGTNCTGGATCAGGTTCAAGAAATNGAAATGIsrTTTTCACCC 

AAGNCANGACCCCGGCCATGTGGGCATGNTCCGGGTNCCTGGGGGTGGCNTCGNCTGGCT 

TGTGGCGMNGAACAATTAAGCCCCTTTTAAGTTTATTGMGCCCTGNGGGGAMCTTTA 

AGGGGGTTTCCANAGTTGGGGGANGAAGCANTNGGNNAGTTGGNGAAGGGCATTTTGGGG 
GGG 

Sequence 1178 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGAC 

ACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

Sequence 1179 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAAATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGG 

Sequence 1180 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTATTGCCAGGAAGATAGGCAGCTCATCTGTG 

TCCTGTGTCCAGTCATTGGGGCTCACCAGGGCNNCCAACTCTCCACCCTAGACGAAGCCT 

TTGAAGAATTAAGAAGCAAAGACTCAGGTGGACTGAAGGCCGCTATGATCGAATTGGTGG 

AAAGGTTGAAGTTCAAGAGCTCAGACCCTAAAGTAACTCGGGACCAAATGAAGATGTTTA 

TACAGCAGGAATTTAAGAAAGTTCAGAAAGTGATTGCTGATGAGGAGCAGAAGGCCCTTC 

ATCTAGTGGACATCCAAGAGGCAATGACCACAGCTCATGTGACTGAGATACTGGCAGACA 

TCCAATCCCACATGGATAGGTTGATGACTCAAATGGCCCAAGCCAAG 
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Sequence 1181 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 

CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 

TGAMGACAMCTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 

AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 

AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 

GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 

CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 

CACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGA 

GCCCAAGGAGTTCTGGGAGAGGGACCCAGATTGGGGGGGTA 

Sequence 1182 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGC 

TGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTG 

GACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 

Sequence 1183 

TCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGC 

TGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

GAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

ACTAAGACACTGGCCATACCACTGGACAGGGTTTATGTTAACACCTGAATTGCTGGGTCT 

TGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTAGGTCCCGGGCTTGG 

TGATAGAAATATTTCTCGATGACTTTCTTGAGTGCAATTTGNACTGTMCATTTGCTTAA 

TCACCTT 

Sequence 1184 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTAT 

TATCTAACTTGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACNGCCATAATGCC 

GTGCACAGNTTGAAAAGCMTTAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTTN 

CTCAAGCAGGCCTGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAA 

TAGCAATAGCAATAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCT 

NTCCCGGATCAGGCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTNTACTTT 

GCAGAACCTCACCMCTTGCCCAGGTNTTCTCCCCGGTCTTGAAGAAATGGCTCTCCACC 

TGAAAAGTNNGATCTTCTCCATACCAGCTTCTTAAGCMMGCAATCCTCTCTTTGCTTC 

CTCAAGGGGCA 

Sequence 1185 

TAGGGCGANTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAA 

GGCCATCCACCACTTTATAGAGGGTGTAMAATAAACCAGAAATCAAGGGAGAAAGAAAA 

GATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTC 

TGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGA 

TGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAA 

TGGGGAATGAANAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

TCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATG 

Sequence 1186 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
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' CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 
TGMAGACAAACTGCAAAAAATTGCCMAATGCGACTTTCTAAAAATGGAGCAGATTCTG 
AGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATG 
AAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATG 
GGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTC 
CTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGA 
CACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAG 
CCCAAGGGAGTTCTGGGAGAGGGACCAGATTGGGGGGTA 
Sequence 1187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA . 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAAMTATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCAT 

ACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGT 

TCTGGGAGAGGGACCAGATTGGGG 

Sequence 1188 

CCGCGGTGGCGGCCGAGGTACAAGATANTCATCTCAGTAAAAGGTCTATTATCTAACTTG 

CCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTT 

GAAMGCAATTAGAGTAAGCMGATTAGTTTTTCCTCCCTTTCNAAGTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGNCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGT 

Sequence 1189 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGGAAGAAMGATGAMGACAAACTGCMAAMTTGCCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAA 

AGAATTGGACTAAAACACTGGCCATACCCACTGGACAG ♦ 

Sequence 1190 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGCGAAGCTAGA 

GACTGTMCCTGAAGATTGGGACMATTAAAGAAAAAAATGTGATTTAACACAATTACAA 

MACTGTTACGTTAGGGTCAAACMGAACCATTTTATGAAACTGAATTACAACAAATGAC 

ATTATATCTMCTCTTCCGGGTCTCCACAACACTTATACTTACTTAAGCAGCTTAAACAC 

TTCCGAGTCTCCACAGCACTCTGATACTTACTTAMCAGCTCTTTTAACCTGCCCTAGTA 

TTCTTAAGTGCAGCATATCTAA I I I I I I I I I I CCTCAAGTAGTTTGAA 

Sequence 1191 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCAAATGAACGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAA 
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' Sequence 1192 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCNTCCACCACTTTATAGAGGGTGNAAAAATAAACCAGAAATNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCC 

GATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGA 

Sequence 1193 

TGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCT 

GAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGGAAAAATGCAACAAGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATT 

CCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAG 

ACACTGGCCATACCACTGGACAG 

Sequence 1194 

NGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACATAAATCACCTGGAACCTTG 

TTAAAATGCAGATCCTGACTCAGGAGGTCTGAGTTAGAGCCCAGGATTNCATATTTCTAG 

CCAGCTCCATGATGAGCTGCTGGTCCGCAGATCATGCTTGCNGGTTTTGACCAGAGTCAG 

TGTTGGTTANAGTAAGAGGATGAGGCANACATNTGGGAAAAGTCCAGCTGGGGCAAGCAT 

TTGAAGTCTGCCTTCCTACCANGTCAAAATCAAGGCAACGACCTTCCATAGATAACTATC 

AAAGCTTGAGGGGGNGCCTTGAACCCAACTCCTAAATCCCTAAGACCTGCCCACCTCTTG 

TGTCTCCTGTNTNAGCAAACATTCCCACACTCTTGCATATTGTTAAAAGTAACCTCTGCT 

TACCAGGCTTTTG 

Sequence 1195 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGGCCCGGGGGGTGTCCGAACAAGGCAGGTTG 

GTGGGTTAAGGTCTTAATCTTGACTCGAGATCTCTCTCCGGAGTTCACAGNGTNGGCGAC 

GAAGCCGAAGCAGCTGGAGCGCGACCCGGAGGAGTCTGAC7TCTCGTTGTCTTCATAATT 

TTCATTCGTTGCTTTCTTCGTGGACTTGCGGCTGGGGGAGGATCCCCGCTGGTCGCCGAG 

CAGGCGGGCGGGTAAAGGTAGGCCGCCGAGAGCGAGGTTAGGAGAGGAGAGGAGGCCGCA 

GTACCT 

Sequence 1196 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGGAGGCGGAAGCGC 

AGCGGGGGCGGGAAGGTTGTAGTGCCGCGAGTTGAGCTCCTCTTGCCTAAGTGGTCGCGC 

CCCCTTTAAGAGCAGCGATTGTAAGGAGAGGCGGTCCCGGTGTCCTCGGGTCCCAGGTGA 

TTGTGAAGTGCTGACCAATTGCCACTGGACATACTTGAAACAAAATAGGAAAATGGCAGC 

AAACCCTGTCTCTAAATCAATCAATCAAGCGAGCCAGAATGCAGTAGTGGCCTGAGAGAG 

GCATCCTGGMCGCAGTGCGGTCTGGCTAGGCTTAGMGTATTCATGTGATTTTTACCTG 
ACAAGGG 

Sequence 1197 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGG 

Sequence 1198 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGAGCG 

AGCTTCGGAGAAGCAGTGGTGGGTTCCATGTGGTGGTGGAGTAGGAGGCAGGTCTCCGCG 

GTGGTTTCCACAAGAAAAATGGCACAATGTTTCTCAGAAGACAATTACATAAGAATCAGC 
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' ATACTTTAAATTCACAGCAAATAATCAGACAATTGATGAAAATACTTACCCAAACACTAA 
TTGTAGACTGTGCCTTCTGAATATGTTTTGTCATAAACTTGGAGTAAGGAATCCTCACAG 
GCACTGGACMTTCAAAMACGTAAAGTTTGTTTGTTAGMTACCTGGGTGCTTTTGGAT 
AGAAACCCTCATCCATATCCTGGTAAGGCTTGAAGTTGCACAGGAGTTTTCATTTGTCAA 
AACCCAGAAAACCATAAGCTTTAGATTTGGG 
Sequence 1199 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAMATGCGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTTTGA 

GAGGGGTTTGGAGTCTGGAACCNTNATCCTTTCACNTTCAACTTGAAATGGGGAATGAA 

Sequence 1200 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAANTGAAGTGTGAAGACAAG 

GCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAATCAAGGGAGAANGAAAAG 

ATGAAAGACAAACTGCAAAAAATTGCCAAAATGCNGACTTTCTAAAAATGGAGCAGANTC 

TGAGGCTTTGCATGTCTTGGCATTCTTCAAGAGCTGAATGAAAAAATGCAACANGCAGAT 

GAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCNTTCAGCATCAAGCTGGAAT 

GGGGAATGAAGAATATAGATGTGGTGCCCACTAGGNTNCTGCTGAAAGGGAGNTGAAATT 

CCTCCACCAAGTNGGTATTCAAAATA 

Sequence 1201 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACGCGGGGGTAAC 

TGAAAATCCACAAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGC 

AGAACGGCTGCCTAATTTACAGCAACCATGAGGCCACTTAAGGATGCAGCAAGAAGGAGC 

CATCTGCAATCCAGGAAGAAATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTA 

GGACTTCTAGCCTCGAGAACTTACAAATGGTGATGATCAT 

Sequence 1202 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCATAAANNAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCANAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

NATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGG 

AATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAA 

ATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAAG 

Sequence 1203 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATAAAATAATA 

CATATG AAGCC A CAATA TT CAATT ACAGTTCAAACTAGAAATTACTAACTACATGTAGCT 

GCTTCATTmCGTT TTGCA TTTTGGCCCCTMTTCATATT^ 

GAAAAAGATCCAAACTTTTAMCTTTGTATGTTTTGTGGAGGGTGCACMTTTCTTCTAA 

TATATCTTCAGGTGTTTTAAMTTTMTTTGTTTTTAATCATAAGATATCATCATGGCCA 

AGAGACTGGGAAMTMCMTTTTATTCTTTCTCCTMGATTGNGATTTTATTATTCCAA 

GATCTTATGCTTGAATTACTTAGCAAGAAGGCATGATTATGCANAAGACAGGGAAATGAA 

GAGAAAAGAGCGGGAATATACGAAAATGAAGCTTCCTTAACAGAGTTCATGGTGGAGATG 

GTAGACACTGGTGGAGI I 1 1 1 I I CCAG ACTTAA 

Sequence 1204 

TCGAGCGGCCGCCCGGGCAGGTACACTCTAAAGAAAGCCATGAGGATGATAATCCACTTT 

GATACT TCCAATCT GCTGGTCTTGCTGAACTCTTTGGATCATGGATATCATAAGTTCGAC 

AAAATAT7TTTTTTGTAGAAGCACAAATGTGAAGNGTCACTCGTTCTGAGACTTCCTCCT 

CTGTGAAATTCCACAATCTCTTTCTATTTATAGACTTTTCCACAGCAAACATTAGTCTAC 

GCAGAGCATTTTGAAAATCATTTGCCAGTTCTAAAGTAGTAATAATAAATACTCCAAGAA 

CTAAAAGTCCCCCTGGTAGCATTCTGGATACCTGGCAGGCATGTTCTGTGGCCCATTC 
Sequence 1205 
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NNGNCCTTTCGAGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCC 

ACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAG 

ACAAACTGTAMAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAAT 

Sequence 1206 

AGCGTGGTCGCGGCCGAGGTACATAAAACATTATTCCTTCCTTGGCCTAAAAACTCATCG 

CCACCTACATTAAAGCTMTATGCCTGATTACTGTTTTTAGAGMCTTATTTTATTAGGG 

CAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGNTAGCTACATAAAAATGCACCCT 

AGACCCGAAACTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTCCACGCAGTCCCTG 

GTCAC ACTTGAAG CAG TCCG GAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAAC 

TTACACI I I I I 1 1 I MTCTTTTCCTACAACTTCATATTTTATAAATA 

Sequence 1207 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATCTAGGTCAGTTTAAGAAGAGTCAGCTCAGA 

GAAAGCAAGCATAAGGGAAAATGTCACGTAAACTAGATCAGGGAACAAAATCCTCTCCTT 

GTGGAAATATCCCATGCAGTTTGTTGATACAACTTAGTATCTTATTGCCTAAAAAAAAAA 

TTTCTTAT CATTG TTTCAAAAAAGCMAATCATGGAAAATTTTrGTTGTCCAGGCAAATA 

AAAGGTCATTTTMTTTAGCTGCAATTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTC 

GCCTGATGTCA TTMG C ACCATC CAAAAAGTCTGCTTCATAATCTATTTTCAAGACTTGG 

T GATTC TGAMGTTTTGGTTTTTGTGACTTtGTTTCTCAGGAAA^ 

MTTTTAAGTCTATAATTCMTTTAAATATGTGTGTGTCTCATCCAGGATAGGATAGGGT 
TGTCTTCTATTTTCCATTTTACCTAT 

Sequence 1208 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCAC 

CACTTTATAGAGGGTGTAAAAATAAACCAGANATCAAGGGAGAAAGAAAAGATGAAAGAC 

AAACTGCMAAAATTGCCAMATGCGACTTTCTAAAMTGGAGCANATTCTGAGGCTTTG 

CATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCT 

GAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGA 

AGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCA 

AGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGG 

Sequence 1209 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGNCAAAATG 

CGACTTTNTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTNTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTNAGCATCAAGCTGGAATGGGGAANGAAGAATAGAGATGTGGTGCCCACTA 

GGCTACTGCTGAAAGGGAGCTGAAATNTCCTTCCACCCAAGTTGGTATTTCAAAATATGT 

NATTGACTGGATANGGGCAAAAAGGATTTGGACTAAGACACTGGGC 

Sequence 1210 

GCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAACGGCCATCCACCACTTTATAG 

ACGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MMTTGCCAAMTGCGACTTTCTAAAAATGGAGCATAATTCTGAGGCTTTGCATGTCTT 

GGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCNGATGAAGACTCTGAGAGGGG 

TTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGA 

GATGTGGNGCCCACTAG 

Sequence 1211 

CGGAMNTTGGGGGGCCCCCTTNCTTAAGAAAAGGCCATTGGCTTNCCGAAGGCGGGGGC 
CCCG CCCC AAGTTGGTTGGMNTGGGGGATTATTNCTTTGGCCAAGAAMNTTTCCGGGG 
GGNGTTTTNAAGGGNGGGGGGGGGGGCCCCCCCGGGCCCCCCAAAAGGGGTNAACCCCCC 
TGGGGGGNAANAGGGGGGGAAANNTTTNNNAAACCGGNNAAAAAACCCCCCCAGGGGGGG 
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CCCCCCGGGGGGGGAAAAAANCCCCCNGGGGGGGAAACCCCCCCGG 
Sequence 1212 

GACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAMGATGAMGACAAACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAAGATGGTGGTGCCCACTAGGCTACTGCTGAAAGG 

GAGCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAA 

AGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAATTGCT 

GGGTCTTGAGAGAGCCCAAAGGAGTTCTGGGAGAGGGACCAGATGGGGGGGTA 

Sequence 1213 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

G 

Sequence 1214 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAG 

TGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGG 

GAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGTCAAAATGCGACTTTCTAAAAAT 

GGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATG 

CAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAG 

GGAGCTGAAATTCCTCCACCAAGTTGGTATTCi(\AAATATGTAATGACTGGTATGGCAAAA 

GATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAACACCTGAAT 

Sequence 1215, 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGA 

AAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1216 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACCTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

AGCTGAAATTCCTCCCCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGT 

CTTGAGAGAGCCNAAGGAGTTCTGGGAGAGGGACCAGATGGGGG 

Sequence 1217 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTG 
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GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGTTGGTATT 

CAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGG 

CAGGGTTATGTTA 

Sequence 1218 

CCGCGGTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC 

CAGTTCCTTTGGCTTCCAGTTTTGAAGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGNGTTCTCAAA 

CCCGGNATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTCAAG 

AATGTGTCTGTCAAGGATGTTCGTCGGGGCAACNTTGCTGGTGACAGCAAAAATGACCCA 

CCAATGGAAGCAGCTGGCTTCACTGCTCAGGGTGATTATC 

Sequence 1219 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCTGATGCTACAGACGAGGACATCACCTCACA 

CATGGAAAGCGAGGAGTTGAATGGTGCATACAAGGCCATCCCCGTTGCCCAGGACCTGAA 

CGCGCCTTNTGATTGGGACAGCCGTGGGAAGGACAGTTATGAAACGAGTCAGCTGGATGA 

CCAGAGTGCTGAAACCCACAGCCACAAGCAGTNCAGATTATATAAGCNGGAAAGCTTATT 

GATTANMGCMTGNGCNTTTCCGATNTGATTGATNNGTNAAGNAACTTTTTNAAANGTN 

ANCCCTGMTTNCCNNNACCCAATTAAATTTTTNCNANCCCCTTTAAAAAATTTTNCTNG 

GGNTGGGGCCCCCCCNAAAANTTAGGGNANAAAAAATATNAAANCCCCNNAAAATATTTT 

NNNTTNTTTTCTAMAAAAATAAAAMCCCNCCC 

GGGAAAAAAAATTCCAATTTTTNCCCTTTTCNTTTTANCCNAAAAAAAAAAAAAT 
Sequence 1220 

CCGCGGTGGCGGCCGAGGTACATTGGCACGTCACGATGTCTTGAGTTTCATTCACTAGGT 

GGCAGCCTGCATCGTTCCACTGCAAATGACTGAAATCCCAAAACACACAATGAGGCTGGC 

TCAGGTTTGACTCTATCTTGGAAAAAAATAGGAAAACTTCATTTATGGAATAGTTTTGAA 

TAACCGTGGATATCACAGGTCCATTGACCTGAGCATTTCCATTTTTGGAAACGGGTAGAA 

TGTTCCCCAGAGTCAACGAGGCCATGCTGATAATAGTTTCTGGAAGGGATCTCTGGAATT 

GGTCTGACCCMTTMCACACGGCCTCTGATGGGMTAGATGTATTTTGGGGACACATTT 

TAATCTGATAGCTGTMCCCCTTTTGAGTTGGCTTTTGTTCACTGGMTCCCTTTCCAGT 

CA 

Sequence 1221 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAAtGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCATACC 

Sequence 1222 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAAATTGCCAAMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAGGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCCCAAGG 

TTGGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATTGGGACTAAGACAC 

Sequence 1223 

CGACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAA 
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TGAANGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAA 

TCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCT 

AAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCNTTCCTTCAGGAGCTGAATGAA 

AAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCT 

TCAGCATCAAGCTGGGAATGGGGG 

Sequence 1224 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAAAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCA 

TGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGA 

GAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAG 

AATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAG 

TTGGTATTCAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATAC 

CACTGGACAGGGTTATGTTAACACCTGGAATTGCTGGGTCTTGAGAGAGCCCAANGGAGT 

TCTGGGGAGAGGGACCAGATTGGGG 

Sequence 1225 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

MMTTGCCAMATGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAACAGAA 

GATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGGTTGGG 

TATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGGACTAAAGACACTGGCCATAC . 

CACTGGACAGGGTTTATGTTAACACCTGAANTGCTGGGGTCTTGAGAGAGCCCAANGAGT 

TTNGGAGAGGGCCAGATGGGGGGGTAG 

Sequence 1226 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTT 

AACACCTGAATTGCTGG 

Sequence 1227 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCAGGAAGTGAAGTAAAA 

CCTGGTCTTGGTTGATAGGCCCCAGGTTGGCTTGGAGCCATTCCAGGTTGAGAGGCAGGA 

GCCACAGTATMTTAGTAGGCTGAGMGTTTGGGCAGTGTAAGTTTGTGCAGGATAATTG 

CTCGCCTGGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAGTTGCTCATC 

AGCTTCTGGAMGCAGCTTCACCTGTCCTTTTCCCCAAGAAGCTGGGCAGCTCCCGGGTC 

AGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGMCTTGTCACCCTCTTTGCCCGAGTAC 

CTGCCCG 

Sequence 1228 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAtCAAG 

CTGGAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGAT 

TGGACTAAGACACTGGCCATACCACTGNCAGGGTTATGTTAACACCTGAATTGCTGGGTC 
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TTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAAATGGGGGG 
Sequence 1229 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATNAACCAGAAATCAAGGGNGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

GGACT 

Sequence 1230 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGTCTTC 

TAGTCCGGTAAACAGAGGGCCTGCCCCCGACAGCTTCTGCTTCCGGGTCACGCCTTGACA 

GCGGCTTTCMCCCCCACCTCAGCCCAGCAATTCGTTTGGAGCATGTGAACACCTTGAGC 

CTTGATGAGTTCCAGTNTGTGGTATATTATGCAGNGCNTTCAGNGAAAATNCTTTTTNTN 

CGGGNNTTNAANNAAAAANANTNGGGTGCCATGNTNTTNCCCCCNNNNNNTNGGGGGGGG 

GCCCCCCTCAAANGGGGGGGG GGNACT ATNANNNNCCCTNTTTTTTGGGGNNCNANNTNN 

AC NCCNT T TNNTT NGGGCCCCNTTTTTTGGGGGNAAAAAAACCCCCCCCCTNNGGGGGGG 

GTATTTTCNTTTTNNGAAAAAAAAAGGCCCCGGGGNNGACCCCCCCCCNGGTGGGNTTAN 

ANAAAAAAAAAAAANTCNCCCNNTTN l I 1 1 I I I I I I IAAAA 

Sequence 1231 

AGGTACGCGGGGCTTTTCCGTGCTACCTGCAGAGGGGTCCATACGGCGTTGTTCTGGATT 

CCCGTCGTMCTTAAAGGGAAATTTTCACAATGTCCGGAGCCCTTGATGTCCTGCAAATG 

AAGGAGGAGGATGTCCTTAAGTTCCTTGCAGCAGGAACCCACTTAGGTGGCACCAATCTT 

GACTTCCAGATGGAACAGTACTCTTGGAAGTCCACCTCGTTGTCCCTGTTGCTGTCCAAG 

TTGCTCATCAGCTTCTGGAAAGCAGCTTCATCTGTCCTTTTCCCCAAGAAGCTGGGCAGC 

TCCCGGGTCAGCAGCTCCTTTAGTTCTGACTTGTTGAGCTTGMCTTGTCACCCTCTTTG 

CCCGAGTACCTGCCCG 

Sequence 1232 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAAA 

CMAAAGGAACCAGAGGCCACTTGTATATATAGGTCTCTTCAGCATTTATTGGTGGCAGA 

AGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAAT 

TGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCAACTA 

AAATGCCATTTCACC 

Sequence 1233 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAANTGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGNGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAG 

GCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAA 

GACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTNATCCCTTCAGCATCAAGCTGGAATGGG 

GAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNCT 

CCCCAAGNTTGGTATTCAAAATATGTAATGACTGNTATTGGCAAAAA 

Sequence 1234 

GCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAATTGCCAAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGA 

AATTCCTNCCCAAGTTNGGTATTCAAATATGTAATTGCTGGTATGGCAAAAGATTGGACT 

AAGACACTGGCCCTACCACTGGACAGGGGTTATTNTTTAACCCCCTGAATTTGCTTGGGT 
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CTTTGAGAGAGCCCCAAGGGGGTTTTGGGAGAGGGGACCCANAATTGGGGGGTAGGTC 
Sequence 1235 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTTGAAAGG 

Sequence 1236 

TGGA CTCCACCGCGGTGGCGGCCGAGGNACAATACACTAGAAACCAACATAATGTATTTT 

TTTTAAAACCTGTGNGAAAAAATAAATGTTCCACNAGTAGGGATAGGGGAAAAGNAACCA 

AAAGAGAG AAAGA GAAAGGAATGCTGGTTNATCTTTGTANGTNGTAATCGAATGGAGAAA 

TTTGCAGTATTTTANCCACTATTAGNG AAA I I I I I 1 1 I I I I 1 1 1 GTCAAAATGANAGACT 

GGAACCTCTGTTCAANATGCTTTNATTGNMCTCTGGTTTTGAAGACCGGGNNNGGNAAA 

GCAANNAAAAACGTNGGGAAACCCTNNGATGGACNTAAAGGGGCNNTGGNNGCCAAAGGG 

ACCTT GGGG GAAMNGGTCCACTTTGAATANANMGCATGGGGNGGGGNGNATTTTTCCC 

CCCCCTTTTAAAAMNATGGTNTGGMTMTTTTAMNNGGNATATTAAMACCACCTTT 

NTT 

Sequence 1237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGTGATATCCACATAT 

TTTCGATAAAAATTCCAAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTA 

TACCACCATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTC 

TAAMCAGCACAGCAGCTMTGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTT 

CACACGCCTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGT 

CCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCC 

TCTCTGGGTTGCTGTTTCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAG 

CGCCGAGGCTTTGGC 

Sequence 1238 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACNNTGGQCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAG 

CAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAAC 

AAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAA 

GCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGAT 

T 

Sequence 1239 

AGCTCCACCGCGGTGGCGGCCGAGGTACGCGGGGGGCAGAAGAGGAAGATTTTTGAAGAG 

TGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGTGA 

GMCMTAAGAATTCCTTGGAGAGCAGCCTACGGGNAACTAAAATGCCATTTCACCTGGA 

ACTTGATGGANGGGAGAAAACTCCTTGGMTGATTTTGAAGACMMGTTATTTTTTACC 

CGGCACTGAAGATTTNCAGCAATCCGTTGGAATNTCAAAAGGCCCACCAAANGGTGCCAA 

CNCCTACATGNGCNCTTATCNTAAANAGGCACCCCTCCAANAGGGGGNCNAATAAACGNA 

GGGNCNAAGNNNCCCTGGGNAAATTGGCCTTTACNGGNTAAAAAAGGCCTTGGANNGGAG 

GTTTTAAANTNCCTACGGCCAAAGGNAAGCCATTGGCNTGNACCCCAAGGGCCAAGGAAA 

AATCCANNTAAAAAGNTTCTTGGGNGTCCACCCCNTGGGG 

Sequence 1240 

AGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGG 
CCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGA 
TGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTG 
AGGCTTTGCATGTCTTGGCATTTCCTTTCAGGGAGCCTGAATTGAAAAAAA 
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Sequence 1241 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCNGGTACGCGGGGGAGACATTCCTCAATTGCTT 

AGACATATTCTGAGCCTACAGCAGAGGAACCTCCAGTCTCAGCACCATGAATCAAACTGC 

CATTCTGATTTGCTGCCTTATCTTTCTGACTCTAAGTGGCATTCAAGGAGTACGGGAAGG 

CGMGAAMGMTAGAGAAGATAGGGAMTTAGMGATAAMACATACTTTTAGAAGAAA 

AAAGATAAATTTAAACCTGAAAAGTAGGAAGCAGAAGAAAAAAGACAAGCTAGGAAACAA 

AAAAGCTTAAGGGGCAAAAATTGTACCTTCGGCCCGCTCTAGAACTAGTGGGATCCCCCG 

GGCTGCAG 

Sequence 1242 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTG 

AAGACATGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGA 

AAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAG 

Sequence 1243 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCAT 

CAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTANGCTACTGCTGAAAGG 

Sequence 1244 

CCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATA 

GAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAA 

AAAATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCCTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGTAATGACTGGTATGGCAAAA 

Sequence 1245 

TGGTACTGCTAMGTCATGACAGCCCMCAGGTGATGTTTTACTGGATGAAACTCTGAAA 

CACATCAAAGCAACTGAACCCACAGAAACTGTCCAAACATGGATAGAGCTACTCACTGGT 

GAGACCTGGAACCCCTTCAAATTACAGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGT 

TCTCAAACCCGGTATGGNGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATC 

TGTCGAAATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAA 

TGACAAGAATGTGTCTGTCAAGGATGTTCCGTCGTGGCAACCGTTNCTGGTGACAGCAAA 

AAATGACCCCACCAA 

Sequence 1246 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTG 

Sequence 1247 

AGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGA 

AGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA 

AGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGC 

AGATTCTGAGGCTTTGQATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACA 

AGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATAGNAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 

CTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 
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GGACTAAGACACTGGCCATACCACTGGACAGGGTTTATTGTTAACACCTGAATTGCTGGG 
GTC 

Sequence 1248 

GAGCT CCCCGCGGTGGCGGCCGNGGTACATAAAACATTATTCCTTCCTTGGGCTAAAAAC 
TTTTTGCCACCTACATTAMGCTM^ 

TTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACATAAAAATG 

CACCCTAGACCCGAAACTTACTAGACTCATTATAAMTTTTNTTTAAGGTGTCCACGCAG 

NCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATNAGCCCTACCCCAGTAATCCCCAGA 

AGGAACTTACACI I I I I I I IMTCTTTTCCTACAACTNCATATTTTATAAATAAAAAGAC 

ANAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCAT 

Sequence 1249 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGAGTGG 

GAAGCTCGCAGCAGCTGGGGAGGAGCCAAAGCCTCGGCGCTCACCTAAGCCGCAGGGAGA 

TACACCCAACTGGGAGATGAGGAAACAGCAACCCAGAGAGGAGAACTAACCCACACAGGA 

TCATTTCGTGAAGGAGCAAGGCTGMGAACCAGACCTGGACTTTCTTAGGACAAACTTAC 

TGCAGCTTGAAGGAGCCMCCATGGATTTGAGGCGTGTGAAGGAATATTTCTCCTGGCTC 

TACTATCAATACCAAATCATTAGCTGCTGTGCTGTTTTAGAGCCCTGGGAGCCGATCTAT 

GTTTAACGCCATCTTACTMCCATTATTGCTATGGTGGTATACACTGCCTATGTNTTTAT 

TCCAATCCACATTCGCCTGGCTTGGGAATTTT 

Sequence 1250 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAAGGC 

CATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAT 

GAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCC 

TCCACCAAGTTGGTATTCAAAATNTGTAATGACTGGTATTGGCAAAA 

Sequence 1251 

CTNCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAA 

GACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAA 

GAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCA 

GATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAA 

GCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAG 

CTGGAATGGGGAATGAAGAATGGAAGAATNGTGGTGCCCACTAGGCTACTGCTGAAAGGG 

GAGCTTGAAATTCCTCCACCAAGGTTTGGTATTC 

Sequence 1252 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATCTTGAGAAGGATTT 

GAAGGACAAGTTTGTGGCCCTGACCATAGATGATATCTGCTTCTCGCTCAACGACAACTC 

ACCAAACATCAGATATTCTGAGAACGCCGTGAGGATTGAGCCAAACTCCGTGAGTCTGGA 

AGACTGGTTGGACTTCTCCAGCACCAATGTGGAGAAGGCTGACAAGCAGCGGAACAACTC 

CCTGATGCTGAAAGCCCTGGTGGATCGAATCCTGTCCCAGACAGCCAATGGATCTGTGCA 

AGCCAGTGTGATTGTGGTGGACACCGGCATTCAAGAATGGGCCTGAAGGGATCAAAGGGA 

TGCCAGGGACAAGCTGGGCTTGATCATCTGGCCCAAGGTATTNGGAAAGAGATTGCTTCC 

CAGGGAAGAAAA 

Sequence 1253 

ACTNAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGAC 

AAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAA 

AAGATGAAAGACAAACTGCAAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGA 

TTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGC 

AGATGAAGACTCTGAGAGGGGTTTNGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAANGGNAGATTGGTGGTGCCCACTAGGCTACTGCTGAAAGGGAG 
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CTGAMTTCCTCCCCCAAGGTTGGTATTCAAAAATATTGTAATGAACTGGGTATTGGCAA 
Sequence 1254 

CCGCGGTGGCGGCCGNGGTACAATGATTGTCATCTCAGTAAAAGGTCTATTATCTAACTT 

GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAMGCM7TAGAGTMGCAAGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCC 

TGGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAA 

TAAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAG 

GCTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCAC 

CAACTTGCCCAGGTATTCTCCCCGGTCTTGA 

Sequence 1255 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGG 

GGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGCTCTGGGCAGCAGCAAGAAT 

TCCTCTGCCTCCCATCCTACCATTCACTGTCTTGCCGGCAGCCAGCTGAGAGCAATGGGA 

AATGGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCANAAGCTGGGTTGGTTTATCCAGGAA 

TACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAGATGGTGAACACCATCTGG 

GACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGNGCANGGAGTGGCCATAGGNGGCTCC 

TATGGACGGAAAAC 

Sequence 1256 

TGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACAANGGCCATCCA 

CCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGA 

CMACTGCAAAAAATTGCCCMAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTT 

TGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACT 

CTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAAT 

GAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCT 

Sequence 1257 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAA/,GAAA 

AGATGAAAGACAAACTGCAAAAAATTGCCCAAAATGCGACTTTCTAAAAATGGAGCAGAT 

TCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCA 

GATGAAGACTCTGAGAGGGGTTTGGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTG 

GAATGGGGAATGAAGAATAGAAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 
AAA 

Sequence 1258 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

AACAAGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGAAGATTGTGGTGCCCACTAGGCTACTGCTGAA 

AGGGAGCCTGAAATTCCTCCCCAAGGTTGGGTATTCAAAAATATGTAATGACTTGGTATG 

GCAAAAAGATTGGGACTAAAG 

Sequence 1259 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGTTT 

TGGCACACT7TTCCTGACAAACAGCCAGTGTTCTCAATACATAAATACTAGTCCACGTTA 

ACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTG 

GAAATACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAGGA 

GATGGAGAGCAATTCTTTAAAGATAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATACAA 

GCCTCTTAAAATGGGACGCATTTGCCTCGCGCCTCTGGGGTGTCTGCAGCTCAGCNTTGG 

TGCCCCACACGGGACACCCGACTTTT 

Sequence 1260 
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TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTCAATGTCATT 

MCCATAAATGAGCATTTACAATCTGGATTAAATGTCACATGGTATTAAGTCTACACTTA 

GAGTMTGCTTTTACTGATTTTTAAAMTATATGCATATGTTTAGTGATCGAGAAAAGTG 

AAATACTGGAGTAC I I I I I 1 1 I I I I I I I 1 1 I I I I I GGCTTGATGAGTAGGTGAGTTTATT 

GGGACTTACACACAGGTCAATCCTGGGCGGCGACAAGACAGCTCTAGAGATCTGAGCTTC 

CTCCCMTGCTAMCTGCTTTCATGCTMTTTTCTGACTGTTTACTTACCCGGGGTAAGA 

GCGATGGGGACTGTTTTCATTGG 

Sequence 1261 

TNCTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGT 

GTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGG 

AGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATG 

GAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGC 

MCMGCAGATGAAGACTCTGAGAGGGGTTTTGGAGTCTGGAAGCCTCATCCCTTCAGCA 

TCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACTAGGCTACTGCTGAAA 

GGGAGCTGAAATTCCTCCACCAAGGTTGGGTATTCAAAATATGTAATGACTGGTATGGCA 

AAAGATTGGG 

Sequence 1262 

TGGCGGCCGCCGGCAGGTACGCGGNGCAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCC 

TGAACCGAGCCCTGCCGAACAGCTGAGAATTGCACTGCAACCATGAGNGAGAACAATAAG 

MTTCCTTGGAGAGCAGCTACGGCAACTAAAATGCTTTCACCTGGAACTTGATGGGAGGG 

AGAAMCTCCTTGGATGATTTTGAAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCG 

TGAATTCAAAGCCACAATGTGCAACCTACTGCCTATCTAAAGCACCTCAAAGGGCAAAAC 

GAGGCAGCCCTGGAATGCTTACGTAAAGCTGAAGAGTTAATCCAGCANGAGCATGCTGAC 

AAGGCAGAAATCAAAAGTCTGGTCACCTGGGGAAA 

Sequence 1263 

CGAGGTACCCAGGCCTGCAAATCTCCTGGCAGGATGGTCAGGAACTGCTCTAGCACCAGC 

AGTTCCAGAATCTGTTCCTTGGTATGGATCTCTGGCTTCAGCCACTGACAGCAGAGCTCC 

CGGAGGCGGCTCAGTGCCTCGCGAGGTCCAGGAGAATCCTGGTAGCAGAACTGCCTAAAA 

AGCTGCCTGCAGAGCTCCTGCTTAAGGAGTTCACTTCTCTGTAAGCAAGTGTCCTGCCCA 

TGGATAAATTGTTCCTCTTCTATCTTCACTATCAGAAGTCCTTCATCCTCTGGAGAGTTC 

TGGGCTGCAGCTTTCTTTGGTTCTGTAGCCATCTCTCGGACAGGGCTGATTCCGATCGGA 

CACTTCCGGTGGAAGGACTGAGCGGCGCTACACTTCAAGAATTCCGTCCACAGGGACTTG 

TGAGTCTGCGCAGAAGGCGGGATGCCTTTGGACTACGATTCCAAGAATCCTTCCTGGGTC 

TCTTCGGGCGCAGACTTTTCGCCAAAGTCCTGAAGATCTCAGGGCTTGAAGGAGGGGGCA 

TCCTTCTTCTTCATTGNAGTAGTGTGTCTTGCTAAATAACAGAAGGGACTCCTGAAAAGA 

AAATGACGTTGGCCCGGGCGCGGGGGCTTACGCCTGCAAGTCCACACTTTGGGAGGCCGA 

ACNGGCNGATCACGAAGGTCAGGAGATCGANGACTATCTGGGTTACGCGGNGACACCCTG 

NGTTTCTTAAAATCCANAAAAAAAAAAAAAAAA 

Sequence 1264 

GGCGAATTGGACTCCACCGCGGTGGCGGCCCGAGGTACAGAGATTTATAATGTGCTGCTC 
TAGGTCCTATCGGGTAAAGGGATCAGCAGATGTGAAGTCAAGAGTCTCCTGTAAGATTTG 
^?ZTZ?T^^' G AAAC AT ATTTTAATCCTG G G C CTCCTISfTTTC AA ATC ACCT ATTTCTTTT A 
GTTTTTTGCAGTGATACTGTGTGTTGCTTCT^ 

TCAAGTGTCTTATCCTAAAAGTAAAACCTAGATGATCTAAGGTGGTGGNTTTCAACAGGG 

TGCA AATTTGCCTCCTATACTCGCAACACCCAGNGACAGTTGGCTATGTCTNGGAGACAT 

TTTTGNGTTNTCACACCTGGANNNAGGGTGGGGGAGGTGGNGCTAATGACANCAAGNTGG 

CCNTAANCCAATNATGCTGATANAAATNCTACANTGCACAAGGATAGGNTCCCACANAAC 

ANAAGNCTTANCCAAACCCCAAATACTAACAAT 

Sequence 1265 

CCGCGGTGGCGGCCCGAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTT 
GCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCT 
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TGAAMGCAATTAGAGTAAGCMGA7TAGTTTTTCCTCCTTCCAGTTCCTCAGCAGGCCT 

GGCTGAAGGCCCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAAT 

AAGAAGAATGCCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGG 

CTTCCATTGCTCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACC 

AACTTGCCCAGGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATC 

TTCTCCATACCAGCTTCCTTAAGCAAAAGCAATCCTCTCTTTGCTTCCTCAAGGGGCAGC 

ACAMGGATGTTTTGGCTGTGTGGAAACAGAAGCCCGCATTTGTAGTTGCACTGGCGAGT 

GAAGTGATAGTTGACGCTGGTTGGGGTGGT 

Sequence 1266 

CCGCGGTGGCGGCCGCCCGGGCAAGGTACTTGCTAACTTTGACGCCAGCATCTCTGAAAG 

ATCCCCATCGAAGGCCGGTCATTGCAAATACAGGCTGTTCCTTTTCACCCTTGATCTGCA 

AGACATCAAGTGGAACTGTCTCTCCTTTCACAATGGCAAGTGTGGCATCAGTAATATGTT 

GGACTTTGTTTCCACTTTCGGCAAAGAGGGTATGACTCAAACTACTGGTCTCTCCCAGTG 

GGATAAATCCAATGGGAATCTTACTGAAGGTAGCCTCATCTGTTCGTCGAAGAACACCAA 

GTAACAACCTNCTGCAGTGTCCCATCTCCTTCCTGNAACAATGGATCACATTCCGGNGTT 

TTCCATCAGTTNCMGGAGGTTTNTTNGGCTTGGGCCCTTAANAAATCTGNGCTTAAACA 

AAAAGGCCCNATTCCTTCCCAAATAAAAANGGAAAAANTCGGGGGCCNCAAI I M I I I I 

Sequence 1267 

CACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGT 

GAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAG 

AAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGA 

GCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAA 

CAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCA 

AGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGA 

GCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGA 

TTGGACTAAGACACTGGCCATACCACTGGACAGG 

Sequence 1268 

GGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAA 

ACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGC 

GACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 
CACCTGAATT 

Sequence 1269 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTCTACAG 

AGATCCTGAAGAGA7TGAAAAAGAAGAGCAGGCTGCTGCTGAGAAGGCAGTGACCAAGGA 

GGAATTTCAGGGTGAATGGACTGCTCCCGCTCCTGAGTTCACTGCTACTCAGCCTGAGGT 

TGCAGACTGGTCTGAAGGTGTACTCTTGGTTTATCAATGGGACGTTCCAGCAATCCACAC 

AAGAGCTCTTTATCCCCAACATCACTGTGAATAATAGCGGATCCTATATGTGCCAAGCCC 

ATAACTCAGCCACTGGCCTCAATAGGACCACAGTCACGATGATCACAGTCTCTGGAAGTG 

CTCCTGTCCTCTCAAGCTGTGGCCACCGTCGGCATCACGATTG 

Sequence 1270 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACTTTCTMAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAG 

CTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGC 

CTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATGGCAAAAGA 



# 
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Sequence 1271 

ACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTCAT 

TTTTATTTTGAAMCCATTCAGCACATTTATCTTATGTAACATGCAGAGCATATATCTAT 

CTGTATTTTTAAMTTTTCCTGTTACTCATTGATACAT^ 

AGTCCGTGAGAAACATGGCGACTCGATCAATGCCCATGCCCCAGCCAGCTGTGGGGGGCA 
GCCCATATTCCAGGGCAGTACTCAMGGTGATATTTGCTTTTTTCAATGC7TCAGGGGAA 
AAATCCTTTTCTTTA 
Sequence 1272 

TAAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAANGACA 

AGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAA 

AGATGAMGACAMCTGCMAAMTTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATT 

CTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAG 

ATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGA 

ATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAA 

TTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGGTATGGCAAAAGATT 

Sequence 1273 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGTGGTGACGTGGTTCCCAAAGATGTC 

AATGCTGCCATTGCCACCATCAAAACCAAGCGCAGCATCCAGTTTGTGGATTGGTGCCCC 

ACTGGCTTCAAGGTTGGCATCAACTACCAGCCTCCCACTGTGGTGCCTGGTGGAGACCTG 

GCCAAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACT7TATAGAGGGTGTAAA 

AATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAA 

AATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAAT 

Sequence 1274 

ATCCCCGGCGNGGCNNNNCGCCCGTTCAGGTACTCTTTGTTTTGGCACACTTTTCCTGAC 

AAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGTTAACAACAATAGCATATGAG 

ACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGACTGGAAATACCTTGGAGGGTT 

TCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAGGAGATGGAGAGCAATTCTTT 

AAAGTTAGTGQGAGGGAGGAAGCAAAGAGCTCATAAATACAAGCCTCTTAAAATGGGACG 

CATTTGCCTCGCGCCTACTGGGTGTCTGCAGCTCAGCTTGGTGCCCCACACAGGACACCG 

ACTTTAAGTGGCTGCCTTTGCAAGGCTGAGAGGCCATGGAGGGGTTGATGCCTGAAGTGT 

CAGCGCCATCTAGTGGAAACATGGGGCATGGCCC 

Sequence 1275 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCC 

ATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATG 

AAAGACAAACTGCAAAAAATTNCCAAAATGCCGACTTTCTAAAAATGGAGCAGATTCTGA 

GGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGA 

AGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGG 

GGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTNC 

TCCACCAAGTTGGTATTCAAAATATGTAATGACTGGTATGGCAA 

Sequence 1276 

AGGTACMAATTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGAATTTTCAM 

TGTAATATGCTGTTTCACAATCTTTTTATTAAATTACTTAATGATTCCAGTCTGCAAAAT 

GAGCCATAAGACTTTGCTNCTGTTTGNTATANGATNCATNTGGANATTGGGGGNGGGNAA 

ACCATANGTAAGGTTAAACCTATCCGTCACCTGCTTCATGTAAAAGACTCCACCATTTGN 

TTGGATNTATTTTTTTCTCCAGGCNACTAGTAAGMAAAAGGTGAACAAAGGTGGATTTN 

CAT CCCT NNCMANTGGGCCCTTNTGGCNCMTTCTTTTTTCANTAATCCTATGGTANAC 

CNNTTTTGGTAGATTACCTTGGTC 

A A A A ATTCTTTG G G G ATTT A A AATTT AA A N C AA A A A AA CC AAACC AAAT A AA ANTTCT AA 
TCCACCCNTGNGGAAAATTATTTTGGAAAAGGAAAAATTTTCAGGTTAAAAAACAAAGGG 
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ANTGGNTGGTTCCTTCCATTAGGTTTAAAGGGGAGGAGGNANNATTAAAAAAAATTAAAA 

AAATTGGTTCATTTTAAAACAAGGTTTNGAAATTTNAAGGGAA 

Sequence 1277 

AGGTACAAMTTATCATCATTTAGAGTTGATTTTTTTCACCAGCCCTGMTTTTCAAACT 
TGTAATATGCTGTTTCACAATCTTTTTATTAAAT^ 

GAGCCATAGACTTTTGCTCTGCTTGTTATAAGATCANTTGGAATTGGGNGGGGGGGANAA 

CCANTAGTTAAGNCTAAATCTAATCCGGTCACTTGGCTTCATTGTAAAAGAACCCCACAA 

TTGGTCTGMTTAATTTTTTGCCCAGGCACCANGAAAGNMNATNGNTGTACCAMGNTN 

GAANTACATTCCTTGGCAAANTGGGGCCCTCTTTGGCCCAMTCCTTTTTTCCAATTATC 

CTTATTGGTTAAAACCCTTTTTTTGGTTMGTNTNACCTTGGGTGGTTGGMTNTTTA^ 

GCCGGNCTTGGNNGGCCTTMTTTTGGTAAAAAATTTCTTTTGGGGGATTTTAAM 

AACC CMNMNMCCANACCAAANAMTANTTCNTNATTNCACCCCTTGGGGNAAATTNA 

TTTTGGGAAAAAGGAAAANATTTTCNAAGTTTAAAAAACCCAAAGGAANTGGGTNGTTCC 

TCCAATTANGGTNTTAAAAGGG 

Sequence 1278 

AGGTACATTTACACMTATTAACACTAAAAATCTGTGTTTTTTTAAAACACCATAGAAGT 

CAMCCACMAMCCCAGGATCTTGTTTTAAATGTGTTTATGAAGACTGCTGCTGAGCTC 

AAMGCATTGCAGGTAATCATGACCACCTAGATGAAAGCTGGATGTTTGAAAACTCCTTC 

ATGTCCAATGMTGTAM7TTTTTCACCTCATCCCCAAGGTATTCTCCCATACTTTGTTC 

TACTTTTGACCTTCTTTT TTmT TGGGNCACCTC 

TGAGGGGGTACAGGTTCTTTTTNGTGGTNTAAAAGGAATTACTTTTCATTMTGA^ 

CTCCTTGGTTTCCTTTAATTTCCCTTT 

TTTTTCNTTTTTAAGNGGACC 

Sequence 1279 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAATGAAGTGTGAAGA 

CAACGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAG 

AAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAG 

ATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAG 

CAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCT 

GGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTG 

AAATTCCTCCACCAAGGTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1280 

GTGGCGGCCGAGGTACCAANTGAAGTGTGAATGACAATGGCCATCCANTANTTTATAGAG 

GGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAA 

ATTGCCAAMTGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTTGCATGTCTTGGC 

ATTCCTTCAGGAGCTGA 

Sequence 1281 

CCGCGGTGGCGGCCGCTCGGGCAGGTACCATTCCTCTACATCCATTTGGTAGCAGAACCT 

CMGTGTAAGCAGTCAGTGTAGCATGMTATGMCTGGCTCAGTTTATCACTTCCTGTTT 

NGACCTGAAGCACCACCCAGCTATGCAGAAGTGGTAACAGAGGAACAAAGGCGGAACAAT 

CTTGCACCAGTGAGTGCTTGTGATGACTTTGAGAGAGCCCTTCAAGGACCACTGTTTGCA 

TATATCCAGGAGTTTCGATTCTTGCCTCCACCTCTTTATTCAGAGATTGATCCAAATCCT 

GATCAGTCAGCAGATGATAGACCATCCTGCCCCTTTTGTTGAAGGAACACTTGGTTGA 

Sequence 1282 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCCTTCCTTGGCC 

TAAMACTCATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTTTTAGAGAACT 

TATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCTACAT 

MMATGCACCCTAGACCCGAAACTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTC 

CACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAAT 

CCCCAGAAGGAACTTACAC I I I I I I I I MTCTTTTCCTACMCTTCATATTTTATAAATA 

AAAAGACAAAAATGTCAGGCCTGTG 
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Sequence 1283 

GGTGGCGGCGCCCGNCAGGTACCAAATGAAGTGTGAAGACCNGGCCATCCACCACTTTAT 

AGAGGGTGTAAAAATAAACCAGAAATCATGGGAGAAAGAAAAGATNAAAGACAAACTGCA 

AAAMTTGCCAAAATGCGACTTTCTAAAAATGAGCAGATTCTGAGGCTTTGCATGTCTTG 

GCATTCCTTCAGGAGCTGAATGAAAAAATGCAACANGCAGATGAAGACTCTGAGAGGGGT 

TTGGAGTCTGGAAGCCTCATCCCTTCANCATCAANCTGGAATGGGGAATGAAGAATAGAG 

ATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTAT 

TCAAAATATGNAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTG 

GACAGGNTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCNAGGAGTTCTGGGA 

NAGGNACCACATTGGGG 

Sequence 1284 

CGCGGTGGCGGCCGCCCGGGCAGGTACCCCGGGAGAGCCCGCTTCCCCCTCCTCCCTGTG 

CTGTCTGCACCCGAGGAGAGCGGCCTGCCCGGAAGTGGGCCACCATATCTGGAAACTACA 

GTCTAT GCTT TGAAGCGCAAAAGGGAATAAACATTAAAGACTCCCCCGGGGACCTGGAGG 

ATGGACTTTTCCATGGTGGCCGGAGCAGCAGCTTACAATGAAAAATCAGAGACTGGTGCT 

CTTGGAGAAAACTATAGTTGGCAMTTCCCATTMCCACAATGACTTCAAAATTTTAAAA 

AATAATGAGCGTCAGCTGTGTGAAGTCCTCCAGAATAAGTTTGGCTGTATCTCTACCCTG 

GTCTCTCCAGTTCAGGAAGGCAACAAGCAAATCTCTGCAAGTGTTCAAAAAAATGCTGAC 

TCCTAGGATAGAGTTATCAAGTCTGGAAAGATGACCTCACCACACATGCTGTTGATGCTG 

TGGTGAATGCACCAATGAAGATCTTCTTGCATGGGGGAGGCCTGGCCCTGGCCCTGG 
Sequence 1285 

CGCCGGGCAGGTACCAAATGAAGTGTGAAGACANGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGAl GAAAGACAAACTGCAAAAAATTG 

CCAMATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGG 
Sequence 1286 

TCGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAGGTAAGATATA 

CTGGAGTCACAGAGCAATATGCATTAACAGGATACAACAGTTCATAAAAACTGAGTAACT 

ATGCACACAAATTTCTTAAACAGCCACCTAAAGAGAAAATGCACAGATGTATGGTGGAAA 

CTGTATCTAACACTGAAACTACTACAGGACTCCATCAATGAGTCCAACTTTTAGTGATAA 

AAAACTACTGTACACTACATGAAGAACCATATGTTTATAATTATCCAAATAAAAATGAAG 

TTATTAAACTTCAAGATAATATGGTAATTTGCATTGMCCGATGATTTTACAAAATTCTG 

CAMGGTCAAAATTTTAAAAGATGGCTGAACAGTAATTGCAGCATCTAATAAAAACGCAG 

CTCATTACCGAGCAAACGGTTTTAATTAAAAATTCAAAAGGAATAATCCTGACAGGAGAA 

ATAMAAAATAGATGTCAMAGAAGATAAAATTATTTTCAAAGGAGTAGTAACTCAAGTT 

TTAACACC 

Sequence 1287 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCA 

CTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAA 

ACTGCAMAMTTGCCAAAATGCGACTTTCTAAAAATGNGCAGATTCTGAGGCTTTGCAT 

GTCTTGGCATTCCTTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAG 

AGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGA 

ATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGT 

TGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATA 

CCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTT 

CTGGGAGAGGGACCAGATTGGGGGGTAGGT 

Sequence 1288 

CCGGGCAGGTACAAGATAGTCATCTCAGTAAAAGGTCTATTATCTAACTTGCCAAACTTG 
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TTCACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGCTTGAAAAGCAA 

TTAGAGTMGCMGATAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGC 

CCAGGAGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAATAGCAATAAGAAGAATG 

CCATCCCATGGAGCACACCATAATTCTGGAACCACCTCTCCCGGATCAGGCTTCCATTGC 

TCACGATGCTCACGCTGGGCAGCCGCAACTCTACTTTGCAGAACCTCACCAACTTGTCCA 

GGTATTCTCCCCGGTCTTGAAGAAATGGCTCTCCACCTGAAAAGTTGATCTTTCTCCATA 

CCAGCTTCCTTAAAGCAAAAG C AATCCTCTCTTTG CTTNCTC AAG G G GCAAGC AC AAAGG 

GATGTTTTTGGCTGTGTGGAAACAGAAAGCCCGCATTTGTAGTTTGCACTGGCCAGTGAA 

GTGATAGTTGACCCTGGTTGGGGTGGGGG 

Sequence 1289 

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CAMGTATTATGACTrTAAAAACCCCATTATTGAAAAGTACACAGGAGGCAAAGTGTTTC 

ACATCATAGACTTCACTTCCAACTCCTTGGAATGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGCTTAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCAAGCTGGTTTTCTAGGCCTGT 

TAGCTGGAAGCATGGTGAGCACCATTTCTGGACGCTCAGGCCGTGTCGGGCTTNAAGTCA 

TCTTNCACCACACAGGTACCTTNGGGCCGGNTCTAGNAACTAGTGGGATCCCCCCGGGCT 

GGCAGGAAATTCGAATATCAAAGCTTTATCGATACCCGTCCGACCTTCGANGGGG 

Sequence 1290 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGTTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CGACmCTAAAMTGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGC 

TGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCC 

TCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCGACTAG 

GCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAATGA 

CTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCACTGGACAGGGTTATGTTAA 

CACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGATTG 

Sequence 1291 

AGGTCATAAAACATTATTCCTTCCTTGGCCTAAAAACTCATCGCCACCTACATTAAAGCT 

AATATGCCTGATTACTGTTTTTAGAGAACTTATTTTATTAGGGCAGTTCCAAGCTCAAAA 

ATCGCTAACT GGCA CCTTGTNGTACATAAAAATGCACCCTAGACCCGAAACTTACTAGAC 

TCATTATAAMTTTTCTTTAAGGTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCG 

GAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAGGAACTTACAC I I I I 1 1 I lAATCTT 

TTCTACAACTTCATATTTTATAAATAAAAAGACAAAAATGTCGGGCCTGTGAGCTGAAGC 

TTAGCCATTGTAACCCCTGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAG 

AAGTCTGGAGCAGNCGAAAAACCACAAAGAAGTGAAACAGCCAGGTTTCTGNCTTAACTA 

ATTAACCCAC 

Sequence 1292 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAAACATTATTCC 

TTCCTTGGCCTAAAAACTCATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTT 

TTAGAGMCTTATTTTATTAGGGCAGTTCCAAGCTCAAAAATACGCTACTGGCACCTTGT 

TAGCTACATAAAMTGCACCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTNTT 

TAAGCTGTCCACGCAGTCCCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTAC 

CCCAGTAATCCCCAG AAGGAACTTACAC I I I I I I I I AATCTTTTCCTACAACTTCATATT 

TTATAAATAAAAAGACAAAAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCC 

TGTGACCTGCACATATCCGTCCAGGTGGCCTGCAGGAGCCAAGAAGTNTGGAGCAGCCGA 

AAMCCACAAAGMGTGAAACAGCCAGTTTCTGCCTTAACTAATTAACCCACCTTACGAC 

ATTCCACCATTATGACTTTGTCCACCATTATGACTTGTTCCTGGCCTGCCCCAACTG 

Sequence 1293 
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CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

AAMTGCGACTTTCTAAAMTGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCA 

GGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGG 

AAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCC 

ACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGT 

AATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCAGTGGACAGGGTTAT 

GTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAGTTCTGGGAGAGGGACCAGA 

TTGGGGGGTAG 

Sequence 1294 

CGAGGTACCGCTGTGTCCGGGTGGGTGGTCAGAATGCCGTGCTCCAGGTGTTCACAGCTG 

CTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCCTGTG 

CCCAACTGGGTTTCCCAAGCTATGTAGTTCAGATAACCTCAGAGTGAGCTCGCTGGAGGG 

GCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTGACTGC 

ATTACACCACTCAGTATATGTGAGGGAGGGATGTGCCTCTGGCCACGTGGTTACCTTGCA 

GTGCACAGCCTGTGGTCATAGAAGGGGCTACAGCTCACGCATCGTGGGTGGAAACATGTC 

CTTGCTCTCGCAAGTGGCCCTGGCA 

Sequence 1295 

CGAGGTACCTGTGAAGACAGCTACACCTGGTTTCCTCCCTCATGCCTTGATCCCCAGAAC 

TGCTACCTTCACACGGCTGGAGCACTCCCAAGCTGTGAATGTCATCTCAACAACCTCAGC 

CAGAGTGTCAAT TTCTG TGAGAGMCAMGATTTGGGGCACTTTCAAAATTAATGAAAGG 

TTTACAMTGACCTTTTGAATTCATCTTCTGCTATATACTCCAAATATGCAAATGGAATT 

GAAATTCAACTTAAAAAAGCATATGAMGAATTCMGGTTTTGAGTCGGTTCAGGTCACC 

CMTTTCGAAATGGAAGCATCGTTGCTGGGTATGAAGTTGTTGGCTCCAGCAGTGCATCT 

GAACTGCTGTCAGCCATTGAACATGTTGCCGAGAAGGCTAAGACAGCCCTTCACAAGCTG 

TTTCCATTAGAAAGACGGCTCTTTCAGAGTGTTCGGAAAAGCCCAGTGTAATGACATTGT 

CTTTGGATTTGGGT 

Sequence 1296 

CGAGGTACAGGAGCAACCTTCTTTCCACCATTACTGGGAATTCCACCACTATTTGCTCCC 

CCAGCCCAGAATCATGATTCTTCTTCATTCCATTCAAGGACTTCGGGAAAAAGTAATCGA 

AATGGTCCCGAAAAAGGTGTAAATGGGTCAATAAATGGAAGTAATACATCATCTGTAATT 

GGTAT CAACAC ATCTGT AC 1 1 I I I i I I I I I I I I I I I I I I I ATCTAAAAGCAACATAATTA 

TTTTTCTTGCGATTTTTTCAAGMCTCTTTTTAAT^ 

AGCTTCTGATT 

Sequence 1297 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCMAATGCGACTTTCTMAMTGANAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTG 

CCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATA 

TGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCATACCA 

Sequence 1298 

CCGCGGTGGCGGCCCGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACC 

ACTTTATAGAGGGTGTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACA 

AACTGCAAAAAATTGCCAAAATGCGACTTTCTAAAAATGGAGCAGATTCTGAGGCTTTGC 

ATGTCTTGGCATTCC7TCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTG 

AGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAA 

GAATAGAGATGTGGTGCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAA 

GTTGGTATTCAAAATATGTAATGACTGGTATGGCAAAAGATTGGACTAAGACACTGGCCA 

TACCACTGGACAGGGTTATGTTAACACCTGAATTGCTGGGTCTTGAGAGAGCCCAAGGAG 
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TTCTGGGGAGAGGGACCAGATT 
Sequence 1299 

CGCCCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGT 

GTAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATT 

GCCAAAATGCGACTTTTAAMATGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCC 

TTCAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGT 

CTGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAAT 

ATGTAATGACTGGTATGGCAAAAGATT 

Sequence 1300 

CGNCCGCCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTG 

TAAAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTG 

CCAMATGCGACTTTCTAAAAATGGACAGATTCTGAGGCTTTGCATGTCTTGGCATTCCT 

TCAGGAGCTGAATGAAAAAATGCAACAAGCANATGAAGACTCTGAGAGGGGTTTGGAGTC 

TGGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATANAAGATGTGNT 

GCCCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGG 

Sequence 1301 

CNAATTGGAGCTCCCCGCG GNGG CGGCCGAGGTACAGTATGGCTTAAAAGGCTCTGCCTT 

AGATTCTAGAATCCAGAACATTTTCTCCAAAGACAATCAGGGTATGGGGGAGAAGTTAGT 

TCCAGAGAAGAGAGCGAGTCCAGGGTAGAAGGGATTCTTCTCTCCTGAGGGTCTATGGTC 

TCCCATTTTTTAAAGCAGCAGNGGTATCTATCCCACTCATGGCCTAGAGGTTGCACAGAG 

CTGTCTGGCACCCGCTTCTTTGGCTTTTCTCTCCTGACACCCAGCAATGCTTACTCAGAG 

CGTTGAAGGCGGCCAGCACCTCGAAAGAGATTCTCTGATTTTTTGTGAACACCTGGATGG 

TGAACCCATCMGGGACTTCTGGATCTCGAMTTGTTTTTCAACCCTTCGTGAACAGACA 

GAACTTCAGCTTATCCC 

Sequence 1302 

CCGGGCAGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTA 

AAAATAAACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCC 

MMTGCGACTTTCTAAAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTT 

CAGGAGCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCT 

GGAAGCCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGC 

CCACTAGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATAT 

GTAATGACT 

Sequence 1303 

CCGGGCAGGTACTACCATGCCGGGCCAA I J I I I I I I I I GTTTGTAGAAATGAGGTCTTGC 

TATGTTGCCCAGGCTGGTCTCAAACTCCTGGTCTCAAGCGATCCCCCCGCCTCAGCCTCC 

CGAAGTGCTGGGATMMGGCGTGAACCACCATACCCAGCCAGTATTATCTTTTCATTTC 

ATTTTCCAGTTGAGTATATTATTGGCTACATTTGCATACCGCACAATTGTTCATTTT^ 

MAACCAATATTTTGTTTTGTTCTGTTGTCTACMTMGGAGAATTCAGATGATAAACTT 

ACAACCAATCATGGCCAAGTCCACTTGAGGAATTGTCTCTGTAGATTTATCTGTAGACTC 
CCTAATA 

Sequence 1304 

AGGTACCAAATGAAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGAGTGTAAAAATA 

AACCAGAAATCAAGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATG 

CCGACTTTCTMAMTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGA 

GCTGAATGAAAAAATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAG 

CCTCATCCCTTCAGCATCAAGCTGGAATGGGGAATGAAGAATAGAGATGTGGTGCCCACT 

AGGCTACTGCTGAAAGGGAGCTGAAATTCCTCCACCAAGTTGGTATTCAAAATATGTAAT 

GACTGGTATG 

Sequence 1305 

AGGTATTCGACCCACGCGCCCGTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAAC 
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CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAGTnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnGCTGTTTCCTGNGTAGMNATTGTTATCnnnnnnnnnnnnnnnnn 

nnnnnnnnnNAGCCCGGGGAGNCATAAAAGTGTAAAAGCCTGGGGGTGCCTTAATGAAGT 

GGAGCTMCCTCCACATTTMTTTGCGTTTGCnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnCTGTCGGTGGCCAGCTGCATTAATGAAATCGGGCCAAACGCCGCGGGGGAAAGAAGG 

CCGGTTTGCGTTATTNGGGCCGCTC7TCCCGCTTTCCTCGCTCACTGGACTCCGCTGNCG 

CTCGGGTCNTTACCNGCTTGCGGGCCGAAGCCGGGTAATTCAAGCTTCAAC 

Sequence 1306 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTCTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTCAACTTCTTTCATTGCCCTCTT 

CCAGGAAATCCCACCCTGGGATACTTCAAAGACCTCATATGCTACAAAGATCAAGGCCAC 

CTAATGAGTGCTCTAGAGATCAGCACCAAAGATGCTTGCCAGAGTCTTCTCTATATGTCC 

TCCCTCCTGTATCAAATAAGTCACCAAGTATAGCTGACTATCTCCATAACATCTCTCCAG 

TTGTGGCCTTAGTTCCAGCCTTCATCACTCTGCATTCTGAGGAGCTGATATCTGTCTCCT 

GCTGCCAGCCCCTCACCCCCAGTCGGTCTCCAACATAC I I I I I I I I I I I I 1 1 I ICGGACN 

CGTGGGTCGAAAGCCTTGACCCTGCC 

Sequence 1307 

ACM I 1 1 I I I I I I I 1 1 I I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCA 

CCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCCCCTATTATTTTCCTAC 

AAGTTGCTTTTTACTTGAGTAGGAACCCTTGATTGATTTTTGCGGACGCGTGGGTCGAAG 

CTTGACCT 

Sequence 1308 

ACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCAC I I 1 1 I I I I I I 

I I I I I 1 1 I I I 1 1 1 1 I I GCCTTCAACAAAAAAGGAATTTATTGGCTCACATAAGNGAAAGC 

TGANAAATAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAG 

ACTCCCTNTCACCCTGGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTCAAGCTTC 

GACCCACGCGTCCGTCTTANCCTACAAGGNGGCTNTTATGTACATAGNTGGGTAATACAT 

CCAATTAATGATGT NTGA CATGCTATTTTTGTAGGGAGAAMTATGTGCTMTGATATTT 

TGAGTTAAAANATCTTTTGGGGAGGATTTGCTGAAAAGTTGCACTTTTGTTACAATGCTT 

ATGCTTGGTACMGCTTATGCTGTTTTAMTTATTTTAAAAAAATAAAATACTGTTTGTN 

AGAAACCANCTGGTTTAGAAAAGTTTNNTATGTGACGATAAMCTMGAMTTTCCCTTN 

TTATTCCTAGTATTTTTCAGCACTCCATNAATTCTATTACCCTAA 

Sequence 1309 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAAGACCCAAGAGAGCTGATGGTGCAGTTCCAGTCTCACAGAAGTCTGCTGGAGAAT 

TTCCTCTGGATTGGAGAAGGTGGTCTTTT CATTC TACTCAGGTCTTCTATTGATTGGATA 

MGGTGCACATACATCATGGNGGGCMTNTTTTTTACTCANAGGACATCTGTGCCAATGT 

TATTCCCATNCAGAAACAGCCTCCMGTTGACACATNAMTTTAACCAATACACTTTTCC 

TTATCTTNACCCAAAANTTAAGGTTAMGAAGTTTTNGGATNNCCCTGAATAANNNATTG 

GCCA TTNGGAATGCCCACTTTn*NCCTTTTTTGGGNGAAAAMTTNTAAMTTNACCNNT 

TTTTNTNAAMNTAMAAAAACTTGTTGGGCTTANGGNCCACCTNGGTTTNCANCCTTAT 

TMTTCCCAGCCCNTTTTTNGGANGGNCCCNANNGNCAGGNNNGGATTCCCTTTGNGCCC 

CCCGG GAANTTTCMNNNNTTTNNGCCCNGGGGNTAANAAANAAGGNGNAAATCCTNTTT 

TTTTTCANAAAAAAAAAAAAA 

Sequence 1310 

GGGGGGGCATTTCACGCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGG 
GCAGGTCGCAC I I I I I I I I I II I I I I I I I 1 1 ACCTAGCCCATGTTACCAATCTAAATGAA 
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CTGTCTATGCATAGAAAACTGCTTTATGCCTAAGATAATTACTGGGATTTAAGAAAGTGA 
GAAAAAAGAATAGGTGGGATTGAGAAATTAGGTAAAAACAGAAGAGGCCAACTAAACCCA 
AGTGCTGCCCTTCAAGGGCTCTAGTAACCGGACGCGTGGGTCGAAGCTTGACCT 
Sequence 1311 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAATCCCGTCTT 

ACAGAAGAGAAAACTGAGATTTAGCAACATAAAAGTATTTCCCGTAAGTAAACAGTAGAG 

CCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATGCTATAAAAGAGGAGCATTAG 

AAATATTTGAAAGATAGAAATGAGAACTAGTCAATATTTATTTTGCTTAGCACTGTATTC 

AGTATTATGGCATCTTAMGTAGTTMGACTCMTATTTCATCAAAMAGTTTAAAATCT 

AATCAGAGAAT 

Sequence 1312 

TACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGGCCACATTTTCMTTTAGCCATTTTTCTCTTATTC 

TGTAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCTGAGAATT 

TTATTAGCTGCTMCGCTTGTCTCATCATMGCACTCATATGTTCATTGAGTAAATATTT 

ATTG AGTA TTTGCTATG GTCCA GGCACTGTGCTAAGTATTGAGGATAAAATGGTGATTGA 

AACATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAAAGTATGTTAATATCAAAAAA 

AAAAAAAAAAAAAAAAAAAAGGN 

Sequence 1313 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAAAAGCCTCACTCACAATTATT 

TTTGAGATAGCTCCCAATGAGTTTAATCACTGCTATGCCAGGTGTGTGAGGCTGCTGTGG 

GACAACAATCTTG ATTCC TAGAAGAGTCATAAATTTCTAGGGACTACAGGCTCCTGCCAC 

CATGCCTGGCTMTTTTTGTAGAGATGGGGTTTCACCATGTTGCCCAAGCTGGTCTTGAA 

CTTCTGAGCTCAAGCGATCCACCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGTGTG 

GGCCATCACGCTTGCCCTAGAGTAATATTCTCTATTATCAAGGTAGAAAGTTCAACATAT 

ATTC ATTAGATCTACTTTATAGATACTGTTACTCAGATCACTTATATCGTTATATGTATT 

TTTTGTCTTCTTMCTTCMGTCTTGGATGAGAGAAGAGGTGTTTTAAATTTCTCTGTTA 

TTTCTAGGGTTCTATTCATTTT 

Sequence 1314 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGATGACAACATC 

AAAACATACTCTGATCACCCCGAGAAAGTAAACAAAGATGATGAGGAATTCATAGAAAGC 

AATAAAATGCATGCTATTAATGGAAGAATGTTTGGAAACCTACAAGGCCTCACAATGCAC 

GTGGGAGATGAAGTCAACTGGTATCTGATGGGAATGGGCAATGAAATAGACTTACACACT 

GTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCCGGGCAGGTGCTGTGAGTGCTCTGG 

CGMGTTTGGAGCCCAGAATGAAGAGATGTTACCCAGTATCTTGGTGTTGCTGAAGAGGT 

GTGTGATGGATGATGACAATGAAGTAAGGGACCGAGCCACCTTCTACCTAMTGTCCTGG 

AGCAGAAGCAGAAGGCCCTTAATGCAGGCTATATCCTAAATGGTCTGACTGTGTCCATCC 

CTGGTCTGGAGAGGGCTCTGCAGCAAGTACCT 

Sequence 1315 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAGAT 

CAGACGTGGCGACCCGCTGAATTTAAGCATATTAGTCAGCGGAGGAAAAGAAACTCTGAA 

TCCGACCAGTGTAGGTGATTACATTAGCCT7TGAAGTCAACACAAAGTTTAAAACACCTG 

CCCGGGCGGCCGGCCGCCCGGGCAGGTGTACAAGCTTCGACCCACGCGTCCGGCTGAAGA 

CATCCCTAGGGCAGGTAGCAGAATACCTAATTCAACCTAGAGAGGCACAGGCTGCACGAG 

AGTCTCTCAGATAAAGCCCCATTGAAAATAAATTTACMTCTAAAATTTAAAAACCCGTT 

AAAAAAGCAGCACAGCATGAGGAGTCAGTAGATACACTGAAAGCAAGATTAGATCTTCAA 

GACTTTCAMCTATAAAATTTAGAMATTATMTAAATTATGAAATAGAGGCCCTTTCAT 

GTCAAAAAGTCATGAAAG 

Sequence 1316 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCG 
AAAAGATGAGGCAACAAGTAAGAGAAAACAGCATTGAGCTTAGAGAATTGGAGAAGAAAT 



ft 

WO 01/070979 PCTAJS01/09126 



TABLE 1 

217/467 

TAAAAGCAGCTTACATGAATAAAGAAAGGGCAGCTCAGATTGCTGAAAAGGATGCCATTA 

AATATGAACAAATGAAACGTGATGCTGAAATAGCCAAAACCATGATGGAAGAACACAAGA 

GAATAATAAAGGAAGAGAATGCTGCAGAAGACAAACGAAACAAAGCGAAAGCACAGTACC 
TGCCCG 

Sequence 1317 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGCATAG 

GGACTCTTGCCTTMGGAGTGTAMCTTGATCTGCATTTGCTGATTTGTTTTTAAAAAAA 

CAAGAAATGCATGTTTCAAATAAMTTCTCTATTGTAAATAAAATTTTTTCTTTGGATCT 

TGGC AAAAA AAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACACTTGTTGAT 

AAGAGTTTTCTGAAAACAGTCTATCAAATATAAAGAATGGTTTCTATCCAAGAATCAGCA 

GTGAGGGAAGAAATACTAAACACCTGTCAAGAAATCAGTTATTCATTTTAAAAAATAACA 

GMCCAGTGCTGCTCTCTGTCATAAAAMGAACATGTAAAATTTATTT7TATAGGCTTTG 

GTAACATTATATTCCCCACAGAGGCCTTCAATCCTACTTAAAGATA 

Sequence 1318 

AGGTCAAGCTTCGAC CCACG CGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGGTT 

TAGTTGGCCTCTTCTGTTTTTACCTMTTTCTCMTCCCACCTATTCTTTTTTCTCACTT 

TCTTAMTCCCAGTAATTATCTTAGGCATAAAGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGTGCGACCTGCCCGG 

GCGGCCGGCCGCTCGTGATCTAGATCGCCGACCT 

Sequence 1319 

GCGAATTGGAGCTCC CCGC GGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGTTT 

GGGTGGMTTATAATATTTTAGATAAGATTTAAGAGGATTCTAATTCTAGCTACTTGATA 

GGAATGCGAATGATGATAAGGCTTTTAGAGTTAGATAAGAGAGAGGGCTAGCACCCTGAT 

ATTCTGTMTTGAMCAGAGTTTCAAGTCCTTTGGTCAAGTATTACCCTTATTCCTTCAG 

GMTAGTAGATATTTTAAGATTACAGATAGGTTATCTTATCTAATTTACCTACCTATTGT 

TGAAATTATTTAATTTGCATTTAACTGTGTTTTCACACCTGCCCGGGCGGCCCTCTTACC 

TGCTTCTGACCTTATGCTCAAGAACTCCCCTAACTCTGGCCAGAGCTCAGCTTTGGCAAC 

TCTGACCGTTGAGCAGCTCTCATCCCGGGTTTCCTTTACGT 

Sequence 1320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I 1 1 1 1 1 I I I 1 1 I 1 1 1 1 1 1 1 

AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAGGCAGTGCACCAAGAGCCACTATAAAA 

AGATCCTTAATGAGCAA MTATA TCCCCTATTATTTTCCTACMGTTGCTTTTTACTTGA 

GTAGGAACCCTTGATTGATTTTTGCGGACNCGTGGGTCGAANCTTGACCT 

Sequence 1321 

GG GCGAATT GGAGCTCCCCGCGGNGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGT 

TTGi m 1 1 1 iCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGCCATTATNATTGC 

TTACTCTCATCGCTTAGCGCCCCAGGTGGGATGTGTTTCCAAAACACATTTTTGTATTTA 

TMGGAMTGTAGTTAGGATTMTTTTATTGTCCtAATTAGAACTCACATTTTGGTTAAA 

TCCTCAATTTCATTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

ATTTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTNTGTGGAACCATGGAAGAAGATGAA 

AATGAGACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTAATTGGGAATAAGATGGAAAATATGAATGCTAAATCAAATTTTTTA 

Sequence 1322 

CCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCGTCCGCTCACTTCATCCTCCCAG 

CAACCTATTATGATCCATTGCCACACCAACTTGCTGATGAGGAAAGTGGGGCTTAAGGAA 

ATTAAAGAGCTGTTGTGGGACTTCCAAAGCAGAAGACAGTAGGCTTTCAGAAATTTGATA 

AAAATAGCACTTTGCATTTCTTGAATCTTGAGCTAAATGGAAATTAATACTAAACATTCT 

CCACTGGTAAAATAGAGAATAAGGATATTAACAGTAAAAGAAAAGAAGAAGAAAAGGAAA 

TGTGCTTCCACAGATTTAGAAACATAAGTAACAATCTAAGTTAAGGCTTTGGCACCTGCC 

CGGGCGGCCGGCCGCCCGGGCAGGTTCAAAGACTACCAAAGTATGTATTTGATTTTCACA 
TGCAAACAACTTAAA 
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Sequence 1323 



CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGCAGAAACCTTGCCATCATTCTTACTGCTGGTTTGCATCTCATTTATGGTGNISITCTGGG 

ATTTCTTCTCATGAACAGAAAGATTCAGTGGCTGTTATATGCCTTGTCAATTTAATGTAT 

TGCCCTATCCTCTTTTTGATCAAAGATAGAGACTAAGACTGGGAATTATGACAGAAAAAG 

TCATATTTTTCTTTAAATGATTTTGAAATGTTAAAATAGGCCAATATGAGTCAAAGTGCA 

MTTTTTTGGTGACCTGCCCGGGGCGGCCGGCCGCCCGGGCAGGTACTAAGCATTTCAGT 

TCCAGGAGMTAAAAGMA7TCCTATTTGAAATGAATTCCTCATTTGGAGGAAAAAAAGC 

ATGCATTCTAGCACAACAAGATGAAATTATGGAATCAAAAGTGGCTCCTTCCCATGTGCA 

GTCCCTGTCCCCCGCCGCCAGTCCTCCACACCCAAACTGTTTCTGATTGGCTTTTAGCTT 

TTTGGTGG I 1 1 M I I I I I I 1 1 

Sequence 1324 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGA 

AAAGTTTGCCAACCTCTGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGC 

ACAAGACTATGTAGTCAGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGA 

AAGGCGGAGAAGAAATGGATTTGGATAGGTAGTCAACAATGTCTGTTTTATGTTACCACA 

CATTTTCTCGAGAAATTTCMTCAGCTCTCTGAGMCAGATTCATCTTTAAATGAATGTT 

CATAGGTMCAGCAACTCATGCATCMTGTTGCAAAGTGAGCTCATTTTCACATTGCTTC 

AGGTTAGGCAGAAGGTTTGGTAMGGGATTMCGTMTTGTTTCCTTGNTGTTTACAAAA 

AGAAGTCCCAGTTGGCATGCCACATAAMTCTTCTGNATCTCACTCTTGGTTACATTTC 

Sequence 1325 

CTATAGGGCGAATTGGAGCTCCCGGCGGTGGCGGCCGAGGTCACGCGTCCGCAGAGGCCC 

AGGCTCCCAAACCGACAAAGTGAAAAGAGACCAGAGAGGCCAAGCATATTGACTGGTGCT 

GTTCAGGGCCTGCTCTTTTCCACTCACCACTTGTTTTGCTGCTTGTCACGAGGAGAGTTG 

TTCCTGTATGTGGCTGCTCTCAGATCTTTCCMGCAAGCCAGTCATTTGAAGAGGTTTTC 

TTTTCATGCTGGAGGGCAGGCTMGATCAATGAGTGGAAGAGAGAAAGGCTGTTTTAGCT 

CAAGTTAAAGGAACACCTTCTAGCCATCAAAGCCGCCCAACAGAGGCAAGGGCCACCACA 

CATGAGAGAGCGCTCTNTCCTTAA 

Sequence 1326 

GCGAATTGGAGCTCNCCGCGGTGGCGGNCGCCCGGGCAGGTACCAAAATAATTACCAACA 

NTACATTATGTACACCATTTACAGGAGGGTAACACAAACCTTGACAGGTAGTMCTTTTC 

ACCCCACATNACTGMCGCTTMCACTCCTGGCTGTTMTTGTCAGTTCAGTGTTTTAAT 

CTGACGCAGGCTTATGCGGAGGAGAATGTTTTCATGTTACTTATACTAACATTAGTTCTT 

CTATAGGGTGATAGCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 1327 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCTGTAGCCTATGACTTG 

AGTCTCTTGAACTTCTAGGAAGAGGCAAACTACAAACTACTAGGATTCTGATTTCAGATA 

TAGGCATTCCAGAATCTTCTCTTTACGAGTTCACCTGCTAGTATAATCTCCACAACTTGA 

ATGGCCTTGGTTGTTCTGTAATTGCTGCCAAAATCATCACAAGCTGTACCTGCCCGGGCG 

GCCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGTCCGGATGGGAATTCAGGTATGA 

AAGAAAACAGGCAAGGAGGCACTGAGGGAGAAAGACACAGACTTTATCGCTCTGTGGCTC 

ATTGTTACTGGMTATTCTAAMCTCTTGTTCACATGCTATTATGACTTATAAAGCAGCA 

ACAGCTGAGGCGCACCAGGACACAGCTTCCATTTCTTTAACGT 

Sequence 1328 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGA 

AAAATGGGAGACAATTTCACATGGACTTTGGAAAATA I I I I I I I CCTTTGCATTCATCTC 

TCAAACTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAA 

CCTTACCAAAAAAAAAAAAAAAAAAAGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTA 

CAAG CTCGACCCACGCGTCCGAMTMTAAAGCTAGAAGTMTATTTTTCCTTTTGTCTA 

TTTTCCAMTTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTATTTAAGT^ 

GAATATTTATTGATCACCTNTATGTGCTCAGTACCT 
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' Sequence 1329 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGC 

TTGGGGATTTCGAGGMGGGTTCATMGGGAGATTTTAGCTGAGAAATACCATTTGCACA 

GTCAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATG 

TTCTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAA 

TATCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGC 

TTGTGTATCAGTGGCAGGGTTTMGGTTCATTTTTTATTCCCATATTAATCCTTTM 

TTAGACMATTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGC 

CCTCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAACC 

Sequence 1330 

AGGTCMGCTTCGACCCACGCGTCCGTGMCTTTTATCMGGCTTTTGCTCTTTTAGA 

TGAGTTTATCTTTATMTTMGGAGAATGGTTTTTAAMTTTAGTTCCTCTGACACCCCA 

AAATTATCAAMTA AATTA TGTTGTAGTGMTCTGTGTTTTGAAAGTCATTGATAGGACT 

TATATGAGTCAAAATTTTATGGATTATAMCTAGGCTTTATCTGGTTGGAAATAATTGCA 

ATACMGAAGCAACTTTATTAAATTAGACCTAMGTCACMTCTTCTTCTTTGCTGCTTT 

TTAAAMTTACCTATTACCTTTAAAAGATCCCAAATTTAGAAGAGGAATTAAAATAAAAG 

TTAATGCAATAAAACACTTCCACAATATTCTATTACTTCAACCTCTAATCAATGAAA 

Sequence 1331 

AGGGCGA ATNGGAGCT CCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATA 

ATMTAAATTTTTGTTTTAAAAAAAGAGTGTTGTCTTTGTCTTGTATTTTCT 

CATGTGATATTCTTAGGTATAGA I WWW I AGTATTTGTCCTGTATATTGTTATTCG AG 

CTTCTGGGATCTGTGTTTTGGTGTCTATCATTMCTTTGGAATATTCTCAGTCATTACTG 

CTTCAAACATTCATTCTGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACA 

CCTTTTGTMTTCTCCCACAGTTCATAGATA 

TGCCTTTTAGTTTTAGAGATTTCTATTGACATCACTTAAAGATGATTGATGAGTTGATGA 

GAATTGAGAGAATTGATGAGAATTGTTGATGAGAATTATT 

Sequence 1332 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTTTCCTGCATTTC 
TAATGAAGAAATAGGCTGGTCCTCAATTTTGCAGAAGTTGTATCATCATAGGTCATACCT 
MCATTCGTTTGTCAAGAGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCA 
CAAACCTTCAGAATGTGGTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAA 

ATCAGCCTGGCTTTAACATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGA 
AACTTGACCT 

Sequence 1333 

CCGC GGTGG CGGCCGAGGTAGCGGTCGCACI I I I II I I I I I I I I I I < II I I AAATAG AG A 

TGAGGTTTTGCTATGTTGCCCAGGCTGGNCTNCTGGACTCAAGCAATCTCCCACTTCAGG 

CTACCMAGTGCTGGGATTTACAGGCATGAGCCACCTCTCCCAGTCTCAGTTATTATTTT 

AATAAATGAGACTGAACGTCCTCTTATAAGGCTCACTCCCTTGTTCCTACTACATTTGCT 

CTGTTTAAGTATCTCTTTAMTTCTTCAGTTAAGCGGACGCGTGGGTCGAAGACCTGCCC 

GGGCGGCCGGCCGCC CGGG CAGGTATTMCAGGTGCTTGCAGTTTGTTGACTTTTTTGAA 

AAAATCAAGTTGTAMCTTTTATTACAMTTAAAMTGAAGTTCTTAAAAATCTCAACTT 

GACCAGATATGAAACMTTTAAAMCCTTTAAGGCGTATTGAGAAAAACCAGGCTTTTTT 
AAA 

Sequence 1334 

ACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGCTCACCTGCCCGGGCGGCCGCTC 

GAGGCCGCTCACCTGCCCGGGCGGCCGGCCGCAC I II I I I I I I I I I I I I I II INNCAAAA 

CAAMCATGCTTAGCATGCACACTTTTACCACTTTTTTCGAGTGGAAAGTTTATTGGCAA 

TATTAAATTTCACCCTANATAGGATATGAGAATGTTTTGATAAATCACAATTTATAGTAT 

ATTMTGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCA7TA 

AACCTTAAAGAAA CCTTTC TCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATG 

MTTTGTAGATTTCTT1TTAGTTTAATTCCTGAGTATACAGGGCAAAAGCTTATATCCTT 
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TATATAMCTTCTGCTTTGGTCTAAMCTGATATATCTTCACGTTGAGGTTTCATCTGAA 
ATGCNCCACCGTTTGCTGACTTGCTTCMTATGMTTTGGATGGCTATAAAATTGACCTC 
GGCCGCTCTAGAACTAGTGGGATCCCC 
Sequence 1335 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGCCGCTCGAGGCCG 

CACI I I I I I I I I I 1 1 I I I I I I I IGGTAMCAGGCGGGGTMGATTTGCCGAGTTCCTTTT 

ACTTTTTTTAACCTTrCCTTATGAGCATGCCTGTGTTGGGTTGACAGTGAGGGTAATAAT 

GACTTGTTGGTTGATTGTAGATATTGGCGGACGCGTGGGTCGAATCTTGTACCTGCCCGG 

GCGGCCCATAGTTTGTCMCCACTGGTGTAMACCTTAGTTATATATGATCTGCATTTTC 

TTGAACTGATCATTGAAAACTTATAAACCTAACAGAAAAGCCACATAATATTTAGTGTCA 

TTATGCMTMTCACATTGCCTTTGTGTTAATAGTCAMTACTTACCTTTGGAGAATACT 

TACCTTTGGAGGGAATGTATAAAATTTCTCAGGCAGAGTCCTGGATATAGGAAAAAGTAA 

TTTATGAAGTAAACTTCAGTTGCTTAATCAAACTAATGATAGTCTAACAACTGAGCAAGG 

ATCCTCATCTNGAGAAGTGCTTAAAAT 

Sequence 1336 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAGATCCTGCGGAAGGAA 

TATGTTTTTGCTGACTCCAAAGTAAGTGACAGCAAACTTCTAAAATGGGCTGTGAGGTAG 

GGAGGGGACACAAGCGTTTTGAGGCTCGCTGNGTGCCAGGGAGTGTATCATTAGCTCACT 

CMTTCCCAGMCAACCCATTTCACACCTGGGAAAGGTGAACTTAGAGAAGTTGAGGATC 

ATGTTCCAGGTTGGCCTGGATTTGAGCCATCACTGTCTCAGGAGTAGGGAGGCTTCCCAC 

TTTGCCCAGCTGCCTCCCAGCCTCGAGGCCACATCCTTTATGACCCACATCTAACTCAGC 

CCCACACCTGGGGGAAAGGCTTTCAGCTTCTCTGGGCTGGACTTGGGAAATCTTTGGGAC 

ACTCCTGACCTGCCCGGGCGGC 

Sequence 1337 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCCCCATGAGGGCCAGGCCCAGGCAGAACCCAT 

CCCATTTTATCCTTAAACTCAGAAGGAAATNNGTCTAAATATTAAAGGATTAATATGGGA 

ATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTGTCTCTCTTAGAGTTCAAT 

GMCACTTCAGGAGAGTATTTCCAACMTATTTAGATATTGGAATATCTAAATATTGTTG 

ATTTAGATAACCACCCTAGATTTCTCACCACCCTAGMCATTTAGNGGGGAGACATTCTT 

TTCTCCTTTTTCTGATAACTTGGTCAGMGTGATTGACTGTGCAAATGGTATTTCTCAGC 

TAAAATCTCCCTTATGAACCCTTCCTCGAAATCCCAAGGT 

Sequence 1338 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGCTTCGACCCACGCGTCCGCTT 

GGGGATTTCGAGGAAGGGTTCATAAGGGAGATTTTAGCTGAGAAATACCATTTGCACAGT 

CAATCACTTCTGACCAAGTTATCAGAAAAAGGAGAAAAGAATGTCTCCCCACTAAATGTT 

CTAGGGTGGTGAGAAATCTAGGGTGGTTATCTAAATCAACAATATTTAGATATTCCAATA 

TCTAAATATTGTTGGAAATACTCTCCTGAAGTGTTCATTGAACTCTAAGAGAGACAGCTT 

GTGTATCAGTGGCAGGGTTTAAGGTTCATTTTTTATTCCCATATTMTCCmMTAm 

AGACAMTTTCCTTCTGAGTTTAAGGATAAAATGGGATGGGTTCTGCCTGGGCCTGGCCC 

TCATGGGGACATCAAAGGGCAATGTTGCAAAAAAAAAAAAACCTGCCCGGGCGGC 

Sequence 1339 

CTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

TTTTGATGAAATATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGNGCTA 

AGCAAMTAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTTATAGCATGGGCTCAGGTATCAGATGGCGTAGGTCAAGATCTTGGCTCTACTGTTTAC 

TTACGGGAAATACTTTTATGTTGCTMATCTCAGTTTTCTCTTCTGTAAGACGGGATTAA 

AGTACCT 

Sequence 1340 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTACTCAAGTAGTCT 

TTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAACAAGGAGTTTGATNTGTGTTNGCTG 

ATTGTGMCCATCAATTGAGATMCTCACTACCTTCAGGCCAGCCAGTTACATACTTTTG 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

221/467 



AAMGCCAAGAGTGMGCAGGGTTGTTTTTCATCCMTTCTTGGTCTTTTTGTT 

AGCAATAAGATAGGGTGGTTTCGGGCAATCACTTAGCTAATTGGCTCTCTATAGTCATAC 

CTGGATMTATTTGTAGTCATACGTGGGATAATATTTAAAGGGAAGAAACTAAACATAGT 

CCTTAAGTAGGAACCAACTACAAT 

Sequence 1341 

CCGCGGTGGCGGCCGAGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTAT 

TTCTTATTCTACAACAAGGGNGCAGCCTANAGGCAAAACACATCCCATTGTC AI I I I II I 

GTAMTAAAGTTGTATTGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGC 

TTTCACTTACAACCTGAGTGGTTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATAT 

TTACTATGTAGC I I 1 1 ICTTTTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTG 

GAGTGCGGTGGTGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATCCT 

CCCGCCTCGGTCTCCCAAGTAG7TGGGACTACAGGCATGAGCCACCATACCCGGCTAATT 

TTTTTAAAGTTTTTGGTAGAAATGGAG I I I I I I AATGTTGCCCAGGCTGGTCTTGAACTC 

CCTGGTCTTAAATGACCCTTTTCCCATCAG 

Sequence 1342 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAGTTATTTATTGATTTAATCAT 

TGTAATCTCCAATAGAGATTACAATAGAGATCTCCAACATGATTTCATGCATTTAGAGGA 

GAAATATTTCCTGGTTAAGTGGAAAATTGTGCGGATGTGGCTTCTGGAAGACCTTCATTC 

TAAAGCAGCGGACGCGTGGGTCGAAACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTCTG 

CAATCCAGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 

AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGAGGGGTGCTCTCCTGAGCT 

ACAGAAGGAATGATCTGGTGGTTAAGATAAAACACAAGTCAAACTTATTCGAGTTGTNCA 

CAGTCAGCAATGGNGATCTTTTTGCTNGTCTTGCCCATTCCTGGA 

Sequence 1343 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCTTTAAAACCCATA 

CCCCTCCAGGGTTCTTTTCTGTTGCTGGTGMGTGCACTTTTTAAAAGAGTNATTCATAC 

CATCAAGATTTTTGACMGAMATTTTAGAAMACTGTGGAAGAAAACTGATTGCTCTTA 

GTTCTAGCCATGTGTAATTGCTGACCACCTGAAATGGTCCAAACTGAGATTTGCTAAAGC 

ATAAAATACACACCATATTTCAAAGGTTTTTAAMGMTGTAMACATTTCATTAATTTC 

GGACGCGTGGGTCGAAGCTTGACCTGCCCGGGCGGCCGAGGTGGATGGACCCATCCATTC 

AGGCAGGGGGTGTGGGGTGTCCCCTGTGCTTAGAAACCACCTAGCATCATAAGCTGCAAC 

AGCACTTTATTGGGATCTGAGTCTACAGTTCACATAGGGAGGTGAAGCCGTGGGAGAAGC 

AGGGNGTAAAAAMAAAAAAGGGGGGGGGACTTTACCCCCCTAAGGACAGGNTGCTTTCC 

AAACCTAACAAAAAC 

Sequence 1344 

CCGCGGTGGCGGCCGAGGTCMGCTTCGACCCACGCGTCCGGTAATTTGTTGCACACTAT 
GTMCAAMCAACTGAAGATATGTTTAATAAATATTGTACTTATTGGAAGTAAAAAAAAA 
AAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I 1 1 I N II I I I 1 1 1 1 1 GCTGGGGTTT 
TTTTCTTTTCl I I I II I I i CAGCTACAGGAATTTAGCCAATTCANAGGAAATCTTCCCCA 
TMTTATGGMCTTTNTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACT 
GAAATAACAATAACAACTTCAACAAATAAAAAAACAGTTAAGCTAAATAAACAGATGATT 
GCAGAATTTATGTGATTACTGGGTACCJCGGCCGCTNTAGAACTAGTG 
Sequence 1345 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAAGTTTGAGTTGAAACGGTATGTGACTTCCC 

CAGCTGCACCCTGGGCAGTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAA 

AACTCCACAGATCACTCCTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCC 

AGTGCATTGAGGCTGGAAGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTC 

CCCTGTAGTTGGGCTAATCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTT 

TTATGGATATTTAGTATTTAGATAAAATTTTTACAGTATTCTTGAAATGAACCCAATTAA 

ACACATAGTTCTCAGTCTTGACCACACATTAAGAATCATCTGGTAGACTTCTGTAAACTA 

CCAATGCCTGGCCA 
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Sequence 1346 

CCGCGGTGGCGGCCGCAC I rTTTTTTTTTTTTTTTTI I ACCTGAAAATGCTTATTCTAGC 

TTCACATTTGATTGTTTGGCTMGMGAAAATTATTTATTAGACTTAATTTTCCTC^ 

GTTTAAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGAAGTCCAATGCCA 

TTCTGATTCTTGCMCTTACMGTAGTCTTTTTTTGTCTANACGCTTT 

TTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTGATATCTTTTGGGTACCT 

Sequence 1347 

CTCCCCGCGGGGGCGGCCGCCCGGNAGGCNAAGCTTCGACCCACGCGTCCGCTTTAAAGG 

GAATTCTNTGTAGAGTGGGAGGCGAACACGNCTGGNNCTTCCAACTCAGGAATTCTCGTG 

GCTGGGCTGGGTCAGCGATGGCTTTGTCTCTTTATGTCTAAAGTGCCCTATGGCATGCTG 

MGGTTACCTAACCATTCTTTAAAAGGAGAATGACCCTCCATGGGAATGGCCAGCCTGCC 

AACTGTGCAATTGAAGAAGACCCGATGGATCAACCCCATGTCTTCCTTGGGGAGAAAGTG 

CATAAACCAGGGGTCCCI I I I II I 1 1 I 

Sequence 1348 

AGGTCAAGCTTCGACCCACGCGTCCGCAAAAATCAATCAAGGGTTCCTACTCAAGTAAAA 

AGCMCTTGTAGGAAAATAATAGGGGATATATTTTGCTCATTMGGATCTTTTTATAGTG 

GCTCTTGGTGCACTGCCTGTGAGTTAGCCTTATCCTCAAGGAGCAGCTTAAAAAAAAAAA 

AAAAAAAAGT 

Sequence 1349 

GCGCGTATACGACTCCTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTA 

CAAACTATGTATCTGAAACACTTCTATTTGGCMTTTTATMCMATCAAATTTTAAAAA 

GAACAAAAGAGATTGCAGATTACTTCGCAGATACAGAATAAAGCAATTGATGAAGTGCTT 

AAGCAAAAGAAACAACMAAAAAGAAMCACACTGCTTTTCTTTTTAAAAATAAAATCAC 

ATTGCTATA GATCA AATGGATAATACCCTTATTAAACAACCATTCCAGAATGTCTTATAG 

TAGCAGTGCTTTTATTTGCACTTCACTTMTTTTATAAGACTCATTTTCATGTATATAGC 

TCTTTACCCCCATTGTTMCGAATAMGTCTCTCATMTTmACACTTTTAM 

AAAGCAAATG AGAAA TGATTTATGTATCGTGGMCCTTTCCCATTTTGGAACCAAAGGTT 

TT AATTCT AT ATTTTTG N CT AAT ATTTCTTT AAAAAAT 

Sequence 1350 

CC GCGGTG GCGGCCCGCCCGGGCAGGTACTATCTATAMGGAGGTTTGATGTTTTCATTA 

CTG GTAAATATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGT 

GG CTCAT ACCTGTAATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGA 
TAGTTTTATTCACTTGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1351 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGATTACCATGG 

CAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGTAG 

AAACAAAGTGC GGCC GGCCGCCCGGGCAGGTGCCGCA CI 1 1 I I 1 1 I I i I 1 1 I I I I I I I AG 

AATAGGATTGAATTTTATTAAAAACAAMATAAATCTAAAMGCTTCCTTCAGTTACAAA 

TATGCACMGAATTTCTGCATTACATCATTTGACATAAAATGTTCTGAATGACAGAAGTA 

GMGTAGAACTTACTACCATTTGAAGACAGGAG7TGAGCGCTGAAAACCACACACATTTA 

TAGAAAGAAACCAAAGTTTCACAGGGAAGACCTGTGATCTCTGGCTACAGGAGCTGAAAT 

TAGGAACATGAAAGAAACTTGGAGAGAGAAGACATTCAATACTCTAAAATACTTCAGCAA 

AAATAGTCAAACATNTGTNAACAACTTGGNACAAAACTTTATATGGTGGGGGGTGGCTAT 

GCCGGAAAAANTCTTNACTGGNTATTATTCACCTCAAAMGGGGGNTTTTAATGNTCACG 

AATCCTTCCTTTAAMTAMANAMGCNTGGNTTNTTTTTCTGGNGTCAAGAGTAAAANG 

TANTAGGGNNTACTCAGGGATGGTTTGGAATTTTAAAACGGGGCNTTCCACCCTTGGTGG 

CTNGTGGCANTTTANCCCAAAACGGCNAAANAACCGGCCCGNGGTCAACTTGNAAAACCT 

GGCGGNTTNAAATANGGACCCCGGGTGGGGGATTNANTNAGNTTTGANCCGNNNCTTGGG 
GGGGCCGGCCC 

Sequence 1352 
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CGAGGTCAAGCTTCGACCCACGCGTCCGGTTACTAGAGCCCTTGAAGGGCAGCACTTGGG 

TTTAGTTGGCCTCTTCTGTTTTTACCTMTTTCTCAATCCCACCTATTC 1 1 1 1 1 1CTCAC 

TTTCTTAMTCCCAGTMTTATCTTAGGANMGCAGTTTTCTATGCATAGACAGTTCATT 

TAGATTGGTAACATGGGCTAGGTAAAAAAAAAAAAAAAAAAAAAAGGTGCGACCTGCCCG 

GGCGGCCGCTCGAGGCCGCCCGGGCAGGTACTATGTCGATTCGACAGAACATTCAGAAGA 

TTCTCGGCCTTGCCCCTTCACGAGCCGCCACCAAGCAGGCAGGTGGATTTCTTGGCCCAC 

CACCTNCTTCTGGGAAGTTCTCTTGAACTCMGMCTCTTTATTTTCTATCATTCTTTCT 

AGACACACACACATNAGACTGGCAACTGTTTTGTAGCAANAGCCATANGTAGCCTTACTA 

CTTGGGCCCNTTTCTAGGTTTGMTTATTTCTMGCCTTTTGGGNATGATTAGAGNGAAA 

ATGGCNCCNGCAMCTTGNAGGGGC7TTTGGNNCCANAATGATTTTTAATAAAAAAAAGG 

GGATTGAATAGNTAAANTCMGGGAANGGTTTATGNAAAGGAAAAAAAAAAGCCTCCTTC 

NTGGTTTMTTTACAAAAGGTNTTTTNTNGGGGGACCGNCTNTAAAGNACTNGGGNTTNC 

CCCGCAAGGTGGGNNGGTATTNACCCNTTTNNGGNTTNAAAAAAAAAANTTNGNNNGGGT 

NAACCCTGGAACNGGGGGGGTTNGGNG 

Sequence 1353 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTCTTCGACCCACGC 

GTCCCGGGGTTTCAMCTGMTAACTCATTGTTGGGTCTGGATCTGTAAAMGGTTTTGT 

CACTGATGGGCAATATGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAAC 

AACTGGATGAAAGCTATAAACAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGA 

ACTtCAGGAGATTCCTAAGMGGCCTTCAGTGATCTTTTCATCAATATCATCAGGCCTTA 

TCATTGTTTCACATTTGCTTCTCTTTACCATAGGGAATATAATAATTATTTACTGGTTAA 

CTTCCTAGGGAGATTGCCTGCGGGTTATTTAAGATCCAAATTTTAMGTMTAATTTCTG 

TTGAAGCTGCTTGTGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTA 

MTGAGCCCGGGATGTAGGCAGGTTTTGATGTTTTGCTTGCTTTTATCCCTAACATT 

Sequence 1354 

TATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAACTT 

TTTTGATGAMTATTGAGTCTTAACTACTTTAAGATGCCATAATACTGAATACAGTGCTA 

AGCAAAATAAATATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGCTCCTCT 

TTMGCATGGGCTCAGGTATCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTT 

ACGGGAAATACTTTTATGTTGCTAAATCTCAGTTTTCTCTTCTGTAAGACGGGATTAAAG 

TACCT 

Sequence 1355 

AGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGGGGTTTCAA 

ACTGMTMCTCATTGTTGGGTCTGGATCTGTAAAAAGGTTTTGTCACTGATGGGCAATA 

TGGAGAGAATGTAAAAATATCTAACTTCAAACAGAAAAAAGAAACAACTGGAATGAAAAG 

CTATAMCAATAGTTCAGAAGATTGGCGTAGAGGATTTACCTACAGAACTTCAGGAGATT 

CCTMGMGGCCTTCAGTGATCTTTTCATCMTATCATCAGGCCTTATCATTGTTTCACA 

TTTGCTTCTCTTTACCATAGGGAATATMTAATTATTTACTGGTTAACTTCCTAGGGAGA 

TTGCCTGCGGCTTATTTAAGATCCAAATTTTAMGTAATAATTTCTGTTGAAGCTGCTTG 

TGAGGTGGTTGGGTGGGCAGATAGAGTGAAGCCAGGGACACACACTAAATGAGCCCGGGA 

TGTAGGCAGGTTTTGATGGTTTGCTTGC 

Sequence 1356 

CGCCCGGGCAGGTACTATCTATAAAGGAGGTTTGATGTTTTCATTACTG 1 1 1 1 I GTAAAT 

ATTTCAGCATTATCTTTAAAAAGTAAGGACATTGGCCGGGTGCGGTGGCTCATACCTGTA 

ATCCCAGCGCTTTGGGAGGCNGGTGGGTGGATCACCTGAGGCTAGATAGTTTTATTCACT 

TGGCTGTTTCACCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCT 

Sequence 1357 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCGCGTCCCGAGTGTTTCCA 
CTCTGTCCATAAAATGGGAGCTAATATTCTCCAACCTGTGTGCCTGACATGATGGTTAAA 
GGGATTAAACAAAACAATAGTTTGTAATTTATTCTGTCAGAGCAAACTGCTGGTAAATAA 
AAGGGCTAGTGACGAAAAATAAATTTTAAAAMCCTMTAAAACAAGTTTGTAATTTATA 
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' ATTGTATACAAATAAAAGATGTTACAAAAAAAAAAAAAAAAAAAAAGGACCTGCCCGGGC 
GGCCGGCCGCCCGGGCAGGTTTTATTTMCATTCAAACTTCATTAAGACATGTGCAATAT 
GGCMTTTTACTGGGGATTAAACCCTACCTAGGATTGCTTGCTGGGGCTTAGCAACAGGG 
TCCAGTTCACACTTAGCACTAATTAAATACTTTATTGAATAAATACAATACCAAACAAAA 
TGCATTCAAA 
Sequence 1358 

CCGCGGTGGCGGCCGAGGTCMGTTTCGACCCACGCGTCCGCATTATCCTTCTTGCCATC 

TACCCCATGTTAMGCCAGGCTGATTTGTTCTTGGATCTTAAGTGTGACAAAGCCTGCGG 

GAAGG AACCACATTCTGAAGGTTTGTGGGCTGTGTGAGATCCAGAGAACCCAAGGGGGTT 

TTTTTGCTCTTGACAAACGAATGTTAGGTATGACCTATGATGATACAACTTCTGCAAAAT 

TGAGGACCAGCCTATTTCTTCATTAGAAATGCAGGAAACCTGCCCG 

Sequence 1359 

CGCG GTGGCGGC CGAGGTCAACGCTTCGACCCACGCGTCCGGGACCTCAGAATATAAAM 
TATGG I I 1 1 I I I I I C AG ACTT ACTAG I I I 1 1 I I I GATAATTCCTCTACGAATGTTGATTT 
AACTTAGAMTATGTAAATTTAATATTCAAAACCAMTTATTTTTTAAAGAGGAAAAAAA 
TATAAAACCTGCCCGGGCGGCCGGCCGCAC I I I M < I) 1 1 M I 1 1 1 N I i I I NAAACTTT 
AATAGNGTNCGGAAGNTGAATAATTTATGAAGGAGAGGGGTCAGGGTTGATTCG 
Sequence 1360 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCAC 

NCGNCCGGAGATGTATACTGCCACTATAGGAACTATAAGAAAAAGTCAAATGGAAATNTN 

ATAAATAAAAACCACAGTCACTATAATGAGGAAATACTTTGATANGGNGTCAGTGAACTC 

AAAAAT NANTCAA TNGAAACTACTGAAACTAAAACTCAAAGAGAAAAAAAANGATGGGAG 

ATMTTATTTTTTMGAATTGGTCATCAAMTGTAGCAACAAGTTCGCCTTATCCTATAT 

CATTTGMTTTTCAAAAAATAAGCTCATTATACAATCTTTAAAATATTTTGAATAGAACT 

GTTTCATGTGTTATTNGTGAAAAT 

Sequence 1361 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 

CAAGGAGTTTGATCTGTGTTC AACT GATTGTGAACCATCAATTGAGATAACTCACTACCT 

TCAGGCCAGCCAGTTACATACTTTTGAAMGCCMGAGTGMGCAGGGTTGTTTTTCATC 

CMTTCTTGGTCTTTTTGTTAAAGGCAGCMTMGATAGGGTGGTTTCGGGCAATCACTT 

AGCTAATTGGCTCTCTATAGTCATACCTGGATAATATTTGTAGTCATACCTGGATAATAT 

TTAMGGMGAAACTAAACATAGTCCTTMGTAGGMCAACTACAATTTTAAC 

Sequence 1362 

ACTGI I I 1 1 I I I I ATTTGTTGAAGTTGTTGTTGTTATTTCAGTCTTTTTCTTATTGGGTT 

GACCAGACTTGGTAAAATCTGTMGAMGTTCCATAATTATGGGGAAGATTTCCTCTGAA 

TTGGCTAAATTCCTGTAGCTGAAAAAAAAAAAAAAAACCTGCCCGGGCGGCCGGCCGCCC 

GGGC AGGTTACMGCTTCGACCCACGCGTCCGGGAAATTTTAATTAAAAATAGGTGAACA 

TTTTAAATGACCTMTACATATTTAGTCCACATTGAMCTTTGGCATTTTGNCATTGCCA 

TTAMATTTTTGATGGCATTAAMTTTTNATGCCATTAAAMTTTTTGATCAGT^ 

CA 

Sequence 1363 

CCGCGGTGGCGGCCGAGGTACCACGGTTGTCCCCTGAAAGGTGTTGTGTCCCTCACCAGA 

CTGGGAGCACCTCAAGGGCAGMCCCATGTCATGTTCCTTTTTGTATTTCCAGACCTGAA 

ACTGCCAGTAAATAAACCCTAAAAGTAGAAAGAAAAAAAAAAAAAAAAAAAGTGCGGCCG 

CCGCACI 1 1 I I I I 1 1 M I I I I I I I I I NGGAAAACCAMCATGCTTTATTTCATTTTTTTC 

ACAATTTATTTAAACATCTCANATATACAAAATAGGTACCT 

Sequence 1364 

CCGGGCAGGTCAGGAGTGTCCCAAAGATTTCCCAAGTCCAGCCCAGAGAAGCTGAAAGCC 
TTTCCCCCAGGTGTGGGGCTGAGTTAGATGTGGGTCATAAAGGATGTGGCCTCGAGGCTG 
GGAGGCAGCTGGGCAAAGTGGGAAGCCTCCCTACTCCTGAGACAGTGATGGCTCAAATCC 
AGGCCAACCTGGAACATGATCCTCAACTTCTCTAAGTTCACCTTTCCCAGGTGTGAAATG 
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GGTTGTTCTGGGAATTGAGTGAGCTAATGATACACTCCCTGGCACACAGCGAGCCTCAAA 
ACGCTTGTGTCCCCTCCCTACCTCACAGCCCATTTTAGAAGTTTGCTGTCACTTACTTTG 
GAGTCAGCAAAAACATATTCCTTCCGCAGGATCTTCCGGACGCGTGGGTCGAAGCTTGAC 
CT 

Sequence 1365 

TACTATAGGGCGNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCC 

GATTGATTAATACCTGTCACAGATACATTTTGGTTTACAAATCAATGAACAATGGAGGGA 

ACTCTGTCCTTAATCTTGGTACGAGACAATGAACCCCAGGTACTTACCCCAGACAACGAC 

GCCGCTTNACCATGATGATGGACAACAGGCAAC I I I I I 1 1 I I GGAGTTTCAGCTTGCTTC 

CAACAGGGACGGTGAGTGTGAGGTTTATTCCCATTTCTMGACGATAGMGTTTTCAGCC 

TAAGCCGTATTCCTAGGTAAGCAGCTGGATTGCAGTTTTTGTCTTGGAAATTNTCCTTAA 

TTGNNTNANNCGTTMNATTMCAAACTAGCTGGNTNTTAM^ 

TANAGGTNCCCCCANAAATTNAAAATNAAATTTNTGCMTTMTTTTTGAACCTTGCCCC 
GGGGTGGGCCCTGGCCCCCCCCTNGACAANGN I I I I I I I 1 1 I I I I I I I 1 1 1 1 I 
Sequence 1366 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCG 

ACCCACGCGTCCGGAGCTGCTCAATAGTGAGAATCAGGTGATATAATGCATGTGGAAAAA 

GAATGTGAAAMTCTMCACTTTAGATTGTATACAGTGTTTTTTAAAAAGACACAAAAAA 

ACTGTCAACATGAGAAACATAAGCAAAGTTTTACTCAAGACAAACATCCACGAGTCACAA 

CTTCAGTTATTCCCAGTCTTCAAAATAACAGAAGGGCAAAGCAAAGGTAAACATGCAAA 

Sequence 1367 

GACTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGATTCTCTGATTAGATTTT 

AAACTTTTTTGATGAAATATTGAGTCTTAACTACT7TAAGATGCCATAATACTGAATACA 

GTGCTAAGCAAAATAMTATTGACTAGTTCTCATTTCTATCTTTCAAATATTTCTAATGC 

TCCTCTTTTATAGCATGGGCTCAGGCATCAGATGGCGTAGGTCAAGATCTTGGCTCTACT 

GTTTACTTACGGGAMTACTTTTATGTTGCTAMTCTCAGTTTTCTCTTCTGTAAGACGG 

GATTAAAGTACCT 

Sequence 1368 

CCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGATTTACAAGGGACAAAA 

TGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAGAATGGTCCAAGGACT 

GTTAAACAGAGGAAGTATTTACATTCTGAAAACTTGCGGACGCGTGGGTCGAAGCTTGTA 

CACCTCGGCCGAGGTACCTTCTGTCACAAAGACCCAAGCTTCCTCCAGCTTCCAGGATAG 

CAGTCAGCCAGCTGGAAAAGCCGAAGGGATCAGGGAGCCAAAGGTGACTGGGAAGCTAAA 

GCAACAATCACCTAAATTACAGTCCTCCAAGAAAGTTGCTTTCCTCAGGCAGAATGCCCC 

TCCCAAGGGCACAGACACACAAACACCGGCTGTGTTATNCCCATCCAAGACTCAGGCCAC 

CCTGAAACCTAAGGACCATCATCAGCCCCTTTGGAANGGGCC 

Sequence 1369 

CCGGGCAGGTCGAGCGGCCGCCCGGGCAGGTTTCCTGCATTTCTAATGAAGAAATAGGCT 

GGTCCTCMTTTTGCAGMGTTGTATCATCATAGGTCATACCTMCATTCGTTTGTCAAG 

AGCAAAAAAACCCCCTTGGGTTCTCTGGATCTCACACAGCCCACAAACCTTCAGAATGTG 

GTTCCTTCCCGCAGGCTTTGTCACACTTAAGATCCAAGAACAAATCAGCCTGGCTTTAAC 

ATGGGGTAGATGGCAAGAAGGATAATGCGGACGCGTGGGTCGAAACTTGACCTN 

Sequence 1370 

CCGCGGTGGCGGCCGCCCGGGCAGGTGTCGACCCACGCGTCCGACGACTCACTATAGGGA 
TCTAGATCACGAGCGGCCGGCCGCCCGGGCAGGTACAGAGATTTAAATGAAATCTTCGAA 
AGMTAMTTTGCTTTTCAGTCCACTGTATTTTCAAAATTT 
Sequence 1371 

CCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCACATACACTTGTTTACTTTGTATGT 
TTTGCAGGATTAAACTTTGTATAATCTTTTTACAAAA I I 1 1 1 1 1 1 I CAGTATGCAAGCTT 
GCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAAAAAAAAAAAAAAAAAAAGTGCGG 
CCGGCCGCCCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCGGAGAnnnnnnnnnnnnn 
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nnnnnnnnnnnTGATCTAGATCACGAGCAGCCGCACTTTN I I I I I I I I I I I I I I I I I IGA 
CGMTTCAGGAGTCCTTTATTAGCCGGCAGCCGANAGACAGCTAGCGCTCAAAATTNTCT 
TGGCCCCGAAGAAGGAGCTAGATTTTCTTTTATAC 
Sequence 1372 

TTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATAGCACATTACATGTCAA 

ATCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTC 

CGTGAAATCAATATCCCGCACAAGAGTGCTAAANAAAAAAAAGNNANAGGNCTGCCTNTG 

TnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTNGNTATCAAGCTTAN 

TCGATACCCGTTNACCTTGNGGGGGGGGCCCGNTANCCCANT^ 

GGGGGGGGTAAATTTNGCCCCNNNTNGGGGNNAAAAAAAANGGGNAAAAAA A I 1 1 I I I I I 
CCTNGNGGGNAAAIsrrTTTTTATNTN^ 

NGGNCCNGGGGGNNNNNTAAAAGNTAAAAAANNNNTNGGGGGGNCCNTTTNNGGGGGGGN 
CCNNNANNNNCCAATTNNNNNNGGGGGGNNCGNCNNTNTTNCC C I I I 1 1 I I I AAAAAANN 
MAAAAANCNCNGCCCCCCCCNTATTTTTNAATTNAAANCTCCCCCCCCNNGGGGGGGGG 
G 

Sequence 1373 

ccgcggtggcggccgcccgggcaggttatagacmtatgctccttaaggtccctttcagt 

cccgttctatggatctgtgtagtttcgcttcttttttcaatatgctcagaattaggacac 

caatgttaatggaagataaggaaactataccacctatcccttatagaagatttgtgcact 

aactaatatgagccctggaagatcaagccagtagaagatagaagatctatccctgcttta 

tactttggatcatttatttgtgaagatcac^ 

ttcatggaagttcggggaaattaactggatctacttctagtctaaataagctcagtgttc 
agagttcagggaatcgcagatctcagtcatcttccctgttggatatgg 

Sequence 1374 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGAAGAAGTGCTATGTGGCGAGAAAAAAAAGTTTTAATGTATTGGAGMGTTTTAAAA 
AACCCAG AAAAATGCC I I I 1 1 I I I I I I I I I I GAACATAAACTCAAGATTTTATTGTCTTC 
ATMTAAAAGATGACACTTAGAACTGGATCACTTGGCCCTTCTCTTCTTATCTCCTCCCA 
GTTCAAAATGCTTGCATNTTTTAATAGCCAGCATTNTTTTAANATCTGCAGGTNGGGCTT 
AAACNCCNCTTAAAGCCTTTAAACACAAA I I I I I I I I 1 I GGNGGNTTTTTMGCCCTTTTT 
TCCCGGAAAAAANTCGGNTTTNAATTTTGGNCCNANAAAANNACCCCTTTTG I I I I I 1 1 I 
NNNAANMAMCCNCNTTTTTTNNTNNG 

TTTAAAAAAAAAAGGGGGNNCCCCCCCGGGCNGNGGGGGAI I I I I I I I NTAAANNTTTTT 

TTNNNCCCCCCCCCCC 

Sequence 1375 

CCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCACATACACTTGTTTACTTTGTATGT 

TTTGCAGGATTMACTTTGTATMTCTTTTTACAAAAI I I I I I 1 1 I CAGTATGCAAGCTT 

GCAAGATGAAAATAAAACCTGTTTGCCTGATTAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

GTGCGGCCTCGAGCGGCCGCCCGGGCAGGTACTCTATAAGAGGTGTGGGTGTCTTGTTTG 

GTCAGGATGTTAGAMGTGCTGATMGTNNCATGATCAGTGTATNCCAAMGGTTTTTAG 

GMGTATGGCAMAANTGTTGTATTGGCTTTTATGGGGACATGATNNTAGTCNNCTTCCT 

TTTTAAANANGNNTTATNTTGTTCNANTGGNTTAANTGGTTTTTAAAAA 

Sequence 1 376 

TACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGAC 
CCACGCGTCCGGCCACATTTTCAATTTAGCCATTm 

ATTAC TCTG TAATTCCACTAAGAAAAGTCAATAGATAATTCCAATAATGACTTCACTCCT 

GAGAATTTTATTAGCTGCTAACGCTTGTCTCATCATAAGCACTCATATGTTCATTGAGTA 

MTATTTATTGAGTATTTGCTATGGTCCAGGCACTGTGCTAAGTATTGAGGATAAAATGG 

TGATTGAMCATTTTCCCTTCTTGATTTTAACATCTACAAAATAAAA 

Sequence 1377 

CCGCGGTGGCGGCCGCCCGGGCAGGTMTCACACAACACTTTCTTTTCCAACTGCTGCAA 
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AGTGCATCTACMTATGCTATTACAGATCCACTTTTAAAAGGTTTCCTGTGACATTACAG 

CAAGCCTCTTTTTTCAAACAGAGGAATAATCCCAAATTCTTCCTCAAATAAACTCCATTC 

CAGTAAATGGTAAATACATAAAAAATTACAGTAAGCCAGACACTTAAAAGGACAGCCAAG 

AAGTCTTCCAACAGTTTATTAGAAAGAATGTAGACATTTAAAAAAATCCCCACTGTCATG 

MCATAMTTGAGGTTTTCAGCCCGGGTATAAGCTGAATCAAAAAAAGGAAATAAAAAAT 

CCAATAGTGTATTAMCATTTTTCACTCATTTGCCATACTGACAGTGCAAATCAAATCTG 

GACTAA 

Sequence 1378 

CCACTGGATTTGACTCAGAGAGGACCCCCAGAGGGTGTCTCCATCTTCCCTATTTATTTT 

CAGCCCTTGAGGGCTTCATTGTAGATCAAAGCCAAGGCCCCCAGGAAGGNGACATACTCC 

TGGMGTTCACCTCCTGGTCCTTGTTCCGGTCCMGTCTTCCATCAGCCTTGCAATTTCA 

GCATCCTGCAGCTTCNAGCCAATGGTGAGCTCCTTCTGGATCAGCTCCTTCAGCTCCTTC 

TTGCTCAGGGTGTGCTTGTCACCCTCCCTGCCGGAGATACCTGCCCG 

Sequence 1379 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCAGGGATGGGAGG 

CACAAGTTGTGATTGGGCAAAGTTTATTTTCTATGTCAGCCTGTCAGTCCACTGCCCCAT 

TTTGCAAGAC I I I II I I I AGCCTTGACAAAATGTCTCAGTTAAGTATAAAAGTTTTTCCA 

CTACTTAGTCCAAAAAAAACTATTAAATCTTAATGAAATAGCCACTCTCAAAAAAAAAAA 

AAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTCTTTTTC 

TTCCCAAACATAGACTTGCAAGACATGGCCTGTATGAGAAGAAAAAGACCTNAAGAAAGC 

AACGAAAGGAACGCAAGAACAGAATGAAGAAAGTCAGGGGGA 

Sequence 1380 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGCACATTACAGTCAAA 

TCCCTTCTCGCCCCCATGGATGACCCCCCTCAGATAGGGGTCCCTTGACCACCATCCTCC 

GTGAAATCAATATCCCGCACAAGAGTGCTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAMGTGCGGCCTNGAGCGGCCGCCCGGGCAGGTACATNTATTTTGGATTGTATATTGNG 

TTTGTGTATTTACGCTTTGATTCATAGTAACTTN^ 

AAACTGTAAATAAAAAGAA 

Sequence 1381 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATCTGTCAA 

ATATACTATGAAATGCATAGTCTCCACTTAAAATGCTGAATGACACACACGTTTTGCAAG 

CATTACTGCTTTCCACAAAAACTGCTGAATAGGAGTTCCGTCCCTGCCAAGATCAGTGTT 

TAAGAGATACTTTATGATGCTGATAAGTATTATTGGTGGTGGTGGTGTTCAGAAAGTTTG 

TCACTCATGCAGATGTCTGAAATCTTGTTCCGAATCCATGGAACATAGGGTGGAGGCCAG 

CTCCCCCTTTTTTTAGATGATCACATAGTTCTGAGCAGAGATGTGGTCCTCACCCTGCAG 

TTCCTGCAGGAGCTGCTGCTGCTGGGATGGCTGCTGG 

Sequence 1382 

CCTAT AGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 

TCCGATT TTTTT AMGTATTTTCTAGTCTTTTCTCTCTCTGTGGMTGGTGAM 

TGCCGTGTTTTGAAAGTMGATGATGAAATGMTTTTTAATTCAAGAAACATTCAGAAAC 

ATAGGAATTAAMCTTAGAGAAATGATCTAATTTCCCTGTTCACACAMCTTTACACTTT 

AATCTGATGATTGGATATTTTATTTTAGTGAMCATCATCTTGTTAGCTAACTTTAAAAA 

ATGGATGTAGAATGATTAAAGGTTGGTATGA I I I I I I I I I AATGTATCAGTTTGAACCTA 

GMTATTGAATTAAAATGCTGTCTCAGTATTTTAAAAGCAAAAAAGGAATGGAGGAAAAT 

TGCATCTTAGACCATTTTTATATGCAGTGTACCTGCCC 

Sequence 1383 

CGAGGGAGTCTATTGGAGCCCTTGGGAACTTCTGGATCAAATTGGACCTGAATTGAGATC 
TATTTCTCAGCTTTCACTTATGTGAGCCAATAMTTCCTTTTITGTTGAAGGCAAAAAAA 
AAAAAAAAAAAAAAAAAAAGT 
Sequence 1384 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I M I I I I 1 1 I I I II I I I I 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

228/467 



' TTTTAACGTTTTAATTAMTGGATTTATTTAAAMGACTATAAAATCTGACATCAAGAGA 
GATAAAAAAAAAAGACCCATAAGATTTAAATTGACAAATGTAAAATGATTGGCTACAATG 
TAAAAATACATTTNCCAGCCCCCAMCAAMCACAAGTATAGTAATTATAAMTTTTTGG 
ACCTGCCCG 
Sequence 1385 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGGCCGCTGTTCGTATTTCTTATTCTACAAC 

AAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCATTTTTTTGTAAATAAAGGTTGTA 

TTNGGMCATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACAACC 

TGA GTGG TTGCCACAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAGCTT 

TTTCTTTTC 

Sequence 1386 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTC 

CGAGAAGAG TTTG CMATGCAACAAMTATTTAATTACCGGTTGTTAAAACTGGTTTAGC 

ACMTTTATATTTTCCCTCTCTTGCCTTTCTTMTTTGCAATAAAAGGTATTGAGCCATT 

TTTTAAATGACATTTTTGA 

Sequence 1387 

CTACTATAG GGCGATTGGAGCTCCCCGCGGTGGCGGCCGCACI I I I > I I I IN 1 1 1 1 I 1 1 
I I I 1 1 I I I IGCCTTCAACAAAAAAGGAATTTATTGGCTCACATAAGTGAAAGCTGAGAAA 
TAGATCTCAATTCAGGTCCAATTTGATCCANAAGTTCCCAAGGGCTCCAATAGACTCCCT 
NTCACCCTGGTACCTGCCCG 
Sequence 1388 

TAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGG 

GGAGGACCTAGGCAACG GCCT GAGACTCCGAGACTCTATGTTGAAGATGCCTGGACTAAC 

CTACTGAAGATACCGTGGTTTTACCAACAGCCAGCACCAATAGGAAGATATGAATGAAGC 

CATCTGAGACCAGCCATCTGGCAGCCAAACTGCCAACTGACTGCAAATGCATGAATGATC 

CCACTGACACCACGTAGAGCACAAATGAGTTGCCTCCACTGAGCCCAGCCCAAATTGTTA 

TCCTATAAAATCATAAAAACATAAACAGTTGTTTTAAGTCAAAAAAAAAAAAAAAAAAAA 

ATTAAGTGCGACCTGCCCGGNNCGGCCGGCCGCCCGGGCAGGTACCCATTAATTTGCTCA 

GATATAGCAGGCTTMTGGTTCTATATTTTCAAMGTTTTTAAGAATGGTT 

Sequence 1389 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAATAAGCCCACG 

CCACTAGGMCTATGTTAAAAAAAMTTCAAGAAAGAATTTAAGGGAGATTACAGTGTTA 

CTGTGACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGG 

GTTGGCCATNANAAGGAAGCCTGGACAGGTCCCTTGTTTCAAAGGTATGACACAAGGTAA 

CCCNTMGCCMGGCACCCAGACCAGTTTNCATACATAGAAAGTTACAGCTGCTTTTATA 

CCCCCTTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAG 

GAMGACCAGTAGAGAGAAAAAAAGGCCATCTATACCAATTNTAAGTTAATTTAGACTAA 

ACAAGGTCTTMTAGCAMGGATMTTGAAATCCCMACTTACAAGGTTTTTTAAC 

Sequence 1390 

CCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGGGTTTCATGAATGGAAACCTAAG 

TAAMCTAAGCTCATTAGTGACAGACTTGTTTTCTTCTTGTTATTCCTCCAGCAACTCCC 

TCACCACCACGCCTCCCTGCCTACCATCCCCGGAAGGGTGCTTATTCTTTAACAAAGAGA 

ATCTAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTGAGAAAACAGACCATATT 

TACTCACATAATTCGCCTTCTACCTTTCACCTGCTTATGTAATMTTTAACACTGTAGAG 

GGGACATGGAGGTGACCGGAGTATTTAGTGGGTTCCTTGCTCCTGGGCTGGGCAGGTTCA 

CAGGCCCCACAGGCCTTGGGCCCCAGCATC 

Sequence 1391 

CCGCGGTGGCGGCCGAGGTACTCTGAGGTCACGTCAAACTATGCTTTAGAACCTTCATCT 
TTTGCTTTCTTGGGCTTTACTTTCCAAAATGGACTACAGGATAATGAGGC I UN I AAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 

Sequence 1392 
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CCGCGGTG GCGGCCGCACCTGCCCGGGCGGCCGCTCGAGGCCGCACI I I I I I 11 i 1 1 1 I 1 
I I I I I 1 I I I I I I AAAAATTCAAAAAATTAGTTTATTAGCTTAATATAATTAGGTCAATGG 
MTCCTGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGNGACAAATTCAGCT 
GTTTTGTGGCATAAATAAGTGTCTAAGCTGGGCAGTTAGTCTACCC 
Sequence 1393 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGT C I I I » I I I I I I I I I I I I I CT 

TTTGAAACAATTTTTCTGAMTTTATTTCTAAMGTCAGAGACAAMCTTTAGGAGTGAC 

ACATTTATACTAAGCATACATGCGTGAGCAAAAAAAAATAAGCACAGAATACAAAAATGA 

MTAGTAMATTTTAATACAGTATTCTGAATACAAGTAGAATACCACTAGATAAGAATTG 

TATTTACCTAAGAAATCTATGATAGNGNGGGNGGAGATAAACCAGTTTAGGATAGCCACT 

TCACTATTCACATTTTAATCAGTGCTGACCAGAAGCTAAAGCAA 

Sequence 1394 

ACTTAGGGCGATTGGAGCTCCCCGCGGNGGCGGCCGNGGTACAAATAAGCCCACCCCACT 

AGGAACTATGTTAAAAAAAAATTCMGAAAGAATTTAAGGGAGATTACAGTGTTACTGTG 

ACACCAGGAAAACTTAGAACTTTGTGTGAAATAGACTGGCCAGCATTAGAGGTGGGTTGG 

CCATCAGAAGGAAGCCTGNACAGGTCCCTTGTTTCAAAGGTATGACACANGGTAACCCGT 

ANGCCAAGGCACCCAGACCAGTTTCCATACATAGAMGNTACAGCTGCTTTTATACCCCC 

TTGCCCCGCCAACGTAGTTAAGAGAACAGCAGCATAAGCGGCTGGCAGAGGCAAGGAAAG 

ACCAGTNGAGAGAAAAAAAMGGCCATCTATACCAATTCTAAGTTAATTTAGACTAAACA 

A 

Sequence 1395 

CCGGGCAGGTACAAAAGGGTTCCTCTATATGCCAACTAATTCCAAA I 1 I I I ACTTTTACT 

GCAAAAAMCCTTTTTGGCATCAAAACTCCATTGTTTCTCTGCACTCTGACACCATCATT 

TCAAAGGGGCTCACATAAATGATCACTACTGCTCTCTCCCTMTTTTTGAAAAAGGAGTT 

TTGAGMTAA MCA GTGCTTTTATTATTAGCCAACACAAAGTGTGAGAAAATCATTCCTG 

AGAATTMCATTTTMGCTAACAGMATTCAGTATACTTAAMCATAATTATATTTAATG 

AGTCATTATTTGGATCTAAAACGGACGCGTGGGTCGAAGACCTCGGCCGCTCTAGAA 

Sequence 1396 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCGCCCTGGGCA 

GTGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGTGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAA ACTCAA AAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAMTTTTTACAGTATTCTTGAAATGAACCCAATTAAACACATAGT 

Sequence 1397 

AGGTACTTTAATCCCGTCTTACAGMGAGAAAACTGAGATTTAGCAACATAAAAGTATTT 

CCCGTAAGTAAACAGTAGAGCCAAGATCTTGACCTACGCCATCTGATACCTGAGCCCATG 

CTAT AAAAGAGGAGCATTAGAAATATTTGAAAGATAGAAATGAGAACTAGTCAATATTTA 

TTTTGCTTAGCACTGTATTCAGTATTATGGCATCTTAMGTAGTTMGACTCMTATTTC 

ATCAAAAAAGTTTAAAATCTAATCAGAGAAT 

Sequence 1398 

AGGTTTGAGTCGACCCACGCGTCCGGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGG 

TGCATGGCCGTTCTTAG7TGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGA 

CTCTGGCATGCTAACTAGTTACGCGGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGC 

AAGATTCCTGATCGGTATACAGTGATGTATTTACTAAACAGAGACCTGTGCAGAAATTAC 

ATAC TATC CATCTAGATAGGTTGTTACACTTTTGCCTATTGATGGMTAGTTCCATTTAT 

CMGTTTTATACATCAAAAAGCTTTTGAAGTTCACCAGACTGTCCAT 

Sequence 1399 

CCGGGCAGGTACTGTAAATCTACTGTAATCCTGTTTTGCAGAATACTGCACGACGGAGAT - 

TGAGAAGTGAGAGCTCTTATGACATAGATAACATTGTGATTCCCATGTCATTAGTAGCCC 

CAGCTAAATTTAGAGAAACTCCAATATAAGGGAAATACTTACTCCCAGGGTATGGTATAC 
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TTTACCATCTTCATAN I 1 1 I I I CTTTCCCTTCCCTTCCCTTAAAAAAACTNAAN I I I I I C 
NMGGTGGMGAANTTTTTAATTNMNTGGAAAGGGANGCTTCCCTTCTTCCCCAGTTCC 
CTTCTTAGCCNATGGGAGGGGGAAACCGGG 
Sequence 1400 

CCGCGGTGGCGGCCGATTCTCTGATTAGATTTTAAAC I MM I GATGAAATATTGAGTCT 

TAACTACTTTAAGATGCCATAATACTGAATACAGTGCTAAGCAAAATAAATATTGACTAG 

TTCTCATTTCTATCTTTCAMTATTTCTAATGCTCCTCTTTTATAGCATGGGCTCAGGTA 

TCAGATGGCGTAGGTCMGATCTTGGCTCTACTGTTTACTTACGGGAAATACTTTTATGT 

TGCTAAATCTCAGNTTTCTCTTCTGTAAGACGGGATTAAAGTACCT 

Sequence 1401 

CCGGGCAGGTACCAGTTTGAGTTGAAACGGTATGTGACTTCCCCAGCTGCACCCTGGGCA 

GNGACTGCATGCATCACTGAGAGGTCCTGTCTACAGCAGATAAAACTCCACAGATCACTC 

CTCCTGTAATCCCTCTAAGTGCTCCAAGGCAGCAGAAAGGCCCAGTGCATTGAGGCTGGA 

AGCAGGAGCAGAGACTCTGGGATATAGNGCGAAAGTCTCTTTCCCCTGTAGTTGGGCTAA 

TCTGGAAAAACTCAAAAACCTGGCCTGATTACCGAGGTTTCTTTTATGGATATTTAGTAT 

TTAGATAAAATNTTTACAGTATTCTTGAAATA 

Sequence 1402 

AGGTACTCCCATTTCCCTGAAACAAGCAGCCAGCAACTATCTCAGAAATGTGTCAI MM 

ACTGGTTATAATTCTTAAAA AGCTT GTTTTCCTAAGATATGAAATGCCTGCCAGTATACA 

MCTGCTGTMCTACTTCCCTTTTTGCTTTTAGCGGGGAAAAAATAGCTTAATGACAGCA 

TAGAATCATGTAGTAAATATAATTCATTTTTTGAAAGGTTCAGCTATATCCTCTTCCATT 

TGTTTATTTTMATGATCTAATTGCAAACATGTCATCACTCCCTTGATGTTTACCTNCTT 

GTTATGCATTTTTAGCAGGCTTTATTGTCACC 

Sequence 1403 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 

ACTGAGACTCAGAATGTTTCAATTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 

TGGAAACTGAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCACI I I I I I I I I I I I HI I U 

CAATATTATTTATCAAAATAAATTTATTAAAAGTATTCAAAGACCACTTCAAAGNGTAGC 

TGCCTTCAAGACAGATTTTTGGCACTCATMCGGACACTGCAGTTTTCAACACCATAGCA 

CTCATTCTATTTCACACATCA I I I I IAACAA 

Sequence 1404 

AGGTGTTAGTTACCACTTCATTACTGGAGGGCACTGTCACAAACTTCTGACTATCCAGAC 

TTGAAGCTGGAAGCAAATACAAGTCTGAGGGGCTGTAAGCTGGGAGGTTCTGGCCTCTCC 

CTAGCTCTCTATGGCTCTACCTCTCTGCTTGAAGCTCCCTGCACTGCACTCCCATTACTC 

TGACTGGGGATAGGACCACTGCTGACAGGGCCCCACCTTCAACTTCTTTCATTGCTCCTC 

TTTTCAGGAAATCCCCACCCTGGGGATACTTCAAAAGACCT 

Sequence 1405 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGAACTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAAAG 

GGCCAGACACAGTGGCACACGGCTGTAACC 

Sequence 1406 

ATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGTCCCTAAATATTTAAC 

TGTTACTTGTAMCTTGTGTAATTTATTATTTATTTTAATCAMATTCTGAATATTTCAT 

TTAAATGAAAGTTGCAAAAAAAAAAAAAAAAAAAAGTGCGGCCCGGCCGCCCGGGCAGGT 

ACATACCTCCTTGCACAAATGGAGGGGAATTCATTTTCATCACTGGGAGTGTCCTTAGTG 

TATAAAAACCATGCTGGTATATGGCTTCAAGTTGTAAAAATGAAAGTGACTTTAAAAGAA 

AATAGGGGATGGTCCAGGATCTCCACTGATAAGACTGTTTnTAAGTAACTTTAAGGAC 

Sequence 1407 
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AG GTACTCA ATCTMTCCAAAATTTTCTTTCTTAGCAATCTATTTTCTGTATTTAGAAAA 
ATGTTTTTTATTTCAAAGAGCCTCTCAMGAGCATTTC 

CTCCACCTCCAAGGGGTCTGTCTAGATCAGTGCGGACGCGTGGGTCGAAGACCTGCCCGG 
GCGGCCGGCCGCCCGGGCAGGTACTGAGGACAAATCAGTTCTCTGTGACCAGACATGAGA 
AGGTTGCCAATGGGCTGTTGGGCGACCAAGGCCTTCCCGGAGTCTTCGTCCTCTATGAGC 
TCTCGCCCATGATGGTGAAGCTGACGGAGAAGCACAGGTCCTTCACCCACTT 
Sequence 1408 

AGGTACATATCACACATTTCCAAATTTGAGACCACTMTGTTTTTTMTTTCAAATATGT 

ATATAAATATGTATTCTTATTTCCMTTATTTCCTTGGCATGAATTCCTAGAAATTGATC 

TATTTAGTATAAGTGCTTTTTTAGCTATATGTCCACTAGTATGGTATGAGAATGCCCTGT 

TTATGCCAGTATTATCATCATTGAATATATTACTGCTGATGTTGTGGTAATACATTTAAA 

CCMTGTGATGGGGCAAAAAAATTATATTTTTACTTACATCTTTAAAATTACTGGNGATC 

TCTG NTATTG AC AAGCTGGGC ATAN AAAAAGTAAATT AATAG AATT 

Sequence 1409 

CCGGGCAGGTCGGACGCGTGGGTCGAAGCTTGTACAAAAACCCAAGTATCACCTGAATTA 

CAATTATTCTTAAAATTTGTCCTTAMTAGCTTACTCTTGGMGATTTGTTTCTATGTAG 

ACATTATGGTAAAAGTTACTCTGAMCTCTTTTCT7TAGTTATCTGTTTATTCTGAGCTC 

AACAAGATTGAAGTMGTTTTCGGGAGCTACAGAAATTAAATCAAGAAAAGAATAATAGA 

GGATT ATATTCAATTGAAGTGCTGGAGCTCTTCTGATATTATCAATTCTCCTTCATAGAC 

ATTTTATAAAGCTCTTTTATGTGAACTCTTGCTTCATCCAGGCAAG 

Sequence 1410 

AGGT CTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTMGTATACTGMTTTCTGTTAGCTTAAMTGTTAATTCTCAGGAATGATTTTCTC^ 

CACTTTGTGTTGGCTMTAATAAMGCACTGTTTTATTCTCAAAACTCCTTTTTCAAAAA 

TTAGGGAGAGAGCAGTAGTGATCATTTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAG 

NGCAGAGAANCAATGGGAGTTTTGATGCCAAAAAGG I I I I I I I I GCAGTNAAAGTAAAAA 

TTTGGMTTAGTTGGCATTATAGAGGAACCCTTTTTTGTACCTGGCCCGGGCGGCC 

Sequence 1411 

AGGTGATTCAGCAGGTCTGGGGTGGGACTGAGAGCTTGCATCTCTAACAAGCTCCCAGCG 

AGGCTGATCCTGTTGCTCCAGGGACCACACCTTGAGAACCACTGGTTGGGCATTGATGAG 

GTCAACCAGGAGAAGCAGTGTCCCCTAGMCTGGCAGGAGAGAAAGGACAAGGCTAAGAA 

ACAGTGAACAGGAGTCAAGTAAATGCAGCTGCCAACAGGCGGGGGTCCTTGAGTTCACAT 

TCTTGGTTCCAGGTGACGTTTCCTGGGAGTCAACAACCCTTCTCCTATGAAAAAGAAA 

Sequence 1412 

CCGGGCAGGTGCCTAATATATTTACTCTCTGGTCCTTTACAGGMAAGTTTGCCAACCTC 

TGGCTTAGATGATCACCTGAGGCCAAGGAGCCTCGCCCTTGAGCACAAGACTATGTAGTC 

AGTAAAGCACAAACAAAATTGGGGCTTTCCCTAGCAAGGTTGGAAAGGCGGAGAAGAAAT 

GGATTTGGATAGGTAGTCMCAATGTCTGTTTTATGTTACCACACATTTTCTCGAGAAAT 

TTCAATCAGCTCTCTGAGAACAGATTCATCTTTAAATGAATGTTCATAGGTAACAGCAAC 

TCATGCATCAATGTTGCAMGTGAGCTCATTTTCACATTGCTTCAG 

Sequence 1413 

AGGTCAAGCTTCGNTCCACGCGTCCGGGAAAAACGGGGTTACTAGTAGCCGCCCATAGCC 
TGCAACCTTTGCACTCCACTGTGCAATGCTGGCCCTGCACGCTGGGGGCTGTTNGCCCCT 
GGCCCCCCTTTGGTTCCTGGCCCCTTAMNAACAGGCNGGTTTTATTAAACCCCAANNNN 
CCCGGNTTANAAGGGGAATTNAAAAAAGGGCCCCGGCTTTNGNAAAAAAAAAA 
Sequence 1414 

NCNGNCCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGNGTAGA 

ACANGGAGTTTTGATCTGTGTTCAACATGATTGCGAACCATCAATTGAGATAACTCACTA 

CCTTCAGGCCAGCCAGNTACATACTTTTGAAAAGCCAAGAGTGAAGCANGGTTGATNTTC 

ATCCMTTCTTGNNCTTTTTGTTAAAGGCANNAATAAGANAGGGTGGNTNCGGGCAATCA 

CTTAGCTAA 
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Sequence 1415 

AGGTCTTCGACCCACGCGTCCGTTTTAGATCCAAATAATGACTCATTAAATATAATTATG 

TTTTAAGTATACTGAATTTCTGTTAGCTTAAAATGTTMTTCTCAGGAATGATTTTCTCA 

CACTTTGTGTTGGCTAATAATAAMGCACTGTTTTATTCTCAAAACTCCTTm^ 

TTAGGGAGAGAGCAGTAGTGATCATTTATGTGAGCCCCTTTGAAATGATGGTGTCAGAGT 

GCAGAGAAACAATGGAGTTTTGATGCCAAAAAGGTTTTTTTGCAGTAAAAGTAAAMTTT 

GGMTTAGTTGGCATATAGAGGAACCCTTTTGTACCTGCCCGGGCGG 

Sequence 1416 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 

CAGTGCCTGCCACATACTAAGTGTTGGATMGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

AAGTGCNGGCCGGCCGCCCGGGCAGGTCAGATGATTGCAGAATTTATGTGATTACTGGGT 

ACTCTMTGGTAAGGAGAAATTAAGACCAGCTAGTTGTTMTCTTMCTTTTAGTCATTA 

AGGAGAATTTCCAAGACAAAACTGCAATCCAGCTGCTTACCTAGGAATACGGCTTAGGCT 

GAAAACTTCTATCGTCTTAGAAATGGGA 

Sequence 1417 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAATCCTGTGAGAAAG 

ACAGGACAGAAACCACTGTGCCTATTTTACAGATACGAAAACTGAGGCACAGGTAAAGGG 

GCTTGTCTGTAGTCCCATAGCTAGCAGATGGCTGGAGCCAAGACTGAGGCTCGTTCTTCA 

ATGCTGAGCCAGGGCTCCTTCCGCTGCACCACAAGAACGCTAGACCACTCGCCACCAGCC 

TTCTCATTCCCTCTTCCTCCATTCTAATCATTTCTAGCTGGCTGGCCTCCACAGAGCATA 

GGAAAACAGCCAGGGCCGGGCACGGTGGCTCATGCCTGTAATCTCAACACTCTGGGAGGC 

CGAGCCGGGTGGATCACCTGAGGTCAGGAATTCGAGACCAGCCTGGCCAACATGTTAAAA 

CCCCATCTCTACTAAAAATATAAAAATTAGCCAGGCATGGTGGCGCACACCTGTAATCCC 

AGCTACTCAAGAGGCTGAGGCAGGAGAATTGCTTAAATCTGGGAGGCGGAAGTT 

Sequence 1418 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCCGGNCCGAGGNNCAAAAGAGAGACAAAAGG 

GTTCTCTTGGAAACAAGAAGAGTGACTCCAGATGTGGCCTGAATAATTGCCATGTTAAGT 

TAATGCAAAAGATCAGAACAGGGCTACATTTGCACAGGCAGTTTCTCTCCGGGCCGTAGT 

TTTCACTGATGATCACCTTTCACAGCATTTTCCCCAACCAGCATTTCACTTAGTCTTCTC 

TATACCCAGCACCTCCCCCGGCACCCCCGGCAAGCCCACTATCACTTCCGACTTCCAACG 

TGGCATCCGTGAGATCTGTCCACATTAGGCGAAGCAGGAGAACACTGAGAGCAGCAGGAT 

GGGTTTGGAAAGAGCATGCCTCTGGAAACACAGCTTCCTGGGAATTCACATGAGGCCAGT 

CCTACAGAGAGCAAGATGCACCCCAGGATTTCTTCATTTTCTAATAGATGTGGGAGTGCT 

CCATTTTCCCCGACAGCGAATTTCCCCTGAGAAACGATACTAGACCCTGGGTTTGCCCAC 

CTTGTMCTCTTCCTTATCTNCTCCTTTTCATCCCTAATTCA 

Sequence 1419 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 

CACCACACTCTACAGAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1420 

CCGCGGTGGCGGCCGAGGTACACTGTAAATAGCCTTTACCAAACGTGTTTGACAAGGACC 

ATAATTAACATCACTTAGTGAATTGTGATAMGAAAAAAAAGCCATGATTTATTCGATGT 

GATTGGCTTGTTTTTATGTGGCGCCAAGAACGAACCTGTTTAGCAGCTGTAACCAATGGT 

ACGCGGGGGAGGCGAACAATGGCGGAGCTGGGCGAAGCCGATGAAGCGGAGTTGCAGCGC 

CTGGTGGCCGCCGAGCAGCAGAAGGCGCAGTTTACTGCACAGGTGCATCACTTCATGGAG 

TTATGTTGGGATAAATGTGTGGAGAAGCCAGGGAATCGCCTAGACTCTCGCACTGAAAAT 
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TGTCTCTCCAGCTGTGTAGACCGCTTCATTGACACCACTCTTGCCATCACCAGTCGGTTT 

GCCCAGATTGTACCTGCCCCGGCCGCTCTAGAACTA 

Sequence 1421 

CCCCGCGGTGGCGGCCGAGGTACTTTGGGAGACCACCCCCAGCTATGGTTCCATACACTT 

ANACTGCGCCCAGCTACAGNTTNATACACTTNGGACAAANTATCTGATAAAATAGAGAAA 

AAMTCTTATTTACTATAGCATTACATAATAAATTTNTGAGAAAAAAATTAACCAGGGAT 

GTAAA AAACCTTTACMTAAAAAATAMTAAAAAAGGMGATCCAAATAAATTTTAAAAT 

ATTTTATGTCTTTGGATTGAAAGAATAAATATTAATAAAGTGCCATATTATCCAAAGTGA 

TCTATAGATTCAATACACTTCCTATCAAAATTGCAGTATTTTTTTCACAGTAATGGAAAT 

TCMTTCTAAAATTTACATGAMCTAAAATAMCTTTGAATAGCCAAAACAGTCTTGAGG 

AAMGGAACAAGGCAGAAGMTATCATACTTACAATTTCAATCTATATTTGAAGACTTTA 

TAGAANTAAAA 

Sequence 1422 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATCCAGCTAATAGCAAAC 

TTTGTGGATCCCMTCCGAGATTTCCCTTGCTGAAAGACAAGAAAGTATCTCATATAAAA 

GTGCTGTAGCAAGTATTTGTATACTCCAGAAATAAGCTTCTGTAATTCTTAGCTGCCAAT 

GTGTTCAGGCGTGATGACTCGGTTTCTGTTTCTCTGAACATCAATACTAGGGTCTGTATA 

ATTTCAATGCATGCCACCAGCTTCATCAACCCTT 

Sequence 1423 

AGGTACAATCAGAATGCTGCATTCTCCAGCCATAAAGATCGCTCCCTCTTCTTTTCAAAC 

ATCCCTGTCCCTCAAGGTCTAGCTCAAGACGGTCACCTTAAGAAAAGCTCCCTTTGTCGA 

GCAGTGACTCCATACCAGGCCCTGCTTTAMCGCTTTATCTGCATTATCTTACTTGATTC 

TCGCAATAGCCCTGGGTGGTAGGTGCAATTATTATCTCCAGTTTATAAAAGAAGATACTG 

AGGGTCAGAGAAGTTAAGTGACCGGCTCAAGGTGTCACATTCAGTAAGCGTTGAAGGGGC 

CTGTGTTGGTCTGTCCTTGMGATGCCCCCTACCGACTACACTTTCMTGATTTTCTGCC 

TTGAACCTGGCCCCATGACTAAA 

Sequence 1424 

NNCAMCCTCCTATGCTTTCCTTGGCATCGGCTACACATCATAGTATTCATTGCCTCCTT 

GAGGTCATCTTGCAGCTTGGACAGAAACTCATTTACTGACCGGCTCAGCTCATTCTCTGC 

CATTCGTTTCATCTCATACTCCTTTCGCTTTTCAGCATTGCTGACAATGTCCCAAGCTGC 

TCGCAAAACCTTGAAGGCCTCCTCAGCCCGGGGATGATGATTTTTGTCAGGATGAACCAT 

CACTGCCAGCTGTCTATAG 

Sequence 1425 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGAACTGATTTCCCC 

AAATGGCTCTTTGTGCAGCCGCAGCAGCTGTTCAGACTTATGAGCAGACAGGAACTGCCA 

GAGGGCMTGGTGTGTTTTMCTGGCATCTGTTTMGGCCTTTAACACGTGAATCGTCTG 

ATCACCCATTTGCAGGATGTCTTGAGTATACACATTCAGCTGCATGTTTGGATCCCCACC 

AGCTGTGCTCAGAAACCCCAGAGTGACTTCTACGACAGACAGCACTTCACAGGCATCGCT 

GTAGGACTGCAGCTGTCCACTGATGGCACTAATGACCGAGCTGGGGAGGGAGTCCTGGGA 

AATGAAAAGCAGGAGAGGGATGTCTGTGGGCTGGGTTTCTGGCATCTCACCACCTGGTAA 

GAGAGCCGAGCCCCTTCACTGCCCAAGCCACATGCG 

Sequence 1426 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGGGCCAAGGCTGCAAT 

CAGTGATTCAGCCGACTGCTCTTTGAGTCCAGATGTTGATCCAGTTCTTGCTTTTCAACG 

AGAAGGATTTGGACGTCAGAGTATGTCAGAAAAACGCACAAAGCAATTTTCAGATGCCAG 

TCMTTGGATTTCGTTAAAACACGAAAATCAAAAAGCATGGATTTAGGTATAGCTGACGA 

GACTAAACTCAATACAGTGGATGACCAGAAAGCAGGTTCTCCCAGCAGAGATGTGGGTCC 

TTCCCTGGGTCTGAAGAAGTCAAGCTCGTTGGAGAGTCTGCAGACCGCAGTTGCCGAGGT 

GACTTTGAATGGGGATATTCCTTTCCATCGTCCA 

Sequence 1427 

AATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGAC 



# 
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AAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTC 

ACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTG 

GACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGG 

CTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTAT 

CAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTA 

CCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAAT 

ATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCMGAGTTATTTTTCTGAC 

TGTCAAGTTTCAACATTCAGGGTCTGTCCCCAACAGGCACCAC 

Sequence 1428 

AGGTACAAATAGATACCTTCAAGGAGAATGAAAACGGGGAATATACTGAGCACTTACACT 

CGGCCAGCTGCCAGATCAAAGTTTTCAAGCCCAAAGGTGCAGACAGAAAGCAAAAAACGG 

ATAGGGAAAAAATGGAGAAACGAACACCTCATGAAAAGGAGAAATATCAGCCTTCCTATG 

AGACAACCATACTCACAGAGGTAAAAAGATTTCTTTTGGTGACAATTTCAGTTCATAATT 

TTTMTCTTAAAAATTCATCACTTCCAAACTGGTCAGMTTTACTTCTCCTAAGCCTTGA 

GGGACACAGTATCACATGGATTCTGTGTCCAGCGGCCTTAACAGGAAGATTGCTTTAGAA 

TTTGGCACGAACCATGCCACTGTCTCTGT 

Sequence 1429 

NCNGNCCAGGTACTCNNNNACANTGNAAACTNNTCANGNGCCCATCATTGCTGGATTTGT 

ATTTAACATTATGTTTCACCCAGACAACAGCTCAGAGAACTGGGCAATGGCTGCTNATGT 

GTTGAGCCGGGGCATACAGGATGAAGAGGGACAATGAGAGGGAATGAATTCTATTCTANA 

CACCCTGAGTTTGAGGAACCTATGGAAATGTCCAGGAGGCAACTAAATGAAACAGCCTGT 

GGTAGACAGAATAATGGCCCCAAAGATGTCTACAGCCTAATCCCAGGAGCCTGTGAAAAT 

GTTCCCTTCGCATGGTAAAGGGATGTGGCAGATATGATTAAGCTAAGGATCTTGAGATGG 

AGAGTTTATCCAGGATTATCCAGGTGTGCCCAGTATAAT 

Sequence 1430 

AGGTACGCGGGACACAGGGTCCTGTGCAACANGGNGGACTAACAGTAACACCGCCACGCC 
GGCAGCAAAGCTCATTTTGGTCCCCGCCCCGTTCCTCTTTCTCI I I 1 1 AACTCCTTCCCT 
CTTTGCGGATTCTAGAACGGAACCTTTTTTTAATTCTTCCCAGTAGAAACGTAGGAACAA 
TTTCGTGAACGCAATCNGGAGTGCCCAACATGGC 
Sequence 1431 

AGGTACCCCTGTTTAACAAGGGGTAGGGGCCTTCTGAGACTGTTTCCTCTACAGAGTAAG 

GGTTCGTTCAGCCTTTTCCGTGGCCTGCCAAGAACTCAACTCCATGTTCCCTCACTTCCT 

GTAATTGACCTTGTCCAGGACTTTCTGACCTTGGAGAATTCACCTTTGCTCTTTCTGCTG 

CTTCGTGCATTCTTCCACCAAATGTCTTAACTGACTGGGCTCCTTTCCAAACTCAAGGGC 

TTTGCCAAATGCCACCAGCTCAGGGAGGCCTTTNCTGGCCATGACACTTGAAGTTGCAAC 

ACTCCCCCGCAGTCTCCCGTGCCCCAGATGTAAGTTCCATGAGGGCAAGCCCTGTGCTTT 

TACCACCATATCCCCAGCATCTTGAGCTGTGCCTGGCCCAAGAAATATTTGTTGAATGAA 

TGAATTTAAAAGGGGATATTCATGANGGCTTACACATTCTCAATGGGTT 

Sequence 1432 

G GGCNAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGCAGGTAC l I I I I I I H ICTTTTT 
CI I I I I I I I I I I I I I I IAAGTTTTTGGGATCGTGTCTCACTCCTGTTGCGCATGCCTGTA 
GTCCCAGCTACTCAGGAGGCTGAGGCAGGATAATTGCTTGAACCCGGGAGGTGGAGGTTG 
CAGTGAGCCGAGATCATGCCACTGCACTCCAGCCTGGGCAACAGAGTGAGACTTTGTCTT 
CGGAAAAAAAAAAAAAAAAGATTTGGCGGATGAAAATAACCAGAATGAAAATAGCTNGAA 
MCTCANCMGCAGGAAGCTCCCCTTCTCACCCTTTTGTTCCCTTGCCGATAGAATCAGT 
CACTATTAGAAAAAATGAAAGACGCTCTGTTTAAAACAATGATGACAGCAGTACCT 
Sequence 1433 

GCGG TGGCGGCC GAGGTACTTCCCTTTTAAGAGATGAGTCACCGCAACTGAAACTTCTCT 
ATTTCTTTTCTTTTCTGATTGTTCTCCAGMTTAGGACTAGTAACAGTCCTGMNNCTTG 
TNrTTTCCTTATCTAGAAMCTCAGTATCTTCCCTTCCGTTTGTCTTAAAATATTAGTACA 
CGCTTTCTCAAGCCTAGCCGATTAGAAGGGGCTGCCGGGCTTCCACCACACCTCATCGAG 
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GNAATGGTTTTNTGGNNAAAAAGCCCATGGAAATACTGAGCCCATGCCNCTCACGTTGNA 

AAAGCCCCGTTCCTTGCC 

Sequence 1434 

AGCTCCCCGCGGTGGCGTMCTTATCTCATTTTAGATNAGTTTGCAAAGAGAGTTGGTGG 

CTMGGCCATAGCTTAGCCTCCTGACCCCTACCTTCCCACGTTCTTTCCAAGAGATTCTC 

CTCAGGAATAACACTTGCMGGGAGTTCCTGATGAAGTGGATTCTTGTTATTCTAGGAAT 

AGGCCTACATGGTGCACTGGCAATGTGAGATTATACCTCAGCATTTTCAAAGAGCATAAA 

MTC TAGAGC TGGGGGGTTTAAMCATGACAAACCTMTTTTAAGTAGGCAGACAAATAT 

TTAMTTTTTCCCCTACCCTTGTTTCTACATCGGTCCATTCAGACTCTGCACCATCTGGT 

TGGGCAGGTGCTACTGTGGMGATCTTCGTTTTCGACTACCATTTGGTGATTCTTGCTTT 

MAGTCTCMTATCAGTAAACTGAACAGATTNCCACCACCCCTTGTTTTATAAATATCAC 

CCTTMTTAGTTTAAGTTTCAATCTCCCCATCGGAGGCTAGTTCCTGGTGGGTCAGCATG 

TACCTGCCCNGGCNGGCCGTCTANAACTAAGTGGATCCCC 

Sequence 1435 

TACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTN l I I I I 
n n I I I I I I I I 1 1 I I IGGAGCCAAAATTGTGTGCATTCCTACTGGGAAACACAGTGGCC 
MATCCTTTTGMTTGTTTCCTTCTAGAGACTTTAACTCTTCTGACTGCAAATCTTAGTG 
TCCTGTGAGTATTAGTTGATTAATTATACTTGCTGCTTAGTGAAATACAGCCAGCTATAG 
GTATCTTCTGGAGTAGCTCAACACAACTTTTCTCTTGCTAGAGTGACTCTTGCTAACAGA 
ACCCAAAGATGCGCACATATACCCACAGGAGCTGGAGGTCCCTCGCATGCTCCTCTCGTG 
CCAGCCTTTGCCTTACCCTTCACTCTCTCCCTCCAGGAGCCGTCGGTACCTCGG 
Sequence 1436 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGATTTGTCTTGCTC 

ACCTCTTACCAAATCCAAGAATGGTTTCTATCCAGTGAATGGCAAATTCATCTGATAGGT 

MGGGAATAATGGGTCAAAATGGTAGCAAACACTTCTTTCAMTTTCTACTAAAAGACTT 

GCTGTTGTTTTTCTTATAAAGGGGCAATTTCATCMCATACATCTTTTAAAGGAATCTCT 

AGAAATTTGAGTGACTTTTTGGCCATAATCCTGTTTGATATATTTTTGGTCAGCTGCTCA 

MACAAACATTCTCCTTGTAAGGTTATCTATCTGAAAGATACTAATTCATTTAAAGCAGC 

TGCAGGTGAACAACCTAAAGATGACATGATTTGGGAGAAGAGGAAGGCAGATTACTGAAC 

TGACAAGTGACCCAAAGCATAATTAGGTTTGTGCACATGGTAGCATGGAGGTTCCACACC 

TACCTTCTACAGCGTATTAMATAAGAATATTGTCTTTGAAACATCTTCTAGCACCTTTT 

TAATAAAACAMATTTCCCATCTTCMTTCTATTTTTTTCCCAMTCTACCTTTAAAAAA 

TTGT 

Sequence 1437 

CCGCGGTGGCGGCCGAGGTACAATAAACAGGGAATGAGAACTATTTACATGGAAGTTTCT 

TTCTCATGATGCGGTGGAGAAGCCTCGGCCACTTGGTTCTGCCAGATGTTCCTGGGGTTA 

CTGTAAATGGGMGGACAGGCAGAGCTMACAAGGTAGGAGMTCGCCCCCCTTTTTTGA 

ATGTTTAAAGAGTTTGCTGCAGTATGCTGCATTCCATGTGTGCTGCTTACGGGAGCCAGG 

GAAACTGGGATTCCACTMTTCAATTGTMTACTTGCGGGGGACCCTGGAGTTTTACGTA 

ACATTTTGATTTGGGAAAAAAAAAAAAANAAAAANANTGTTCCTGCCCGGGCG 

Sequence 1438 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCAGGC 

ATGCTCCTGCCTTGGGCAGAGTGTATATGTGTGAGTGCACCTGCCCTTCCCACAGCCTAG 

MCGTTCCTCCTCCAGACAGCACATATGGCCTGCTCTCTCACTTCCTTAAGGTCTTTATT 

CAAAAGTGACTTTCTCAGTGAAGCCCTGTCTGCTCACCCTGCGTAAAATTTCAGCTCTTC 

TTTCTATCTCTCTTCCCAGA I Mil! I I CTCCTTCATGTTCGTTGGTGTCTAAGGTTTAT 

CATCTATTTCGCTAATGGTCAGTAGAATGTAACCTCCACGTMGCMGGAGTTTTGTCTG 

TTTTGTTCATGTCTATGTCCTTAGTGCCTGGAGCATTCCCTAGTATGCAGTAGGTGCTCA 

ATAAATGTCAGTTGGATTAATGGCTGAAAG AAAGG TCACCGCTATAAGGATGGAGTCAGA 

GAACAAACACAGTTMTTCCTGGTCCACTGTTTTTGCTTCCACTAMTTGTATTTGGTCT 

ACGGCTTCTCCGC7TGCCCTGGAACCCTGCTCAGAACACTGCTCCCTTCTCC7TCTTCTT 



# 
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' CTCCCTCCGG ATAAATTCT 
Sequence 1439 

CCGGGCAGGTACCGCGGGGGGCCGTGGTCAGAGCGAGCTTCGGAGAAGCAGTGGTGGGTT 

CCATGTGATGGTGGAGTAGGAGGCAGGTCTCCGCGGTTCATCTGTGTTGCTCTAAATGAC 

ACTGtTTCATTATTTTGATGGCTGGAGAATATTTCCTAAGTGTATGTATATGAAGAAGTT 

TCTTGATCTCTTTATCTGTGGATGAACAGCTACTTTGAAACATATGGTACCTCTGTGGTC 

AGACCATTTGCCAAGCTTGTGAGGCCTCCTGTTCAGGTATACGGTATTGAAGGTCGCTAT 

GCCACAGCTCTTTATTCTGCTGCATCAAAACAGAATAAGCTGGAGCAAGTAGAAAAGGAA 

GTTGTTGAGAAGTAGCACAAATCCTGAAGGAACCCAAAGTGGCTGCTTCTGTTTTGAATC 

CC 

Sequence 1440 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTACGCGGGTGCT 

TTCATAGCCGCGGACACAACTTGGGCCCACAGTTAACCGAGGAGGAAGGCAGAGCGTGCT 

GAGCAGAGCACCAAGGAAGAGAGCTCGGCTAGCCGGGAAGGTCCGAATGGAtTTATTTGG 

TGAGGCCAAGGAACCCACTGCCTCCACGGTGTTCCTCCAGGAGGCTCCTCCCTTCACTAA 

GGCAGCAAGGAGTGCACTGGTGAGGGGAAGAATTGTTGTCATCCTTTCTCTACGACTCCA 

AGAGAAACTTTATACTGGAGGAAGAATATTCTACCACTTTGGGATGCTTCCAAAGAAATG 

GGATACCAAAGGAGTGAACTTCAGGTCATTGGAAGTGGCCAAGCTGGAGGCGACCTATGG 

AGACATGACCTTCAGTTCTTCCTGAGATCGATAAAGCAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 1441 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGATATTGTCAATGCGCAACATGGAGAG 

ACTTTAAAACAAATGCTAGGGATTAGAGTATAGATCAGATAGCTGGCAAATCTATAGGAA 

GGGA MAGTMTTTTAAACACACAGCATTTGTTTCCTGCTGCTCTATCACAATAGCTAGG 

TTTTTAAATAAGTAGGCTTTATACCMGCCATAAAAATGAATTGCTGGGGCTCTTTGGGA 

CTAGGGAAGGCGGGAAATTTTAGATATTGCTGTTGGCTTAGTGAAAATGCATGCTTACCC 

GGTCACCTGTGGCTCCAGCAGGACCAGGGGCACCTACAGCACCAGGAGCACCCTAGTACC 

T 

Sequence 1442 

ATA GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACCTTATTACATA 

TGATTTTTA7TAGTTTCTGGAGGCAAATTTAA I I I J \ ATTTTAAAATCA AAATCTATTTT 

AAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTTAAATAATAAAAAATTTAA 

Sequence 1443 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAGGGTGCTCCTGGTGCTGTAGGTGCCCCTGG 

TCCTGCTGGAGCCACAGGTGACCGGGTAAGCATGCATTTTCACTAAGCCAACAGCAATAT 

CTAMATTTCCCGCCTTCCCTAGTCCCAMGAGCCCCAGCAATTCATTTTTATGGCTTGG 

TATAAAGCCTACTTATTTAAAAACCTAGCTATTGTGATAGAGCAGCAGGAAACAAATGCT 

GTGTGTTTAAAATTACTTTTCCCTTCCTATAGATTTGCCAGCTATCTGATCTATACTCTA 

ATCCCTAGCATTTGTTTTAAAGTCTCTCCATGTTGCGCATTAACAATATCCTAATGCACT 

GAGGCTTCTCAAAGCCTTCMTTATTACCAAAAAAAAAAAAANNTTTNNNAGGTACCT 

Sequence 1444 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACCAAT 

TCCTTCCTAACCTGGATTCCACTGTCCTTGGTGAAAACTACTTTGATGGAACCTACCAGA 

TGCTTTATCTTTTGGTTAAAGGAACTATACCTGTNGAAATTCACACTGCCACAGNGATAT 

TTGTTTCTTTCCAATTATNTGTTGCAACANMGATGACTTTTATACCTCTCACAATCTGG 

NTAAAAATCTTGCCTTGTTCCTAAAGATACCAAGTGACAAAATCCGTATCAGCAAAATAA 

GAGGGAAGAGTCTGAGGAGGAAGAGATCCATGGGATTCATAATTGAAATAGAGATTGGAG 

ACCCTCCTATTCAGTTCATAAGCAATGGCACCACAGGTCAGATGCAGTTATCTGAACTCC 

AGGAANTTGCTGGTTCTCTTGGACAAGCTGTNATTTTAGGAAA 
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Sequence 1445 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAGAGTCACTCCTGCC 

TTCACCATGAAGTCCAGCGGCCTCTTCCCCTTCCTGGTGCTGCTTGCCCTGGGAACTCTG 

GCACCTTGGGCTGTGGAAGGCTCTGGAAAGTGTAAGTTGGAGTCACTCTGGTCTAATCTG 

GGCTGCAGGGTCAGAGGTGGGGTCTCCTTGTGGTGTGGGTGTGTCCCCTTCTGTAGGCTC 

TGATCCCTCAGCTTAGTTTCGGGAGACCTCCCTGAGGGTGGAATACATGTCTGGCTGGGC 

TCCMGGTTTGTGTGGCAGTTTGAGCTTCTGGAAATGCTTCCTCTATGCAGCCATGCTGT 

CAGCCCAGGTCCCACTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATACTNTGCCT 

TCTTTCTTTCACCTGTCTGCGACTCTTCAAAAAAAAAAAAAAAAAAA 

Sequence 1446 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTAATCCAAGACAA 

A AGM TAGCTCCTTTCAGAGTTCTCATCATTTCTTCACCCCAGCCTTAATGGGTATATTC 

CTnTCCCCAGGTGTCCAGAGTTTTCCAGAATTGTCCCAAGGCTCAGTCCACACCAGTTC 

TTCTCCAGTGTGCTCTCCTGAGAGGCCAGGCACACTCAACAATTATCTAGATGAGTTCCC 

ACTTCTTTCCAAGTGMGGTTCTGCTATCCTAGCATAGTCAGAATAAGTTAAATTATGTC 

TCTTMGAGGCACTGTTCCACTCCTTTTCAAGGNGTGTGGCATATTTGAAATATGTGACT 

TAAAAGT CTAC AGTCTCTTACCAAAAGCCTTGGGCCAAATACATGTCAAAATTCAGAATT 

TTNCAGATTTTAGAAAAGTGACCCATATACCATACATTGCA 

Sequence 1447 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACA 

TCCAGGACAAGGTCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCT 

GCCTGGTCACCAACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCT 

CCMTTTGGACCCCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCAT 

TCCATTGGCTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCAT 

CAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCA 

CCAACCTACCATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACA 

AAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

TTTCTGAC 

Sequence 1448 

TAGGGCGMTTGGA GCTC CCCGCGGTGGCGGCCGAGGTACAAATTGTCGTTTTTATTCCT 

CTTATTGGGATATCATTTTAAAAACTTTATTGGGTTTTTATTGTTG^ 

ACCCTACAAAGAGCCTTCCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTC 

CAGC AAGCM A GTGC TTTGACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAAC 

TGTTCTTTTGCATTTTGCCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATT 

TATAGCAAGATATTGATCAAATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAA 

GTATAATCTTTC TATAG CCMCTGCTAATGCAMTTAMACATATTTCATTTTAACATGA 

7TTCAAAATCAGTTTTTCATACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGC 

AGAACCACA MCA ATTCGAATGGGGTAGAMCATTGTAAATATTTACTCTTTGCAAACCC 

TGGGNGGTATTTTATTTTGGCTTCATTTCAATCATTGAAGTATATTCTTATTGGAAATGT 

ACCTGCCCGGCCG 

Sequence 1449 

CCGCGGTGGCGGCCGCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTC 

ACCACACTCTACAAAGGCAQTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAAC 

TTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCC 

AGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATAT 

TCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAG 

GATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATT 

Sequence 1450 

CCGCGGTGGCGGCCGAGGTACAAATTGNCGTTTTTATTCCTCTTATTGGGATATCATTTT 
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AAAAACTTTATTGGGTTTTTATTGTTGNTGNNTGATCCCTAACCCTACAAAGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAMCCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGMCAAAACTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

AACTGCTAATGCAAATTAAMCATATTTCATTTTMCATGATTTCAAMTCAG 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAAACATTGTAAATATTTACTCTTTGCAMCCCTGGNGGTATTTTATTTTGG 

CTTCATTTCAATCATTGNAGTATATTCTTAT 

Sequence 1451 

CCCCGCGGTGGCGGCCGNGGNACAAATTGTCGNTNNTATTCCTCTTATTGGGATATCATN 

TTAAAAACTTTATTGGGTTNTTATTGTTGNTGTGGGNTCCCTAACCCTACAAAGAGCCTT 

CCTATTCCCCTCGCTGNTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTNCTTGCTTCAGTCATCAGCCAGCMGAGGGMCAAMCTGGTCTTTNNCATTTTG 

CCNCTGNGATATGNCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAN 

CAAATATNGAAAGTTGNTATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTMTGCAMTTAAMCATATTTCATTNTMCATGATTTCAAAATCAGATTTT 

CATACTACCCTTTGCTGGAAGAAACTAAAAATAT 

Sequence 1452 

CCCCGCGGTGGCGGCCGAGGTACAAATTGTCG I I I I I ATTCCTCTTATTGGG ATATCATT 

TTAAAMCTTTATTGGGTTTTTATTGTTGNTGTNGGGNCCNTAACCCTACAAAGAGCCTT 

CCTATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTT 

TGACTTCTTGCTTCANTCATCAGCCAGCMGAGGGAACAAAACTGTTCTTTTGCATTTTG 

CCGCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGAT 

CAAATATAGAAAGTTGATATTCAACCTCACANGGGCTCTCAAAGTATAATCTTTCTATAG 

CCAACTGCTAATGCAAATTAAMCATATTTCATTTTMCATGATTTCAAAATCAGTTTTT 

CATACTACCCTTTGCTGGAAGAAA 

Sequence 1453 

GAANCCCCCTTTNGACGAANANNCGCGAATCGNGAGCTCCACCGNGGNGGCGGGCCCGAG 

GGGGACNANGANTTTTCTTGNNCN I I I I I I I I NNAAAAACNGGNGACTATTTAATCCATC 

TAAAAATACAAATCAGGNAANGGGGGGAACCATAGGAAAATCCTCCACCTNTAACAAGAG 

CCGAAGNTACNGGGCTTTCTGCTTGCTCCAAANAAATCCCAAAGGGCTTGGATAGTTTGN 

GGAANGGGGAATTATCTGTGTCTTCAAACTAACTCCCAAGGATACCTCAAAGGACATTAA 

AGGTNTACCACCACCATTCTGGGGGAAGAAAAAGGGGGGTTTCTTGCCCCTTGCTTGAAA 

AGCCT TANAAAT NGGGGGAAGCCTCCAAATNGCCNTTNGGGGGNGNAAAAAGGGGNNCCC 

TNTNAATTTTTTNAAAAAAAMAAAATTGGGTTCCAAAAAANAACCCCCCTTTNGAAAGG 

GANCAAANGGGGGGGGGGGCCCCCTNCCCCNAAGGCNAAGGGGGGGGGCCCNNTTNGGGG 

GGGCCNCNTGGGGGGNNGGNNCCCCAAAAACCCCCCTAAACAAAANNTNGGGGGCTGGGC 

CCNTAANGAGGAGGGCCCCNCCAGAGCCCAAAANTTNGGGGTTTNNACCCCANNNAAAAT 

TTTGGGGGGGTN I I I I I I I I I ACCCCCNTTTNNGNNCCCCNCCCCCCTTGGGGGGGAAAA 

AAAACCCTTNAAAACCCCNTTNTTTNAAAACCCCCCCGGGGGGGGG 

Sequence 1454 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTTN I I I M I I I 

I I I I I I I I I I I ATACCAGATTTTAAGGAAAGACTGCTTGCTTCTGANAAAGAATTCTCGG 

AGTTGATTCTCTGCTCCATTTGCTCCTTTCTCAACCTCTTTAGTCCTCGTTCTCTGCAGT 

AAGAAGCTAGTGACAACTGGAATTTAGCTCCCAGNGGGCTTTCTTCCGGGTGGNGCCTGG 

ACAGGCTGCTCCTGCTGCTAAGGCTTCTGGAGCTGTTATTGAAGATGTCAGCTTCTGCCA 

CTGNGGTCTGCTTTTrGGAGTCTGCATTGGTTTTGCCTCGATCTCTATCATTCTTCTCAT 

TATTTTCATGAATGAA 

Sequence 1455 

CGAGGTACTGACCTCGTNTGTCCCTTCCCCTNCACCGNTCCCCACAGCTTTGCACCCCTT 
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TCCTCCCCATACANACACNNNCCATTTTATTNTTTGGGCCATTACCCCATACCCCTTATT 

GCTGCCAAAACCACATTGGGGGCTTGGGGGGGGGCCAAGGGGCCTTGGGCATGGGACCAA 

GGAC CACCTCCCCCTACCCATATCCCTCCCCGTGTTGTGGGGTTTNGGGGAAAAAAACCT 

TTTTGGI I I I I I I I GGGGGGGGTC I I 1 1 I I 1 1 I CCTCGGAAATTAAANAAAAAAAGGATT 

NCTTACCTACAAGAGAAANAAAGAAAAAAAGGGGTACCCTTGGCCCCGGGGGCCGGGGCC 

CGGCTTTCTTAAGGAACCTAAGGTGGGGAATTCCCCCCCCGGGGGCCTTGCCAAGGGAAA 

TTTCCGNATTATTCNAAAGGCCTTTAATCCGGAATNACCCCGGTNCGNACCCTTCGGAGG 

GGGGGGGGGGCCCCCGGGGTAACCCCCAAGC 1 1111 I GGTTTCCCC 

Sequence 1456 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGATAAGACCC 

TGAATGCCTCATTCCATTGGGTGGGCTCCACCTACCAGTTGGTGGACATCCATGTGACAG 

AAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGA 

ATTTCACCATCACCAACCTACCATATTCCCGGGACAAAGCCCAGCCAGGCACCACCAATT 

ACCAGAGGAACAAAAGGAATATTGAGGATGCGCTCAACCAACTCTTCCGAAACAGCAGCA 

TCMGAGTTATTTTTCTGACTGTCMGTTTCAACATTCAGGTCTGTCCCCAACAGGCACC 

ACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTG 

CCATCTATGAGGAATTTCTGCGGATGACCCGGAATGGGTACCTGCCCGGGCCGGCCGCTT 

CGGCTTTAGAACTAGTN 

Sequence 1457 

GGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGTCCAAATTTTAAGGAAAATGA 

GTCCCGCAATGAGTTTCCTCATGCTTCGCCTGTGCGTGGACCGGNCAGCTTCTGGGTGTG 

ACTGGAGCAGGGCTTGTCTCCTTCTTCAGAATCACTTTGCAGGGGTTGGCAAAGCCGCTC 

CCATCCACGTACTCTCTGGACACAATAATTTTGGCCTATTGCCATCAMTGCCCATTTTC 

CACTGCTGGAAGCAATGTCAAAAAAGGGCTGGCCCAAAAAAAGACCCAGAGCTGTCAATA 

CMCACTGGAGACAGATGCAACTGMTMACCCTGTTTTACCCAATTGCACTATTTGGTC 

CT 

Sequence 1458 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGCCCGGCAGGTACTGGAACAGGGATAA 

GTTCTTGGATAAGGNGCCMCATACCTATAAAAGCTGATTTTTGAGTAAATTATTGATTC 

TAACATATGTAATGGATTTGGTGTGATMTTTTCTGATCTTTAACTATMGTGACTTTTT 

ATTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAAT 

GGAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAA 

TAAGAGGAAGGMAATMACTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTG 

CAGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAG 

GTCAGGAGTTCGAGACCAGCCTGGCCAACATGGNGAAATCACGTCTCTACTAAAAATACA 

A 

Sequence 1459 

NGGGCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCACTC 

AGGGAGCTCAGATTTTGAGACAGTAGCTGGCCGATGCTCCCAGCTGAATAAAGCCCTTCC 

TTCTACAAAAAAAGAAAAAGAAAAAAGAAAACAGGATATCTGAAATTAAGACTGCAGGAT 

GGAGTAGTTTTCTGAAAATGACAGGGTCCAAGGTGTGACNCACCGGGACCAAGTGGCTGA 

ACTGGAATGAAGTTAAGAAGCCAGTAAGAAAACATCNCGGATAATATGGTGGATCAGTTC 

AACAGNAATGACATTATTTCACCCATGGTCCCCAAAGGGAGGGAGATGACTGGAGNATTT 

CAMTTCTTCMGGCAAGCCCTCMTGCCAGCCCAGAGGATTTTAANGAGGGGCCTATTG 

TTGTTCCCAGAAAGGAGGACTCTGNGGCCAAAACCGCCAAGAATGGGATTTCAAGAATTT 

ACTTCAAAATTCCTGTCAGGATTTCTTTAACCCTGGGTGGGGCTTATACCCAAAACCCCA 

AAAAAACTTTTMGCCAGCCTTCNTACTTTTTGGCTTAATTTTTCTCCCTMAGCCC 

CCCTTGGGCN l M I NI 

Sequence 1460 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGGCAGGTACTCAGAAATCCTTCCCCGAATT 
TACAGCACAGGCAGGATGACCTAAGAGGCAGTTTACTTCCCTGAGACCCACAGTTGGGCT 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

240/467 



GTTCTGGAAACACATCTGTGAATCATAGCCAATTGCCACAGAGAAAACAGAACCAAGCCT 

CCGGTGAGGCCACTCCACCCCAGAGAAGTCTGCAGAATTCCAAGGACTCGGATTGGATGT 

TCAGAATTCAGCAACTGGAAAGTCCTTAAAAACAAACAGGCCAAACCAAATCAATATTGC 

TGTTTCTAGATGTCCCTTCTGTGGTTGAGCTAGTTTTACAGAGATAAATATATTAAGACA 

AGGAGGTGGGGGTGTT ATATGATCAATGATAGCCATTTTGAAAGAGAGGGAGGAGTACTT 

I I I 1 1 1 1 I I I I I I I I I 1 I ICCAAGCACGTGCCACTTTATTGAATGACACTGTAGACAGGT 

GTGTGGGTATAAACTGCTGTATCTAGGGGCAGGACCAAGGGGGCAGGG 

Sequence 1461 

AGGTACGCGGGGCTCAAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTC 

CCCTGCACCGAGGGAACAACTCGAGCCTTTGAGGGCTTCCTCCAANCCTACCCACTTTGG 

CCCAGTCANACCAAAAAAGGGGAAAGNAGGCCTGGCTTCCCCTTGNCAACAAGGGNATTA 

ATTCCMAAAGNCCTTCCTGGTTTTTTGGNTGGTNCCTTGAAGGGNAAGGGNAGGAAAAA 

AAAAGNCCTTGNAACCTTCGGGGAAGGGGNAGGGTTCCTTAACAATTCAAAAGGAAATTG 

GGGGGGGGAATAAAAGGAAAAANGGGGCCAAGNCCTTGGTTTGGAAGGAAGGNAGGTATG 

GCTTCAAATTAATGGCCCCCCAANGGCTTAATTGAACCAAAAAAGGGTNCAAAAGGGGAA 

CAATTCTTCAANAAGGGGTGGGGTTCCACCCCCCCNTTCGGGGTTTTCCCCTTTTTGGTT 

ACCCCTTGGCCCCCGGNNNCCGGGCCCNGCCTTCCTAAGAAANCNTAGGGTTGGGGNANT 

NCCCCCCCCNGGG 

Sequence 1462 

GCGGGCAGGTACATGGATGGGAGCAGCTTCACCAACCCCTGCAAAGTGACTCTGAAGAAG 

ACGACAAGCCCTGCTCCAGTCACACCCGGNAAGNCTGACTGNNTNCCACCGCNACAGNCT 

GAA AGGCATT GAGGGNAAMCCTCAATTCNTCGGGGNACCTAAATTTTTTCCCCTTTAAA 

MNATTTTTTAAGTMCCTTTGGCCAACCMGGTAAAGGGGGAACTTTTCCAAAACNTNG 

AACCCCTTTCCCTCCAAGNATTTGNAAGGMACNTGTTNTTTCCCAGGTTAATTAATTAC 

CNATTTCCAAAAGGTTCCMCCTTTNGGAANGGGGTTAAGGGGGAAACCAANAAAAAATT 

TTGGCTTAACCAAGGTTCCCTNAATTAAAAA I I I I I AAAANGGGGGG I I I 1 1 1 I I I I I I A 

TATAAAAAAGGGNGNTGGTNTNANNCCCCTTTTNGGNGGTCCCCCC I I I I I 1 1 I I I I NAA 

GAAAAANCCNTAAGGGGNGGGGGAAATTCCCCCCCCCCCCGGGGGNNCCNTTGGGCCAAA 

NGGGAAAAATTTTTCCNGNATTTAATTTCCAAA 

Sequence 1463 

AGGTACGCGGGGAGGCATTGAGGCAGCCAGCGCAGGGGCTTCTGCTGAGGGGGCAGGCGG 

AGCTTGAGGAMCCCGCAGATAAGTrTTTTTCTCTTTTGNAMGANTCCCNCNTTTAAAA 

TAACCAAAACCTTACCTTTAAAAAAAAAATTMTTANGTTCCAAAATTAANGGGNTTTTA 

ACCTTNAAAAGGNMTTAATTTTTNGGCCTTTTTAMNNCCGGGTTT^ 

I I i I I ! I TT AAAAACCCGGTTTAAAAA I I I I I I I I I I AAAAATTAAAGGCCCT 1 1 I I AAA 

AGNNAAI I I I I I I I I I I AAAGGNGANGGAAAAAAAAA I I I I I I I I I I GGNANAAGTGAAA 

ANCTTTTTTTmTAMAAM^ 

NAAATGGGNGGGGTNNNANANAAGGAATTTAAAAMNTGGGNTNTTTTTNTTTAAAAAAA 
M ACCAA ATTTTATCCCTTGNTNAGGGGTTTTTTGGGNAAGNNAATATGGAAAAAANAAN 
CC I 1 1 1 1 NCTT 
Sequence 1464 

TTTGAGAAGCCAGCGCTCACCCACCCGGGGTCTCTGTGCATTGACCTTTGGGTGCTGACT 

TGGAGAAAAGCACAAACACGACCAGTCCCATCCTGGGCTCCCGNNGGCCGNCCTTTCTTT 

CCTMTNCTTACCGCCATTTTGGTATTCCGGACCTGGCCMTTTTAGGTTTNGGGACCTT 

MAAGNAATTGGMTTGGMCCTTCCMGTTTTTTAAAAAAGGGGGAAGGGGAAGNAACC . 

NMANATTNNCCTTNGAACCCTTGGTTCCTTAAAAGNCCAAANGGAAATTGGGGCCCCCC 

AANAGNCCCTTGNNNGCCAAAAAGGNAAAAAAAAAAAAGNNCCNAACCCAACCTTTGGNN 

CN AATT TAACCCNAATTTAAAGGGGGGAAATTACCAAGGAAAAGGGGGGNCCAAGGAAGG 

CCTTTT CTTGGGCC CCTTGNGCCGGGGGGAATTTCCTTNGGCCAAAAACCCMTTTT^ 

ACCCAN I I I I I I | IGGG 1 1 I I I I I I I GGGCCCCCTTTGGCCCAANAAAACCCTTAACCCC 

CCCCGGGCGGGTTNACCCCCTTTCGGGGGGGCCCCNGGCCTTTCCTTAAGGNAAAACCTT 
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AAAGGNNTTGGGGGGNAATTTCCCCCCCCCCCCGG 
Sequence 1465 

AGGTACTTTATTTTTC I I I I I M I II M I I I GTGGATGGGGACTTGTGAATTTTTCTAAA 

GG TGCTAT TTAACATGGGAGGAGAGCCGTGTGCCGGNATCCAGCCCAAGCCCNCCCGNTT 

CACTTTTTCNCACACNCTTTN I I I I I CMCCCTTGGCACCTTCTGGGGCNTTTACTTTAA 

MGNGGNCCCCTTTNTTTGGNCATTACATTAGCAGCGGMMCANCCTTTTANTTNCTTT 

CCCCNTTNATTGGNGMNANMCCTCCCATTTTNTTTTNTTCAAANAAGGAGCATTNGNC 

AAANANCTNCCCAAATNTNNCCTTNNACCACNGNGGGNCNNTANNCCACNNTNCTANCTT 

NAGG NGNNGCCATTTGGGTTNCNCCCTTTGNNCCNGGAANTTCTNCCCTTTTCCNTTNGN 

TTTTNTCCCCNCNGGGNNGTTTTTTTC 

CNCNAANAAAAANGTTNAANCCNAAAAAAAAGNGCCCAAGAG I 1 I II I I I 1 1 I I I I CCCC 

CCCCCC 

Sequence 1466 

AGGTACACTGAAACATAAATCCGCAAGTCACCACACATACAACACCCGGCAGGAAAAAAC 

AAAAACAGCAAGTTTACATGATCCCTGTAACAGGCCATGGNNCTNCAANNCTTCAGGAAT 

GCCTTNCCNTCNCATTCTGGCCCAAAGNTGGTTGTTTNCNTGGGMTNACCAGGNAGGCC 

ACCMTTCGGTGGGGCCTNrrCCTGGGGGGNGGTTCMCCMCCTNCAAGNCTTTTAAGGG 

CTTGGTTGGGGGTTCCCCACCANGAGGCCACCTTNCAATTCTTGGGCTTGGGGGACCTAA 

TTGNGGNTGGGGTTGGGNTTGGGACC7TCCTTAACCTTCCAAAAGTAAGNCAANAAGCNT 

GTTTM CCCMGCCAACAATTTNCAAAAAACCMGGTTGGTAATTTGGNAAACCANTCCT 

TTTTTTNAAAAATNATTCCAAAAAAGTTNGAAGGAAAAANCCANAGGAAANGGGNCAAAC 

CATTAAANTTAAAATNGGTTTAATCCAAGNAAAAAAG 

Sequence 1467 

CCGGGCAGGTAC I I I I I I I I I II I I I I I I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGACAAGCCATGCCNCCNNCCCNGGAAANGCC 

GCNCAGAGGGTTCCCTTGGGCAATISfTGGTATNTACTGGGGTATCCTTCTTNTGCGGTTNC 

TTTCGGGNCMTTTTTCCGATNNCCACCTTTNNCCTTACAAAGGGNCCCCAANNNTTGNC 

CTTTCTCCANCNCCCCAAAAGGTTNNGTGCCCCTTTTNNNCCCCGNAANNNGGAATTGTT 

NAAGNCCTTNGNMGNGGTTATTGGGGAGGACCATTTTCCTTTCCTTTNCCCCCCCCCCA 

AGGAAAMAAAMTTTTCCNGTGTTNAAGGNGGTAATGNNAAGTAACCTCCCCCCAACCA 

AATTGGAAAGdGGNTTAMGAANTGGNTTTTTTNACCTTTT^ 

CAMNNNCCTTTTTTCCAMGGGGGGGGCCCTTTTTCCTTTCCT^ 

TTNGGAAATTNGGGGGGGGGGGCCCTTGGGMTTTGGAAAAAANCCGGNTTNGGCCCGGN 

CCTTTGGGTCTTTGGTTTTCCCCAAAAAACCCCCTTNGGGCCAAACCCCCCT 

Sequence 1468 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTTCCTCCACCTCCNNCGCCACAATCTTCC 

C MAGAM GGGGAGMAGTCTTTTCTTCCTTCGGCAGNCNCGNGTAGTCNAGNGGTTCCC 

CCTTTTTTCCTTAGGNAGGCATTTAAGGGGNAAGTAAAGGAAGGNAGGAAGNAATTCCGG 

CCTTGGTTCNTTCGGGGGNAGTAAGNAAAAATTCCACCANAGTCCCGGTCCCCGGAATTC 

CCTTTTCNTTCTTMGGGGGTTCCTTCCGGTTAAGGTTCCGNAATTCCTTAAGGGGTTNC 

AAMTGGAAMAGNGAAAAATTMGGAAAGNMCCMGGTTTTTTGGCCAAAGGAAGAAA 

AGGTTGGGGTGGTTAACCCCTTGGCCCCCCGGGGGGCNGGGCCCCGGCTTTCCTTAAGAA 

AACCTTAGGGTNGGGAAATCCCCCCCCCCGGGGGGCCTTGGC 

Sequence 1469 

CCGGGCAGGTAC I I M I II I I II I II I I I I ATAACTGAAGCTTTATCTGGAGTGGGGGAA 

TGGGGGTGTGGTCAGTTGGGGCACCCAAAGNACAACTCATGCCTCCTNCNNNGAAAGGNC 

GGCCCMGGGTCCCTGGGCCAATTTGGTTTTCNTGGGATTTCTTCTTTTCGGNTACATCG 

GGNCAATTTTCCGNTACACCTNCNCCTNCNAAGGGNCCCNAGGTNTGGCTTTCNCNNCGC 

GCMNAGNTNGTNCCCTTTTCACCCGGAATGGMTGGTTAAGACCTTGNAAGNGGATTTT 

GGGNGGACCTTTTNCATTANCTTACCCCCCCCAANGTAAAAAAATTTCCGGTNNTAAGGG . 
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GNAAGMAAGGMCCCNCCCAACAATTGMGGNGGTTMGGTTGGGTATTTTCCTTTTCC 
TTTCCCCCTTTCCCCAMGGCCNTTTTTCNMGGGGGGGGCCTTTTTNCTTTCAT^ 
MCCTTTTTGGNAATTGGGGGGGGGGNCCTTGGGMTTGGGAAAAAACCGGTTTNCCCGG 
CCTTTNTCTTTG NGTTCNCCAAAAACCCCCTTGGGCCAAACC 
Sequence 1470 

AGGTACGCGGGTGGTGAAAAAAGAAGTAGAAATCGTGGCCCACCTCCCTCTTGGGGTCGT 

CGCCCTCGAGATGATTATCCGTAGGAGGGAGTNCCTCCACCTCCNCCNCCNCNATTCTTC 

CCAA AGAAAG NGAAGAAAGNCCTTTCCTTCTTCGGCAAGNCCCNGNAGGCCAGGGGNTGC 

NCCCTTTTTTChlTAAGGAAGTATTNAGGGGGAGTAAANGTAAGTAAGGAAGGAATTCCGG 

CCTTGGTTCNTTCCGNGGGTAGGAAGGAAAAAATTCCAACCAAAGGCCCCGNTCCCCGGA 

ATNCCCTTTTCCTTCTTNAGGGGTTNCNTNCGGGTTAGNTTCCGGAATTCCTTAAGGGGT 

TCAAAATGGAAAAAGGGAAAATTAGGGAMGTMCCAGGTTTTTTGGCAAAAGGAAGGAA 

AGGTTGGGGGTTGGNTAACCCCTGGCCCCCCGNGGGCGGGGCCCGGCTTCCTAAGAAACC 

TTAGGGTGGGGAATCCCCCCCCGNGGGGCCTTGCCAAGGGGAMNTTTTCCGNAATTAAT 

TCCAAAAGG 

Sequence 1471 

AGGTACAAACGAGTCCTGGCCTTGTCTGTGGAGACGGATTACACCTTCCCACTTGCTGAA 

AAGGTCAAGGCCTTCTTGGCTGATCCATCTGCCTTTGTGGGCTGCNTGCCCCCTGGTGGG 

CTNGCCTGNCCCACCCAACCAGGCNTGGCTTCCCTGCCTTGGCCTTGGCCTTGGCAAGGC 

TCCCCCAGGCCTTAAAGGGGTTTTGGNAAAGGNCNCCAAAGGGGAAAAGGAAGGTTCCGG 

GAAGG GGMNTTCCNGGGMCCGAAGGGNATTMTGGGGGGMTTTTTTGGGGTCCTTNC 

TTTTTTGGACCNTAAAAATTCNAACCCCAAAAAAAAAAAAGGNCCNAAANCCCCCCAAAA 

ANCCTTTTTTAAGNCCCCAAGG I I I I I M I AATTTTTTGGCCAAAAAAAAACCAAAAGGG 

GGGAAAAAAANTTAAAAAAAAGGGGGGCCCTTTTAAACCCTTTTTTCCAANTTANANTAT 

AANAATAAAATANNANAANANNNNNTNNGNGGGTTTAACCCCCTTTGGGCCCCCCCGGGG 

GGGNCCCGGGGGCCCCCCGGGNTTCCTTAAAGGAAAAACCCTTAANGGGTNGGGGGGAAA 

TTCCCCCCCCCCCCCCGGGGGGGGCCCTTGGGCCCAAAGGGGGAAAAAA 

Sequence 1472 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACCGCAGGACAGANCTACCGNCGGGGAGNCN 

CGNGGNAGGGAATGGGGGCATGCAAGTCATGNTGAGAAGNGGAGNGTGATCTTACAAGCA 

NNGGAAGNACGTATGNGGTCCCGNGTAAGNAAGTCCANGTAAGNGNCCCCTTGGAAGNAA 

AGTCCCAAGTANCGACCAGTNTTGNGNAGTAAGTAGGGTGNTGGGNAAATAGGTTGNANC 

CCATNTCCGGGGGTCCTTGGGGGGNCCTTGGGTAAGNCCNCGNCCAACCAACTGCTTCTT 

CCTNCCCCMTGGTTTAAAAMTTAGGCCACCCCTTTTTTAAGGAAAAAAAAAAI I I I IC 

CAACCAAAAGGGTTCNCNCCCAATTCCCAACCATANAAAAAAAAANATAGATANACACAN 

AAAGGGGAAAAGGTTACCCCTT 

Sequence 1473 

AGGTACTGGTGTGTCCGGAATCCTACCCACTGTGATGACAGTGCCTGATAGTTTCTTCTG 

CCTTTCTATCCCAAAACGATTGGTCAGTTTACCCMGGTTTGGCAAATGGCCAGCCTTNA 

GNAAATCTTCCCCAGGGGMNCAATNCTTCTTTTCTTAGGTTAAGNTTTGGCCCCTTTAA 

GNCCCAATTCCTmNGGTTMGGTTTTGGNAATNTTTGGAACCTNTNT^ 

NTNTIVTITGTGGCCTTTTCCCCAGGAAAMGGCCTTCTTAATGNCCTTT^ 

AACCTTNGGCCANTAACCCCAAAA I I I I I I I I I I I I NGGGTTTTNCAATTTCCTTGGGTT 

TTTGGGGGTTNCMTGGMTGGGGTTTAAGGCCAAGAAAGGCCCTTGGANCCCCTTCCCC 

NTGGGTTTTAACCAAAATTAAGGAAAAATTGGAAATTCCGGGG I I I I I CCCCTTGGGGGG 

CCTTAACCAAGGAACCTTTTGGAAGGGTTCCTNGGTN I M U I I I I I I I I GGG l I I I I I I 

TTTTAAAAAAANCCCCTNNTNCNCCCCCAAATNGGGTTGGGGGGCCNAAAAA 

Sequence 1474 

CGCGGT GGCGG CCGCCCGGGCAGGTACTTTTTC I I I I I I I I < I I I I I I GNGGATGGGGAC 
TTGTGAATTTTTCTAAAGGNGCTATTTAACATGGGAGGAGAGCCGTGTTGCCGGCTCCAG 



# 
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CCCAGCCCCGCNTGCTTCACNTTTCCACCNCTTTTNriTTCTCACCTTGCCTTCTTGGGCT 

TTTCTNCAAGGGCCCCTNNTGGCCTTTCTTCCCCGNACCCCTTTGGTTCTTCNCTTCTGG 

AAANAACCCNCTTCNCTNCCAACCAAACCTNNGCAGNCTCCCATTNNCTTTCCCCGGGCT 

TCCCCCNTCNCCTTAAGGTTCTTGGTTCCCCTTGGCCGGTTCNCCTTCTTGGTTCCCCCG 

GGGTTTTTCAAGNAGNMCAAANCNTTNCCCCAAAAANGCCAACAAAAAAGGCCAAGGTT 

I I I 1 I I I ICCCCCCCCTTAAAAGNGGGNNTGGGGGGGAAGGGGGAAAAGNCCAAAAAAAA 

GGMCCTTCTTGGTACCCCTTTTTNGGGGCCCCGGGCTTCTTAAAGAAAACCTAAGGGTG 

GGGGAAATNCCCCCCCCCCG 

Sequence 1475 

AGGTACTTCMTCCTGMTTAATCTTTAMCACTTTCAAATATGGAGATTAATCACCAAC 

TTCTTAI I I I I I GGGCCAGTTGGATTCAA 1 1 1 1 I I ATTTAACCATGGANCTCTTTCNTAT 

TATTCGGTTACCTNGTNCGGAAATGGCCTAGGAAAGNAAAGGGCCTTCTTTTTCNCGGAG 

GGGTTNCGGGGAACCAAGNCNAGGAACCTTTTTGGTNAA I I I I I I AATTGGTTTCCCCTT 

TTTTTTGGNTGGGGMTTTCCTTTAMTNGAACCCCCAGGTTCCGGGAACAAGGGAATTG 

NAGGCCCMAGGGTTCAAAAAAGGGGGAAAACCNTTTTTNTNAAACCCCCCAAGGGTATT 

GGCAAGTTTGGGACCACCATTAATTAA I I I I I I I I I I I CCTTTGGGGAAAAGGGAATGGT 

TAATTTTTTCCCCCCAAAATTCCAAAANCCCCTGGGGTGGGGAAATTAAAGGGAAAATTT 

TCNTGGAAAAAACCAACCCAATTCTTAA I I I I I I I GGAAAAAGGCCTTAA I I I I M I I ll 

TGGGAATAACCAACCC 

Sequence 1476 

AGGTACAAAATTTTATTAAAGGTCTTTAGAGAGCAACATCCAGACTCCAGAATACAGCTG 

NGNAGGAGACCCTGTTATGCTGTGGGGACTGGGCTGGGGCATTGGAAAAGCNCCTCCNTC 

TGGGCCTTCCCAACNCCCTTTANTTGTTTCTTGNMGAAATGGGNGGGGNTTTGNTTGGG 

NGNCNATGATNATTANCTTTCAATTCTTTTTTAGGGGGGATTTTNACCAACCAAAAATTG 

NCCTTTCNAMCCGGTTGNGGNNTTCCATGTGNCCAAGGGNGGNGGCCTTTTCTTTTTTA 

NGGGGGNCCCCCAMNTTCNTTTTNMCCCCCAANTANTTTTGNTTTTTCCGGNN 

CCCGGTGTTTNTACCCGGAAAATTCTCCGGNGGGGGNANCAAANGGNGNNTTTNCCTTGG 

TTG N GT ATTNCCCC AA ACT AATTTAAAA AAAA AAAC 

Sequence 1477 

CGGCCGCCCGpGCAGGTAC I I I I I I I I I M I I I I I I I I I I I I GATGAAATGCCAGGGTGA 

AAGGGATAGCCAAATAGGCTAAAGCACAAGTGCCACTCTAGGTTAATTTCGGCAGGACNN 

GCCCCAAATAAAAGGGTCCCACCNGACCANNTTACCCAATNCAACAACCATTTCCCGGNT 

TTCGGGG 

Sequence 1478 

AGGTACCTTAACTTGAGTTACAGGGCTGGTCCCTCTCTTTTCAI I I 1 1 I ATCCCAGTAGG 

TGAGACCGTCCCTGCTGTGTTCGGTGGCTGTGGAGTTGATGGCATCTTTGCTCCAGGTGA 

CACCTGCATGCTGGCCAGCTGAGCGGCATAGAGCTGCTGCAAAACAGGGAAGACAAACAT 

TGATCTCTTTMATGCAGCTGAAATTGAGTTTCAAAAGAAAAAACTTACCATATTGTTAG 

ATTTCTCAGACAGCCTTGTMTTTCTTTTTCACTATTTTTAAAAAAGGGAGCTAAGA 

GGCAAATAATAAAACAGAAAGAAAAAGGACAGGTATGGGAGCCATAGTTCTGTTTCTGGT 

CCTTCTAGCMCMCTTTATGATCCTGGACMAGGATTTGATCTCTTTAGMTTCAAGTT 

I I I I I I I I IA 

Sequence 1479 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAACACCTGCTAAAGACATAGC 

TTAATAAATATTTGTTGAATGTTGTTGAATGAAGAGCATTTTCCTGCTCATAAATTTCTC 

ATTACTTAACGAATGTAGAATAAATTCCTTACTCTGGCAGACTGGAATATGTAGTTACAG 

TTTATTCTAGAA I I I I I GCATTCCCCTTCTTCTGGTTCACTTGGAGAAGTTCTTTTCCAT 

CCTAAGTGACCCAGGTGAGTCCAGTGTCCTGCTGATTTCACTGCAGGGCAGGCATGTGAT 

CCAGGCTGGCCACAGTGGTCAGTGTGGACAACTGACCCTTGTTCATCTGTGGAGTCTAAC 

AAAAGTAGTGCCCCCTTATCCTCGGTGGTTATGTTCCAAGATGCODAAGTGGATGCCTGA 

AACCGTGGATAGTACCTGCCCGGGCGGNCGNTCTAAAACTAAGTG 
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Sequence 1480 

CCGCGGTGGCGGCCGAGGTACTTAGCATTGATCAAAGAAATTTCAAATTACCGATCAATT 
GGGTGGGGAGAGGAATTTTCATTGTCCAAGCACCCTCAGGGAACAGAAGTCAAAGCAATA 
ACATATTCAGCMTGCAGGTCTATAATGMGAGMCCCGGMGTTTTTGTGATCATTGAC 
ATTTAAGACACCAAAAAATAAAAGACTCCTACGAAGAACTG I I I I I GTTTTCCTCTTCCT 
TTTGAGAAG ACACT ATGAATTAAATTCTACAGC I I I I I I I I GATATATGGAAATTTGTAG 
MCAGAMTATTTTAGTTAMGTGTGACTTTCAGAAAGGGAAAATCAGGGCACAGCCTTG 
GTCTGTGTTCCCCAAATATTCACACTTTAAAGAATTCTTAAC 
Sequence 1481 

ATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATGATTGTCCTTTATGCATTAAATTCAT 
GCTTTTACATACTTGTGTTTCCTTTTCATTTCGTC 

TAGAGAAAAGGGTTGGCCAGGTGCAGTGACTCACACCTGCAATCCCAGCACTTTGGGAGG 

TCGAGGTGGGTGGATCACTTGAGGCTAGGATTTTGAGACCAGCCTGACCAACATGGAGAA 

ACCCCATCTCTACTGAAAATACAAAATTACCCGGGTGTGGTGGCGCATGCCTGTAACCTC 

AGCTACTCGGGAGGCTGAGGCAAAAGAATTGCTTGAATCTGGGAGGCGGAGGTTGTGGTG 

AGCCAAGATCGTGCCATTGCACTCCAGCCTGGGCAACAAGAGCGAAACTCCGTCTCAAAA 

AAATTAAATAAATAAGGCTTATGTGCAGTTATTCTCAATGAGCTAAAAAGCTTCCTGAGG 

CTAGAAATTCTTGTCA 1 I I 1 1 GGCTGGGA 

Sequence 1482 

CCGCGGTGG CGGC CGCCGGGCAGGTACTATCTATTTGGCCTTGGAAAGTTACTCTCTAAG 

TCAGGGCTTCTTTTCTTNAACTTTGGTGTGAGACTCCGCTGGAGAGCCGGTTAGAAAAAC 

ACAAGTCT CGGG CCCCACCCCGCAGAGCCCCTCATTCTCTAGTTCTGGGTTGGGGCCCAG 

GAGTCTGCTTTTCTAGCAAGCGCCCAGATGTCACTGATGCTTACAGCTCTCANACCACAG 

TTGGAGCAGNGATTTTTAAMGTCTTTTCATTTGTAAAGAGTNGTTCTCCATGCTCCAAA 

TGACTGNGACGACTGAGAAAATGCATGTATGTAAAGTCTGCANCTGGTGACATTGTACAC 

ACTNAGCAAATGGCCTTNGGTGTTACTGTNANT7TATTTTTACTAATTATTTNTNNCACC 

NACAAATTNGGANCTGCTNCAATCGGTNGGAATTTGGAAATTGGGC 

Sequence 1483 

GCTGCCATCAGCTCCCTAGGAGCTCTCCCTCCAGGAAGGGAATGTGTCCACCGTCAGACA 

CTCAGACCCAGCATGTGGGGACAGAGGCTGATGGCCTGTCTGGCCATTCCTCTCAGTTCC 

TCTCCTCACTAdCTTGTGTCCTTGTGCAAGTCACTTACCCTCTCTGAGGTTCAGTTCCCT 

CCTCTFTGAAGTGGGTTTAATAATAAGTACCTGCCG 

Sequence 1484 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTGCCCTTTCGTTAGAAGGCAGTGACTCCTTTC 

TGTGAAGCCGATTTAGTGAAGTGTCCTGTGCAGAAAAGAGTCCAGGGCTGTCAGTTAATT 

TCTTCCGGCCACTGGAGTTAGGGTTTGAAACTCTGCAGCTGCCTATTGCACTTGTGAAAA 

GGTTTGTATGTTCATCACTGCTGGCTGGCTCAGAGTTGGGAGTGAATCCTCCAAGGGATA 

AGCTTGGAGAACTTTCTGAACAGTCAATCTGTAAAGGTGTCTGCAATCCCAAGGCCAATG 

GACTAGATTCTGAAGGCTCTCGGTGGACCCACTGTTCCTCTCTGTTTATTMGCTTTTTG 

MGGAGAGAGATGAGGGCAGGACATGTGACMCGGTGCTTTTCCTTATGCTTATATCGCT 

CTCCAACAGCATCCTT 

Sequence 1485 

AGGTACATGCMGTTGCATGATTATAATGACGTGATCCTGGGATTTAAGTTGATTATGAC 

AGGAACAGAAGGAACTTGGAGATTAGGGACAATGAAAAGGTGGTAGTGAAAGGAGTGTTG 

GAGTTAAGTTACTAGATGTCTGGAAGACAGACTGTGGTGGTCAGATAATGAGATAGATTA 

TGGAGGGGTTACAGTTTTTGGTAATAACGAGAGAGATCTAAGGTATGACTGAGAGTGAAT 

GGATGAGAATAGCAGAGAACAAGGTCAGTGGAAAACTACTCTTCAAAGAACCTATAGTCA 

GGGTGTTGAAAGATTAAAAATTGCTAAGAATTAAATCAGGAAGTAGTGCTGCGGGAAATG 

AAAT GAGCAGTGAACTAAACTTACTGAGAAATGAGAGGGGATGACCCAAGGGTTTGTAGA 

TTTTGTAGATGATAGCA 

Sequence 1486 
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AGGTACATGACATCCGACGAAATGAGACGCCACCTAATTGATTTCCGGGAGTCCGCACCA 

GGGGCCCTCAGGGAAGAGACCCCGCAAAGATCCTGGGAGACCAAGGTGGGGACCCTGGTG 

AGAAGAGAGAGTTCAGGGGAGTCTCTCTTCATTGCCCTTCTGCTAACCCAAGCATTAATT 

TGCTAAGTATTTACCAGGGGAGTGGGAAAAAGAGTTGAGCGGGATTCTCTTAGGCTATGA 

GAGAGTCAGGCAGCCCCCAAGATAAAATAATGAACTAGAAAATCTGGAACCTTACTTCTC 

TGGGAATNTTACCTATCTGGCACCGTGGGAAGAAAGAAAAAAGGCTACTGAGTACCTGCC 

CG 

Sequence 1487 

GCGGTGGCGGCCGAGGTACTGACCTCTGCCAGTCAGCTGCGTAACTCCAGGCCCTAGGGT 

GCCCCGTCTGTCCAGCCAGGGATTGCATGGATATGCTGTGATCTCCCTTTTGGTTCTGAT 

TGAGTTGG ACCT TGTGGGAGGAGAAACATAGATGTTGATACATGAACACATATGTTGGAG 

AGAGAAAGTTTTATCTTGGCATAGGACTTTTAAAACACAAGGTAATT^ 

GGGACCAAAAAACACTCAATATGGGAAAAATCCAAATTCTGCCAAAATGTCTAAAGAGGT 

TTATTCTGAACCAGTATAAGTGACTGTGGTCTAGGTTACACAATTTCAAGAGATCCTGAT 

MAGCGTGCATGAGAGAGTTGGGCTACAGCTTGGTTTTACACATTTCAGGGAGACAGGAA 

TTGTANGGTAAAATTATGGCACAGGACTTTTAAAATGAAGCTGTGAAAGTTTACAGTCCA 

TAGAGAATAAAAATCTAGAAGTTT 

Sequence 1488 

CCGCGGTGGCGGCCGAGGTACTTGACACGACATATGGTAAATGAATAAGACAAAGGCTCT 

GATGGCTTCTCACAGCCCTGTGGATCAAACTCAGATTCTTTTCTAGAACCCCAAGGCCCT 

GTCAG TCTCA CTAGCCTCTCTCCAGACACACCAGCCI I I I I CTACCTTTCCAACATCCCA 

AGTTCCTTTTCTCTTACAGGACTACATACACTCTCTCTTCTGCCAGAAACCATGTTCTAC 

CAGCTAATTTCCACTCAACTTTTAAGTCTCANCTGAAATGTTACTTCCAAAGAGAGGCCT 

CCACTGAACCCCAAGCCTGGGGTTCACAGCACCTGTCTCCATAACTACATAATAATCTCT 

CTGATGTTTAGGCTGGGCATGGTGGCTCACGCCTGTAATGCCAGCACTTTGGGAGGCCAA 

GGCG 

Sequence 1489 

GGGCGAATTGGAGCTCCCCGNGGTGGCGGCCGGGCAGGTACCCAGAGTTGCGAGGAGTTT 

TTTAACTGATTTAGCCAGGTGGCAANCATNAGTGAATGGATGAAGAAAGGCCCCTTAGAA 

TGGCAAGATTACATTTACAAAGAGGTCCGAGTGACAGCCAGTGAGAAGAATGAGTATAAA 

GGATGGGTTTTAACTACAGACCCAGTCTCTGCCMTATTGTCCTTGTGAACTTCCTTGAA 

GATGGCAGCATGTCTGTGACCGGAATTATGGGACATGCTGTGCAGACTGTTGAAACTATG 

A 

Sequence 1490 

GATNAGCTCGATATNGAATNNCNNCANNCNNGGGGANNCANNNGNACAAGAGCGGANNCC 

NNCGCAGAGGAGCTNCAANTTTACACACTGTTTAAATGAGGGAATANGCNGCAGCGCTTG 

GATGTAACTGAAGAAGACAGTNNGAGCNNCAAGGAGGGACAACCACGACCTATGAGGACA 

CCATGCCAGAGAGGCCTGGACCCCACGCTAGGCTCAGTGCCTGTTATACTCTTGGGACCC 

AGCGCmCCTTCTCCATCACGTGGCATACTTGGCATTATTTGTTGNTTAAAATATTGCC 

CTTAGTTTTNACCTTTCNTMGGAGACACMGGNNGACCTTTGNGACATTAACAGTTGCC 

CCMGTNGGGGGNANANAMNMTTTTTGGGGGNGNAMMNCTTTTGGCTTTTTNNAAA 

AAAI I I I I I I I I NAAANTTT 

Sequence 1491 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGTGACCCTTGCTGGT 

GCTTTCATGTTTTGTGCCGAGGCMTTAGACTTTGTGCTGAATNTGTTTGTGCTGCCACC 

TCAGGGAAGGGGTGGAATGTGCAGCGTGGTTTCCATTTGACATTGTTTTCCCTGAGAGAT 

GGGAGGGCTGAACGTTACCTCTTGACAAGTCTTAGTGGACAGAGGGGCCCGGATACCCAA 

GCGCCTTAGTTCTTAGGGCTGGGTATTAGTTCATTTTCACACTGCTGATAAAGACATACT 

CGATACTGGGAAGAAAAAGAGGTTTAATTGGACTCCCACATGGCT 

Sequence 1492 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTACCCTTTACTTTTTCCC 
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CAAGACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACC 

CACCTCTCCCAAGAGCAGACATTAMCATCTTTGTGCTTTGMGAGAGTGMTTTTGGAT 

AGTCTTGTGATTCTCAGACTMCTTCCAGMTTATACTTTAACCCCTTCCAGATATGGTC 

CGCCTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTA 

ATTGTCMCCTCCAGTGTTGACTCTAGAMTATGAGGAAAGCTTTTCAGTTTTTAAAATT 

GCCATTTAAATTTAGTCTATTAAAAACAMCCTAGAGGTCTTGGTGCAGTTGATTTCAGA 

GTTTATTMTTTAAGTGGTCCCMAAGTATTACATCTTTTATATTCTGGAAGAAAAGAAC 

TGTGAACAAATTAGA 

Sequence 1493 

CTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCANGGTACTGTGGAGGT 

CCCAGGAGTGCTGGTGATGGGCACAAAGTTCCGATGGGTGAAACCATTGACATAGAGACT 

GCCTCTGTCCAGGAGGNANGGGCCCAGCTCGATGATGCCATGGGTCAGTTTGCTCAGCTC 

CCAGTATAGCTGCTCTCTGTCCAGTCCAGGGTTTAGAGGGTCAAGGCGGTGAGTGCAGAT 

GGTGTCCACGCCGGTGGCTGNCCCACGTTTTTCAGGCCTGAGCAAGGTCAGTNTGCAGCC 

AGAGTACCTT 

Sequence 1494 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTMTTTGAATTTGTAATGA 
GTCTGATGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGMTATC 
TTATATCCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATTATTATGAC 
AACTG AACAT GCTTATGCATCCCTCATGCCTTTAI I I I I I ATTTTGGGAAATCTTTCAGC 
TTCAGTTTTTGCTGATATTTATGTGATTCTTTGTTCTGC 

AACAGTCTCCTTGGTTCAGATTACTG I I I I 1 1 GACTAGAGCTTCTCGCTTCAGATTCTGT 
CATAAGATTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAAATAAAACA 
TTATTT 
Sequence 1495 

GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACGCGGGGAGTTTCACGCGCGTATGC 
TTTGCCCGCCATGGCCGCAGTCAAGGNGCCTTTGGAGTTCCATGCCAAGCGGTCTTGGCG 
CCCCGAGGAGGCAGTAGAAGATCCGGACGAGGAGGATGAGGATAATACTAGTGAAGCCGA 
GAATGGGTTCTCCCTGGAGGAAAGTGTTACGGCTCGGAGGCACCAAGCAAGGATACCTTA 
TGCTGGCTACTTTGGATGAGAATGAGGGAAGTGATAGATGGAGGGCAAAAAAAGGAGCAA 
TCCGATTGACCTTTNACCAAGGGGGANTTTTGGAAGC I I I I I I I I I I AAAAAI I 1 1 I NTT 
TTTNGGNGAAATTTTTNNAAAAAC I 1 1 I I 1 1 I NTTTNAAANNANATTACCCCCCTTTTAA 
AAAAAAAAAATTTNCCCCCCAAAAAAAAAANGGGGNAAAANANCCCCCCAAAAANAAAAN 
ATTAAAAAAACCCCCCCCAAANGNTTCAAGGGGCCC I M I 1 1 II I I I I NGGGGNAAAAAA 
AMAMTTNGGCCCCCNCCNCNTTTTmGGGAGGAGAGGGGGGNCCCCCNCCCCCTTTA 
AAAAAAAAAAAGGGCCCCCCCGGGGGGGGGGGGNANTTTTANAAAN I 1 1 I I I I I ICCCCC 
CCCC 

Sequence 1496 

CGGGCAGGACCATGGGAAATAAGAGCNGGCTNNNGGCATTCTGNGTANGGAGCCTGAGCC 

MACTCTAAAGCTGTCTTTATAMGGGAGGTCATGTGATGGCCAGAAATTGCCTTTGCTT 

CATG GTGCACTTGGTGGGGAGTCAGGTGTGGGGTGCTGGGTTTCACATCATCCCATTTTC 

TTrTNNGNNTTCAGACCTGCMTGCTTCTTTTGCAACCCGAGACCGTCTGCGCTCCCACC 

TGGCCTGTCATGAAGACAAGGTGCCCTGCCAGGTGTGTGGGAAGTACCT 

Sequence 1497 

AGGTACTTTTNGAAGTAACTGGACATGNGGGAGGNNAGGGGAANGGAAGTATTGNTATGG 

ACTGAACTGTGCCCCCAAAATTCATATGTNGAAGCCNTGAGCTCTGACATGATTGNATNT 

GAAGTCCTAAAGCCAGGAATGAGGAAGGCTGTGAATGTNCATTGTTCCATGCAAGAATGA 

CTCTGGNGNGGGCTATTTAGAGATCATGAGGGATACTGCCCCAGTTTCCACAGGCCAGAT 

GGNCTCCAACAAAAGCCACGGGGGAGTCACCCCTGCCTGGCAGATCTATCGGGTCAGGAC 

CACCGCCCAGGGGGTCCTGGAGAGGACAGTATAGGACCAAAGAGGAT 

Sequence 1498 
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CACCGNGGTGGCGCCCGAGGTACTTGGCCAAGCGCTCAGATCGGCAAGGGGCACCAAGTC 

TTGATCTGCCCAGTGCACAGNCCCACAACCAGGTCAGCGATGAAGGTATCTTCAGTCTCC 

CCCGAACGATGAGACACCATGACGCCCCAACCATTGGCCTGGGCCAGCTTGCACGCCTGA 

A GAGACTCG GTCACCGGAGCCATTCTGGNTTGACTTTTTAGCAAGAAGGNANTTNAAGGA 

NTTTTTTI I I ANCGNCTTNGGNAATCCTTTTGGGGGTGGGGCNCNTGGGNGAAAAACTCC 

CCCCCANCCNTNGNGATTCNTTNCNNGNGGGNNGAAAAATTTTTTGGNNAAAANTNNCCC 

ATAANNTTTTTNGTGAANAANGGANTT^ 

NNNAAAAANATTTTTTNTCTTCNCNGGGGGGGNCNNTTTTATAAAAAAAANGANNANACC 
CCCCCNCCGTGGNGGAGAAAATTNTTTTANATTA I 1 I I I I I I CNTCCCCCCCCCGGGCCT 
TGGGGGGGGGGGCCCCCCANCCC I I I 1 I I I I I I I I I I I 
Sequence 1499 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTTAA 

TTTGTGT CATC CCATCAGCAATGAAGGTCCCTATCCAGGGTCCTGCTTGGAGCAGCATTT 

CATGTTCTTTTGCTGTTTTGTGCTTTGCCGATTTTGGATTT^ 

ATTTAAAAAACTCGTCACCTTTTGGAMTGCCCATTGCCGACTTC 

AGTCCCT C CTGATTT TGTGTGTGTGTGTCTGTGTTTAAGCAAGCGTTCGGTTGGTATAGN 

I I I I I I I I GTTTTTTTAATTTAAATTGAAGGTAGCTGCCTCCTGAAAGCCAGCATTAAGC 

CAGAACACCCAGNTTCAAGCAAAAGACCCACCTCTCTGCAGAGGCAAAGTCTACTTTCTG 

GTACCT 

Sequence 1 500 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTTAGTAGAGACGGGGTT 

TCACCATGTTTGCCAGGCTGGTCTCGAACCCCTGGCCTCAAGTGATCCACCCGCCGCAGA 

CTCCCAGAGTGCTGGGATTTCAGGTGTGAGCCACTATGCCCGGCCTAATACGTGGATTTT 

TAAAGCTTCAGGTTCTGGTTCAGMGTTTCCTGGGTCTCATTAAAATAATGAGGCACTCA 

GAATTGGTCTAATAAAAATMCGACCATTTCTTTCTACTTCAGCTNTTTCACAAACTTTT 

TTAATGAAAATGACMGNGAGGNCCTTCAGTAGGGGCATTTTCAGGGGANAAAAATAGCG 

GGNAGACCTGAAACCTGGGNTAGGNAGTTTNTTTTTATTTTTTGA^ 

TTTTCANAAGACCCTAAAAMTTNTTTTCCCAAA 

GGNGGGACCCCCTTCCTTTTGGGNNTTTTNCCTTCCCCCCT 

Sequence 1501 

CACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACGCGG 

GGGCCACTGACCACAGCTCTTTCTTCAGGGACAGACATGGCTCAGCGGATGACAACACAG 

CTGCTGCTCCTTCTAGTGTGGGTGGCTGTAGTAGGGGAGGCTCAGACAAGGATTGCATGG 

GCCAGGACTGAGCT TNTNA ATGTNTGCATGACCCCCAAGCCCCACANGGAAAACCCCGCC 

CCCGNGGACMTTTGTTTTNACCANGTTTCCCCCNTTGGNNNNAAAAAATTCC I IN I I I 

TTTNCCNNCCCCCCCNCGGNGNNCCCNMGGGGGNNTTTTCCCCNNTTTTTTNANTNNNC 

CCCCCCCCCCCCCGGGGGNGGGGNGGNCNCCCCNCTTTTTNNNAANNAN I I I I I I II I NT 

TNNAMAANCCNCNNNTTTTTTANAANGGGGNTCCCCCNNAATNTNGGGGCCCNTTATCC 

CCNGGGGGNTAAMNAMNTNTTNNCCMAAAGGGGGGNCCCCCTCCCCCTTTTTNAAAA 

NAGGGGGCCCCCCCCGNGGGGGGNNGNGAANTNATAAAAN I I I I I I I I CNCCCCCCC 

Sequence 1502 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTCCAGCG 

CAGTCTGGGCCCCAAAGCAGTTTCACCCTTACCTAACTTACCTAACTCTTAATATAGCCT 

AAACTCACTGAMMTAAGCTMCTTCATTTCACCTTTNGTAGCATACACGTAGACTCAG 

AGTNTATACTGAAATATAATGGGAGGCTGTTATNAAAAAAAACCNNCCCTCNAAAAAAAA 

AA AATNTNNN CNNCTNAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGGGGGGGGGNNNNG 

GG I M I I I I rNATNNTAAAAAMAAAATNNGGGGGGGGGGGGGGGGGGGGGGGTTTTTTN 

TNNTNTNTTCCCCTNNANTCCGGGGGGGGGGGGGGGGGGGGTNGTNGGTTTCCTTNTATT 

NTCTNTNNCTANNNCACNCCTCCNANACCCGCNNNNGANNNNAANATNNNTTNNAATTAT 

NNNNATTTNNNAMAAGNGGGGGGGGGGGGGGGGGGGGGGTTNCTTTCNNT7TNNANTAT 

AANTANNATAGTNTCTNNNGNANNNTTNTNTNNNNANATNTNGGNNTNCNCNCNTNCNTN 
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CNTCTTNTTTATAAAANAAAAAAA 
Sequence 1503 

CCCTCGCGGTGGCGGGCGAGGTGCATCACCCTGCTGAGGGACATCCAGGACAAGGTCACC 

ACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTG 

ACGATGGACTCCGTGTTGGTCACTGTCMGGCATTGTTCTCCTCCAATTTGGACCCCAGC 

CTGGTGGAGCMGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCC 

ACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACA 

AGCAGCTCCAGCACCCAGCACTTCTACCTGMTTTCACCATCACCAACCTACCATATTCC 

CAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGAT 

GCGCTCAACCAACTCTTCGAAACAGC 

Sequence 1504 

CCGGGCAGGTACTCTTGACAAAGCCTCCCCAACAAGGTGAAATCACTCTCGGCTCACCAC 

TGGTGGCAGAAATACTCTGGGCCTGTCTGTTCTGAATGGTCAGTTTCACTGTAATCCTCA 

GGCCCTTCCAGTCACCCGTTGCCTTGGCMTGTCATCACCAACTTTTTTTGGAGACAGAC 

CCAGGGGGCCCGATCTTGGGGGCCAGGGCAGAAGTTGGCACCGACTTTNCCTTCCGGGNN 

CCCCTMGGTTA AAANA CTTTTNTTTTTTTGGGGGNNNAAAANTTTCGGGCNNTTG 

GGGGCCCTTTNNTTTTTAAANNAAACCCCCNNTTTTNGGGAAAAACCCNAAAGAAAAGTN 

CCCC CTTGGGCCCCCCCCCNNCCCAAMGGCCGNNAAAAACCCCCGGTTTTTTNGGNGTT 

TTTTAAAAAAAMGGAAANCCCCCCGCNNCGGGAMANTTTNTNANAANTTTTTTTCCCC 

CCCCCCCCNCGGGGGGGGGGCCCGGCCNCNN l I I I I I I II I I I 1 1 I I I 

Sequence 1505 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATGGTGGTAA 

ATGCTTAACTCCACCTATATAAAAATCAAAAACAGGCAAAATGAATCATCAGACTACAGC 

TTACCTTTAGGGGGTGTGAGTAGTGACTAACAGGGGTCAAGATGAAGGTTTAAGGATGTC 

GCCAGGCGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCGAGGCAGGCGGAT 

CACTGAGGTCAGGAGATCGAGACCATCCTGGCTAACACGGTGAAACCCTGTCTCTACTAA 

AAATACAAAAAATTAGCCGGGCGTGGNGGTAGGCGTCTGTNGTCCCAGCTACTCGGGAGG 

CTGAGGCAGGAGAATGACGTGAACCCAGNAGGTGGAACTTGCAGTGAGCCGA 

Sequence 1506 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTGCATTTATTTTAGCCCATGGAATAA 

C TGTG CTGAGAAACCACAGAGTCAATCAGATTCAAAATGTTAAAATCCTTCCTGCTTGGA 

GTTTTCCGTCTTCACATCAMGCATTTCATGCCCGTCAGCMCTTTTTAATGCATTTGCT 

CCTCGTTTGCACMTTTCCATTTAAAGACTTTCCTTGGCTGACTTCTCTGATGAGGTTTC 

CTGCTTGCCAGGAGAGCACGCTAATGCAGAAATTACAAAGGGGGCTTCACGTCCCTTTTC 

CGGAGGACCTGATATTTCAGATAATTTCCAGCTTCAGTTTTGGAGAAACGACTGTTCTTT 

GCACCAGGGGAAMTAAACTGATTTTCAGTGTAAGCAACCTTTCTGCAAGTAGAATGGGG 

ACTGTTGGGAATGGAGATGAAGACTTCACTTCATGTTATTCATTTCCTTAC 

Sequence 1507 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I I 1 1 1 1 I I I I I M I G 

GTTAGGTTMCAGAMCAATGGGGTTATTTCCGGTAGTGGCMCAGCACGAACTAGTTTT 

CTTGTGGACTMTGGCCCGAGAGTTCCTGCGATTTGTGGATTATTTCCCTTAGATGCAAT 

CGACCATTTCGTTCCAATAAGAATTAATGCTACCACGGGGCTTACAAAGAACCCCTCAAG 

TCTTCCAAATCTGCCCATGACATCAACCTNTGCTGCGTAATCGGACCTGCACCCAACCCA 

GGTTT 

Sequence 1508 

CCGCGGTGGCGGCCGAGGTACGCGGGAGAACAGCTCAGAAGGAGACCCACAGTGAGCAGC 

TCCCCTGTGTCGGCGGGCAGGTCGTCCCTCAAGTGTTCAGCTCTCAGCAGAGAAAAGGCC 

CTGGAGAGGGTGACTCCTCTCAGCTCTCAGCAGAGAAGCAGCCCTGGAGAAGGTAGCTTC 

TGTTCGCAGGCAGATTGTCCAGAGGTCCTGCTGCTCTCAGACGGGGCCCTGGAGAGGATA 

GCTTCTATCCATAGGCAGG 

Sequence 1509 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

249/467 



CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCG6GCAGGTACATAGCA 

GTGGAAGCTGCAGAGGGCCGAAACAAAAATGAAGTTTTCTACCAATGTCCAGACCAAATG 

GCTCGAAATCCAGCTGCTATTGACATGTTTATTATAGGTGCTACTTTTACTGACTGGTTT 

ACCTCTTATGTCAAAAATGTTGTATCAGGTGGCTTCCCCATCATCAGAGACCAAATTTTC 

AGATATGTTCACGATCCAGAATGTGTAGCAACAACTGGG 

Sequence 1510 

CTCCCGCGGTGGCGGCCGCCGGGCNGGAC I I I I I I I I I I 1 1 1 1 I I 1 1 I I 1 1 1 1 IAAAGAT 

CCAGACTAAGACACATCAACAAGAAATTTCCAAATACCAGGTCAAGAATACTTCACATGT 

TTCTGGAGGGAAAGAAAACAGTTCATATACAGNGAATCAGGAATTACAATTGCATCAGAC 

TTATCAACAGCATCGGGAAGATAATGNGAAAATACCTTCAAAACTCTNGAGGGACACAAG 

AAGATGCTGCCTGGCCACAGGAGCGAGACTGCTGGCCTCCAATCAGTCTTNTGGGCAGGC 

CTNTGATGCAATTACAGGGCTAAGGAACCACGGTAACCAATGTGCCATTGGCTGTGCTAA 

AACCCAGNGGCCCCAGGAAGAATCCTGTTGACACTGGCTTGGAACATGAGGCTTAA 

Sequence 1511 

CGACTACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAAGGTACTTTT 

I M I I II I n 1 1 1 I M 1 I I II 1 1 GGCATATAATCCAGTTTATTAAATACAGATGATGGGC 

CAGACATGGTGATAGAGAMTACAGATTAAGAMCCAGATCAMTCCTTTTTAAGGAATT 

ATNTAGNGGAAAATATNTCMCT^^*NTT^f^TTACNCTACTATTCATTATCTTACACTTCA 

AATCTTCACCTTTCCATmGACNGNCGCTNTTTTACTTCAGNNTCCTGAAMCATNm 

CCAACAGMGTTACATAAAMTNCTMTCTTCMGGGGCTTTCTAAMTATTTTNATCAC 

GTCATTAAAATCTTCTTTCACTAGGCAANGGTTNTGTCTTTATGGGGGCTG 

Sequence 1512 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACAGAAATGGAGTCATCAGTTTATCA 

ACCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACC 

ATATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATAT 

TGAGGATGCGCTCAACCMCTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTG 

TCAAGTTTCAACATTCAGGTCTGTCCCAACAGGCACCACACCCGGGGGTGGACTCCCTGT 

GTAACTTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATCTATGAGGGAATTTCTG 

CGGATGACCGGAAATGGGTACCTCGGCCGCTCTANAACTAGGTGGGATCCCCCCGGGCTG 

NAAGGAATTTCGATATCNAGCTTATCGATACCCGTCCGACCCTCGAGGGGGGGGGCCCGG 

GTCCCAGCTTTTTGGTCC 

Sequence 1513 

NGGGGCCGCCGCCCCCGGGGCCGGNCAAAGCAANGGGGGGCCAACNAAGGCNNGGNNNCC 

CCNGGGGGGNANGGAAAAACGNGGGNAANCCCCGCGGCAACCAAAAGGGGNCCAACAACA 

AAACAAGAACCGGGAGGCCCCGGGGGAGNCCANAAAAAANGGCGGCANAAAAGNCCCNGG 

GGGGGGGGGGGCCCCNAAAAAAGGAGGGNGGGANGCCAAAAACCAGCNACNAANCAAAAN 

GCGGCCCGGGGGGGCCGCCCCCAACCNGGGCCCCAGCCTTAAAGNCCCAAGAANCCGCGG 

GGAAAAAACCCCNGGGNCCGGGGGGCCCCAAGCCCNGGGCCAANNAAAAAANGGGAAAAA 

NNCGGGGCCCCCAACCCGCCCGCCCGNGNGGGGAAAAGGAAGGGGGGCCGGGNNCNCCGC 

CCGGNAANNCCGGGGGGGCCGGCCCNCCGGNGCCCCGCCNNNCCCCNNCGGCGGGCNAAC 

CNGGGAACCNGCCGGCCNGGCCGGCCCNCCCGGGGGGNCCCGGNNNACCGGGANGNGGNC 

CGGGCCGAAGGCCGGGGNANGCAAGCCGNCNAACCNCCAAAAAAGGGGC 

Sequence 1514 

GTNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTMGTCTTNTAAATCTGGTGTGTTT 

TTTACACTTGCAGCACATCTCAAATGAAATAGCAACATTTCACATGCACAAGACTCACAT 

ATCTTCTGATGGCTATCTTGTCGAATAGCACAATTCTAGGAAGTTCAAAAGGGGGAACAA 

TTGCTTCTCAAATTGATGAAGGTAGAGAAGAATGGTATCAGAGAAACTGATTAAGCAGGG 

ACCTGGGAATCTGAGTTGAGTTCCGGGGCATGTTTGTTCCCTAGCACCATGACCTACATG 

GCAGTGTCTCTTCTCATACTATTNTAAGAGTCCCTGG 

Sequence 1515 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGCCAAGC 
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TGGACTGCATAAAGATTGGTATGGCCTTAGCTCTTAGCCAAACACCTTCCTGACACCATG 

AGGGCCAGCAGCTTCTTGATCGTGGTGGTGTTCCTCATCGCTGGGATGCTGGTTCTAGAG 

GCAGCTGTCACGGGAGTTCCTGTTAAAGGTCAAGACCCTGTCAAAGGCCGTGCTCCATTC 

AATGGACAAGATCGAGCGGCCGCGGGCAGGTACTTCATGAGACAAAATGGAAAAAGGAGA 

ATTAAATTAAAATGCACMCTAATATTTATCTACTACAGACATMTATTTCTCAGTTGTG 

AACTMTTACTATGCTTGGAAMTGCTAGCATCCTCATAAATATTTTGGTTCTATTGGGA 

TACAAMTCTGATTTCGCAAACTTTGCAAAGGCACATT7T 

Sequence 1516 

CCGGGCACGGTACGCGGGGGCATTTATCTTCACTCTGATGAGGGCTCAGACTTGATAACG 

CCCGTGGTGCCCCATCCCTATAGGAGCTGGTGAGATTGCAGCCTGCTGCCTCCCCTCCAT 

CAGCCACAGC TATT GGATTTCCCACCCAGAATCTTTAGGTAAATGAGGTAAGTCCTGATT 

TrTAAMCTTCTTTTGAATCTGGAATCCAMCACCTGAGTGGAMGAGAAGCCTGCTTTA 

AACTGGACAGATGAAACTAGAACAGACTCTTGGAGACGGCTGGCAGGAAGTGAAGCTCAC 

CTTACCTGGGCTTACCTCACTGGGTCAMTCAGAATTTTATTTTGGAGGGCAGGTTGGCT 

ACTTTGGATATTATCTGNGAATTTCCTGCATTGGCTGGACTTCTMTCTCTGNGAATTTA 

AAAGCC 

Sequence 1517 

CCGCGGTGGCGGCCGAGGTACCTTTTAGTTATTGACAAGGTTAAAATAGCACTCTCAGTT 

TTTCAGTATTACAGAGAGCAAATAGTTrCCTCTCCTGCTCTGNGCAGTAGCTTTTCCAGA 

ACTATGGACAAAATTNGATCAGAAAGAAGATTGATTATTTCCTCATCI I I I I I ICTTTTT 

TTTGAGACAGAGTCTCTCTNTGTCCCCCAGGCTGAAGTGCAGCGGCATGATCTCAGCTCA 

CTGCAACCTCTGCCTCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTTCTGAGTAGCTGG 

GATTACAGGCGCCGTGCCANCATGCCCGC 

Sequence 1518 

CCGCGGTGGCGGCCGAGGT AC I I I I I I I I I I II I I I I I I 1 1 I CTTTCTTTC I II HIH I 

I I I I I 1 1 I I I I I I I I I II I I I I I I I I I I I I I 1 1 1 I I I I I I I I I I I GGCANNAAAAANCNT 

NATTNCCATTNGGCCCAAGGCTTGTTAGGAAAGTAAAAAAAGCCCCCNATTGGCNGGNGG 

GNNNGGCTTAGGCAAAACCCCTANTACTTTGCANGGGGCCCTCNAAAAAGCCCGNGGGCC 

CNA AAAAG NCCTTAAANCCNCCNTGNAANAGGGCCCCTTNAAAAAAAAACCCCCCNNNNN 

NAAATTTTGNGGCCCCCCCCCGGGGGGGGGGGGGGGGGGGGGGNGGCCCCCCCCCCCNTT 

TNTNNAACCNCCCNCCCCCCCCCCNNAAANGGGGGGGCCCCCCCNAAAAAAAAAANGGGG 

GGGG GGGNNNGGCCCCCCCCCCCCCCCNMAAAAAAAAMANGGGGGGGNGGGGNTTTTT 

TTTTTNCNCCCCCCCCNCCNNNCNNNNNNNNNGGGGGGG 

Sequence 1519 

CCGGGCNTGGAC I I U I I I I 1 1 I I I I I I I 1 1 I I NCTTGCTTTTGG 1 1 I I I I CCTTCGATA 

TCTTCAAAATCTGTGTCAGAAGAAAAATGTGTTTCTGACTCCTGTAGTAGATTAAAATCA 

GTTGGTATTTCTGGAGCTGCTATCATTTCTTTATCATCTTCTGGAACACCTCAATGTCAG 

AMTATATTCATATATTTACGTGGGTAAMTGGNGGCTTGCCTTCTTTCTTCTAAGACAA 

TTCCTC ACGG NGGGCCAAGCGCCCTCCGGGTGCCCCTCCAAACTTNTAGAAGCTTCGCCT 

CCGCCATTTTATAACTTACCTCCCCGCGTACCTN 

Sequence 1520 

CCGCGGTGGCGGCCGAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACA 

CTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACG 

ATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTG 

GTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACC 

TACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGC 

AGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAG 

GACAMGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCG 

CTCAACCAACTCTTCCGAAACAGC 

Sequence 1521 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGT 
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CACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAA 

CTTGACGATGGACTCCGTGTCGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCC 

CAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGG 

CTCCACCTACCAGTTGGTGGACATCCATGTGGCAGAAATGGAGTCATCAGTTTATCAACC 

AACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATA 

TTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGA 

GGATGCGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGT 

Sequence 1522 

TACT ATAGGGCGAATTGGANCTCCCCGCGGTGGCGGCCGAGGTAC I I I I » I II I II I I » I 

TTTTGGTGGTAATTGAGCTGGGAAAAATTCAGAATTGGGTCATAATTAATGGTAACTAAA 

CAGATTTGNGAATATGGGACATCTGTGGNCTTGAAAACATCAGTATGATTTGNCCCCATA 

TTTCTTCAGCCTGGACAATAGAAACAGACAGGGGAGGGGGGTAAAGTGCANTAAAGTAGG 

CTGAGTGATGTGGTGCTAGCAGCTGGAGTCCAGAGAAGTTCTGACAGTGCAGGGAGCAGC 

CCCTTTGTTCTTTGGAGCACTGGAAGGGCTGAGCTGCATCTGAGGTGTTCAAGCCACCAA 

CAGGACAGGGTAGAGGACTAAGTAGCACATGTCCTCCAGAGCAGCTTCCTGTCTTTGTGT 

GGTCACATCACATCGGGGGAA 

Sequence 1523 

CGACTNCTAT AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I I 1 1 I II I I I 

rTTTTTTTTI IGCTTCT GGTA AAAAGAAGCATGTTACAATTTTCCCCCCCAATTCACATA 

GGACTCTAAGMCATATTTTAAATCAGTGCTTCCATACAGGAACGAAATCCACTATTTTA 

GAATTCTAAATCTTGTTGAAAAGCAACCTTATCTGAAGAGTAAACAAGAAGATTCAAAGT 

TAAGTATCAGTGCAGTCCAGAGCCCCTAAATGAATAAAACTGAATGTATCTTAAAAATAG 

GATTTGCACACCAGTAAGAGACTTGTTTACGATTCTGGGGAGGAGGGAAGAAACTGTAAG 

AGGGAGAGAAAAGGGAAGAAAAACAAGAAGAAAAATAATNGGAAAAAAAAAAAAGAAAGA 

AAGGTTTGTTTAGCTCAAAG 

Sequence 1524 

ACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAATTGAGCTTAG 

TGATATTMCTTGGTAAATTAAGCCAGTTTGTCTCTTTGCTTTATCTGTCTTGTGGAATC 

TGTATCAGTGGAMTTAAATAGTCTTTACATTTGATGAACCCTGTTTAGGTGTTGGAAAT 

ACCCATCTATTJGTTAAAAAGGCAAGGTCCCATGATTTAGTGAATGGGGGATACAGACAG 

CCmATTCAAGTAACTGAATAAACAAAAGAATTAGAGAGTGTGATGAGTTTGAATAAAA 

AATATAGNTCATAAAAACCAGAAATGTGATAGAGCATAGTGGCTGGAAGAAAGTTACCCA 

AGTGGCTTGGGTAGTCAATGAAGTTGACTCCAACATGCAGTAGTACGCCGGGTCCTAAGA 

TAGAGATTAAGTCATGGTTTAAATGAGGAACAATCAGTAA 

Sequence 1525 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACACCACTGTGCCTGCTTTGAATCCTTTACGAA 

GAGAAAAAAAAATTAAAGAAAGCCTTTAGATTTATCCAATGTTTACTACTGGGATTGCTT 

AAAGTGAGGCCCCTCCAACACCAGGGGGTTAATTCCTGTGATTGTGAAAGGGGCTACTTC 

CAAGGCATCTTCATGCAGGCAGCCCTTTG 

Sequence 1526 

GAGCNCCCCGCGGTGGCGGCCGNTGTAC I I I I I I M M I I I I I I I I GCTTNTTAATTGGA 

TGCCTGGAGACAATTCCATTTCAATTACCTTATTGGCATGACNAGATATACAAGGGCTGC 

CAATGTCAATACATTAAGACTGAGCGTGCTGGAGCAGCAGCCAGGGTTCAGGGCACTGCT 

GTGTCATCTGCGCCACGGTGCACAAAGGCAGNTTCAAAAGCATTTCAGCATGATCGCTTC 

CCTCTCTCCGCTCCTGGGGAGAGAAGGATCCTGCACACCACAGGCAAATCATGCTGAAAT 

TGAGGNGGNGCCTTTGGGACTCCCATCCCATCACAGTCTTGGGAT 
Sequence 1527 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATTGGTTGAACA 
AACAGGTGGGCAAAGTGAGGAAGATAAGAAGTCCATCCGTTCAGTTTCCCCACTGCGGAG 
GGAATAACACTGTCTTTCCACAGGTCACAGACTGGGATGAGCAACGGGCTGAAGGCACGT 
TTCCCGGGAAGATCTGAACTGGCTGCATCTCCCTTTCCTCTGTCCTCCATCCTTCTCCCA 



# 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

252/467 



AGATGGTGAAGGGGGACCTGGTNCTG 
Sequence 1528 

AGGTACGCGGGATGGCACATANGACATCAGCTAGGCTTTTGGGMTCGTTTGTGTTCTTT 

GTGGAAATGTCCTTTAGAAGCACCCATGAAGTAGTGTGTTCAGACTGTGCACACAGAAAA 

CAGGCTCTGCCTTCACATGTGAGACGGTGGACTTTTCCTNTGGACAAAATGACAGCATNC 

TGGCGACTCCACAGTGGAGCTGAGCGCCACTCCCTGTAGCCCGATCTGGGACTGAAACGC 

TTACACCTCTGCCTTAGAAGGAGTCCCCCNTGCC 

Sequence 1529 

CCGCGGTGGCGGCCGCCCGGGCAGGGTACGCGGGGGAATTAGTCCGAGTGGAGAGAGCGA 

GCTGAGTGGTTGTGTGGTCGCGTCTCGGAAACCGGGTAGGCGCTTGCAGCATGGCTGACC 

MCCTGACTGAMGAGCAGATTTGCAGMTTTCAAAGAAAGCTTTTTTTC^^ 

CAAAAG ATGGG TGGATGNGAACTATTAACCAACCAAAGGGAATTTGGGGAACCTGTAATG 

AGGATCTTTTTGGGCCAGMTCCCCACAAGAAAGCMGAAGTTTACCAGGGAACATTGAT 

TT AATG GAAAGTMGAATGCCTTGATTGGTTAAATTGGCACCAATTTGGACTTTCCCTTG 

MTTTTCTTGACAMTGGATGGGCCMGGMAAMTTGGAAANGACCCCAGNACCNGTTG 

GAAGGAAAGGAAAA 

Sequence 1530 

CCGC GGTGGCGGCCGAGGTACG CGGGTGTTCCTTTTTGTTCAAAGTCTATTTTTATTCCT 

TGATATTTTTC I I 1 1 I I I I I 1 1 1 I I I I GNGGATGGGGACTTGTGAATTTTTCTAAAGGTG 

CTAT TTAAC ATGGGAGGAGAGC GTGT GCGGNTNCANCCCAACCCGCTGCTNACTTTTCAN 

CCNNTTTTCACCNGNCTNTGGGTTTTTAANACCCTNGGNTTTNTACCCCNNTCCTTTGNA 

AANCCCTNNNCCNAAN NGNG GGGCAAANANCCNGNGGGCCCNCTANAAAANANCTTGCGG 

CCCCTGTCCCCCGGGGTTTTGAGGACAANTTTNCCCAAAGCNNMAAANAGANGTTTTTT 

CNCCNTCNNCTCCTGGNGGGGGGGGGGNGAAAAAAAAAAAAAAAAAN I I 1 1 I i I I ICCCN 

GNN NNGGG GNGGNNNCTTNAAAAAAAAAAAANAATGTCCCCCCCCCCCGNNNNGTGGGCG 

NNAATTTTTNTTAANAATATTTTTNNCCCCCCCCCCCCCCCCCCGGGGGGGG 

Sequence 1531 

ATAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTACCATTCCGGGTCATCCGCAG 

AAATTCCTCATAGATGGCAACTCTGTCTACTCTCCGAGCCAGTGGCGAGAAGTTACACAG 

GGAGTCCACCCCGGTGTGGTGCCTGTTGGGGACAGACCTGAATGTTGAAACTTGACAGTC 

AG AMM TMCTCTTGATGCTGCTGTTTCGGAAGAGTTGGTTGAGCGCATCCTCAATATT 

CCTTTrGTTCCTCTGGTAATTGGTGGTGCCTGGCTGGGCTTTGTCCTGGGAATATGGTAG 

GTTGGTGATGGTGAAATTCAGGTAGAAGTGCTGGGTGCTGGAGCTGCTTGTTGGTTGATA 

AACTGATGACTCCATTTCTGTCACATGGATGTCCACCAACTGGTAGGTGGAGCCCAGCCA 

ATGGGAATGAGGCATTCAGGGTCTTATCTAGAAAGACTTGCTCCACCAGGCTGGGGTCCA 

AATTGGAG 

Sequence 1532 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCACGCACGGA 

AGAAGACGGACCCCGCGATGAGGGCGGCGGCAAGGAGCACCTTCATGTTGCGTTCGGAGA 

GGCG CA GCATC CACAGGCCCCGCGTACAGACTGGTTTGGTAAATGCTAAACTTTTGTGTC 

TTTTGCCTTTTTAMGGMTTGTTAACATTGGAATTGAGGGTATGTACCT 

Sequence 1533 

AGGTTTTGGGCGATCGTTTTCATACGAAATATTTGAGATGCTTTAGATGTGTGTGCATGT 

CAGCTGCCACCTGAAAGAAAGGCCTCATTAAAGATTTTCACTGATTAACTCTTTGATTGT 

TCTTGGGATCTCAGATGGGAATTCACGCTGCTTGCTGCAGAGCTCTTGGGCTAAGTGTAT 

TTTCT TAATT ACTGAGAAATGCGTGTTATCAGTAAGCAGTGAAAAGTCTTGAAAAAACTA 

AGTAATTTTAAAAAAAAAAAAAAAAAAAAAAANGTCCTGCCCG 

Sequence 1534 

NCGACTNCTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTA C I H I H I I I I 
TTTTT7TTTTTTTTTTGG NGG N NGTTTTC ATTTTCTTCTG G ACTGGTAATTTTTACTTCT 
GGCCTTGGTGGTCTTTGAGGGTCTACATCAGCCTNATCTTGTCCACTGCCAGCTCC7TGA 
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GACTTTCTTTCCTTGTCTAGGACCAGTTGAAGCTCATTATTAACCTGCTGCAGCTCCTCA 
CCATTCTCCTCATCGGNGCCTCCATTTACACGAGCAGGGCGTGCCTGCTGCTCACTGTGA 
TCCAAAATCTCCTNGAC7TGCTCAGGGGCATNTGTAGCCTCC 
Sequence 1535 

CACTTA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTGGTG 

I M I I iGGAAAAGTTACATTTAGATCTATTCTGAAGCTGTTCATTTTTAACAAATAAAAT 

GTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGTCTGTGA 

ACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAGTCAGAA 

AAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGTGCACGT 

AGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGTGAGAAT 

TTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCACTTATG 

CAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTTACTGGTGGGATGTG 

Sequence 1 536 

AG GTAC TAATATCCCTTAATGGCAGAATGTGATAATCATGGAATTAATTATTGCTAAAGT 

AGTTTTCAAATAAAAAAAAGAAAAAAGAAAACAAAACAAATTAAACTTGACACAATCTGA 

CCAMCACGTGTCAATTTACAATTTCAAGGTTATTTTACAAMTACCTACATTTACACAA 

TAGGCTCCCGGCAGCATTTCCAGACAMTGTTCTTTTMTTTATCCTGACATGCTATAAA 

TGAATAMTTACACTATTTTAAAAATTATCATCAGTMGTTTTTCCTTTCTCATGGGGGT 

CAANAGCAAAAAAGAAAATNNAGGCNTGCCAAAGGAAGGATTTGGAGAGGGGAAAGCCGC 

CACGCACCACCAGTATAACCTTAAAATAAACAGGGAGGGGGCTGGGGGGAGGGGGTCAAC 

AAATTCACTTGAAACCCTCACAACAAGTGGTGGATGCCTCTTCCAAAGGGGGGNGGAAAA 

GGTTCAANCTTGGCTTTTATTAGGCCCANTTTCAAAAAAAGGTATTTTTAAAAGNAAAAA 

ATGGTTTTTGGTANGGGGAAAAA 

Sequence 1537 

NCTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTCAAACACCTCATGA 

GGCTGGCCAGAGATGACACCACTGCCCCGGATGCAAGGGGGCAGCCTTGCTTTGGACCCC 

GTTCTGAATCTCAGGCTTCATCTATTAGGCACACTGAGCAGAAACACTCCTCCCTGCCAC 

T TCCG AGTTGCAGCAAAAAGGGTCTAACTATGATTTAMGCAAATAATTTTAAACTTCAA 

AT7TTCATTACTGTCATACTTATGAGGACCAAATAATGTAATGACTAAGACATCAAATTA 

CATGTAATATGAATAAAACCAGTAAGAAAATGATAGTTACTTATAATTGGATCAACATAC 

AAAAAGAACGTCATTTGGCCAAAGTAAAACGGTTAAAAAAATAGTGTCTATAAAATCACC 

ATATAAAATCTTAGTATGTACCTGCCCG 

Sequence 1538 

CCGCGGTGGCGGCCGCCCGGGCATGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAG 

TTGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGT 

GTGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCA 

CTGGACCTAAGCA GGGC CAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAA 

TGCAAAGTTATTTATTTTGTTAGAGTTTGAACAACNGTGATCTGTAGAAAACCAGTCACA 

TTTCCTTAAGGATGCATNAATAAATATGAGCTAAATGTAATMTTAGMCTTTGGTTTCC 

AAGCAGATT CCATG CTCTAATTCCCTGGATAGGTGAATATAATAAGATGATNTATCCNTG 

MTMGNTNCTTTTNCTTTGCCCAAGGGGACANGATGGTAAATAAGGGCTACTAAATCAA 

GTTG N CCTTT AAAAT AA 

Sequence 1 539 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAACAAATTCAAAGGGTTAAGAATTTCAAGT 

TGGTAGCTTCAGAGCATTAAGCCCCAAATGTGGGGTCCCTTCTGAGCCCAGGATTATGTG 

TGACTGCACTGGTCACATTGCCATGAAGCCAGCCCTGGTGGTTGGTAGAGGGGTGGGCAC 

TGGACCTAAGCAGGGCCAATCAGAGCCATTCCTGGAAATGGATATATAAATATTAGGAAT 

GCAAAGT 

Sequence 1540 

GTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGGCCCTTCCTT 
TGAAATCATTTTTTCTTCCAGGCCCTTGCGCTCCGGGCCTGGGAGACAGAATGAGAGACA 
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GCCCCAAGGATCTGTGATACACTTTCAGGGTCA I I I I I I CCTTGTCCCTGCCAGTCCAGG 
CAC ACCTTT TGTGTTCTCTCCTGMCATACTTTCTCATTTTTCACAATATGGAAAGGCTG 
ATAATTTTTCAMCCTTTAAGTTCTCTT^ 
TTTTTCTTCTCACATTTTATTATTATAMCATTCMGGGAMCC 

TAGAAATTTATTCAGCCAGCTGGGGCGGTGGTGGCTCACACCTGTAATCCCAGCACTTTG 
GGAGGCCCGNGGTGGGGTAAGATCACCCTGAGGTCAAGGAGTTTCGAGACCAGCCTGACC 
AATAATNGGTGAAACACTTGNCTTNTACTTAAAAAATACCAAAATTT 
Sequence 1541 

GGAGCTAAACCCCGGNGGCGGCCGAGGTACTTACCCTCMTTTCAATGTTAACAATTTCT 

TTAAAAAGGCAAMGACACAMAGN1TAGCATTTACCAAACCAGTCTGTACGCGGGGCCT 

GTGGATGCTGCGCCTCTCCGAACGCAGCATGAAGGTGCTCCTTGCCGCCGCCCTCATCGC 

GGGGTCCGNCTTCTTCCTGCTGCTGCCGGGACCTTCT 

Sequence 1542 

CCGCGGTGGCGGCCGAGGTACACAAACGAGATGCTACCTAGGAGAAGGGTATTCTTTTCA 

CTATTCTTTCAMTTTTCTGTATGTTCGAACATTTTCATAGTAGAAAGTTGGGGGGAAA^ 

TCTGTTTC ATAM CATTTCCTCAGCAGCAGTCCAGTCTATTGCATTTTAATTGGTTGTGA 

TATCATTGTTTTATGCAATACGTTCTCAACAAGTATATCCTCCGGCAAACTGAACAAGGA 

CCAAGTCTGTTCTGCCTACAGCTCTGCTTCCTCATAGCTGCTTTCCAGAACGTGACTCTT 

GCAAATTATCAAGAAAGGGGAACTAATCTAAGGGATCCCAGATCAACAGCCTNATGAAGA 

CCTTAATTTATGNTTCTAANATAAAAGATAGGMGTTTTCAAAAAAAGCCCCTGCTTCAC 

ANAGGATCAANANCAGGGGGTGGGCCCTGCTGGGCTTNCACTGGGATTTTTTGAGCATTN 

CTTTCCCNGGNGGCNCGGNAAGGGGNGTGGNGTGAGCCCNAGGGNGGAAAAAAATTT 

Sequence 1543 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCTC 

TGAGCTTAATATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTCA 

CGTGTATTCACAGCCTCAATGCCATAAGGAAACTCTTT 

Sequence 1544 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GtGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACCGACTTTGTCCAGCTGCAG 

GTGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTG 

CCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATATTTCTTTCATTTGCT 

CTGAGCTTAATATTGTAATTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTT 

TATCAAGCAGTTTGATCTTCTGAGGTCTGGTATTGTAGTTTGCTGGCCCACAGAACCTTC 

ACGTGTATTCACAGCCTCMTGCCATAAGGAAACTCTTTTAAGAAGTTCTGACAGCTGGT 

CATGTTAGGTATMGAACAGGGTGCCTTATCACTGGTGGATTTCATTTCT 

Sequence 1545 

CGAGGTAC I I M 1 1 I I I I I I I I I 1 1 I I I I GGGGAGTTTGTTGTTTTTAACCAAATTATNA 

TAGATGGAAGCATTAGGCAGCTGMTGTTCATTTGCCTTCANACATCATNTCCTATTTCA 

TTTGCTNGNCTCGCATTAAAMGMTCATTTATCAGCAAAAGCATCATTTATTTGTTTAA 

ATGA CMGGTTTAGCTAGCAGAGNAGAGTTTGCNATGCTTTTAAANAAATAACNTTTGAC f 

TTTTCTTCAAGACACTACAAMCCATTTGTTCAAAAAAGGCTGCCCAANGTCATTTANAA 

GMTATTTTTCAMANGTCTATTTCNTATTTTTAAAAAACTTGCCCTTTA 

GGTTTTGAATTCCACCTGGGCCTTTTCTTTTTAACCTTGAMGGGCTTAAA 

Sequence 1546 

GGCGGCGGCCGAGGNACAAGNANCNNTTGGNGGAGGGGGGNGAAACCCAANACCCGAACN 

NGGGACTGNGCAGACAAGCTATATCTTAANCCCNCNCCGGGCCAGACCNCNAGCAAGGGN 

GAGGAAGCAAAAGNCCACAGNNACNGGGGCAGGNAANNGGNANAAANGAGGGNGNGGGGC 



# 
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MGGNTGCGGTGNTTNACAGGGGGAANCCCAACACCCCGGGAGGNCGAGGCAGGAGGANCG 

CCCGAGCCCAGGAGGNNGAGACCAGCCNGGGCAACANGGNGAAACCCNGGCNCNACAAAA 

AANACAAAANNAGCNGGGCANGGNGGNNNGGGCCAGNGANCCCAGCNANNNGGGGAGGCN 

GAGGNGGGAGAAANGCNNGAGCCCAGGAGGCAGAGGNNGCAAGNGAGCCCAAGAANGCGC 

CACCGGAACNCCAGCCCAGGCAACAGAANGAGAACCCGGNCNCACANNAAAAAAAAAAAG 

AAAANGAAAAAGAAAAAANGNCCCNGGCCGGGCCGGGCCGGNNCNAGAACCANGGGGAAC 

CCCCCGGGCNGCAGGAANNCCGAAANCAAAGCCAACNGANACCGGNCAACCCCGNGGGGG 

GGGCCCNGGNACCCANNNNNNNGGNNCCCNNNAGGGGAGGGGNAAANGGCNCGCCNGGNC 

GNAACAAGGGGCAAAAGCNGGNNNCCNGGGGGGAAANNGGNANNCCNGNCNNCAANNCCC 

NNNAAAAAACNAANCCGGAAGCAAAAGGGNAANNCCGGGGGG 

Sequence 1547 

AACACCACCGCGGNGGCGGCCGNCCGGGCAGGNACTTTNNTTNGGGGGNTAAAAAACCCC 

CCNAAAAACNGGGCCCTAGNAGAAGGCAACNTTCATTNCAAACGAGGGGGCCCCNGCCCC 

GNGAAAAAACGGGGAACACGACGNCNAAGGCAGANCCCCGNAAGNACCTACNNNGGACAG 

CCGGGGCAGGCGNGCAAANNNTTTGGGCNNGGCCNCGCAAAGCACAAAGGGGGACACANA 

ACCCACTGCCCACGGCGCAGGAGAAAAAAAAGAAACAGCAAANCACGAGGGGACAN 

Sequence 1548 

CCGGGCAGGTACATCANTTNTGCTGAGGGACATCCAGGACAAGGTCACCACACTCTACAA 

AGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGACGATGGACTC 

CGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCAGCCTGGTGGAGCA 

AGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCTCCACCTACCAGTT 

GGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCAACAAGCAGCTCCAG 

CACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCATATTCCCAGGACAAAGC 

CCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGAATATTGAGGATGCGGTGAGAAG 

GGGGTGCTCAACCAACTCTTCCGAAACAGCAGCATCAAGAGTTATTTTTCTGACTGTCAA 

GTTTCAACATTCAGGTCTGTCCCCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAAC 

TTCTCGCCACTGGCTCGGAGAGTAGACAGAGTTGCCATTCTATGAGGAATTTCTGCGGAT 

GACCCGGAATGGTACCTTNGGCCGNTTCTAGAACTAGGNGGATCCCCCGGG 

Sequence 1549 

CCGCGGTGGCGGCCGAGGTACGCGGGGCTTGCATCTCTGGGGCCAAGGAGTGGTGGGTGA 

GATCTTCCATGGCCCTGGCATGGGTGATATAAGCGGGACCGGTAAGGTGGTGGAGCTCTT 

ACCAGACCCTGCANAACCCTCTCCGTGGTGTTGAACTTCCTGGAACCAGGGTGTTGCATG 

TTTTCCTCATAATGCAGGTTGGTGATGGTGAAGTTGAGGGTGAACGGCACCAGGAGAGGG 

CCAGCAGTTGTGGGGCTGGGGAGGGAGGATGGAGTCCCTGACCCAAGGTCCACTGTGGAG 

GTCCCAGGAGCTGAAAAAAGT 

Sequence 1550 

AGGTACTTTACAAACAAGTCTGAAAAAGGAGGGAGTAAAGTATGGAAGAATGATCTCTGG 

ATGTTGCTACTGGCCTCAAAAAAGCAGTGCTACAGATTTCTGTGTGAAGAGAATACGCTG 

TTCACACATTTTCCTATTTCCAGGCATGAAAATATTCTATTGGGTAGAAGAAATAGGAAA 

ATCTCTTATGACAAATGAAAGACAGGTGCAAACACACCAATCCCTGTCTAGCAGTATAAA 

GCATATTGGGCTCAGAATTTGTCTGTTGCTAGCACCTGGCTTTCATACTATATCCTTATC 

AAATAATCAGATTGAAAGTCCAAATCATTCTTAAGCAAGCAAAAATCCTCAGTGGCCATA 

CCTCA 

Sequence 1551 

GGCGGCCGCTCTAGAACTAGNGGANCCNTTTTGCGGGGGGAAAAAAACCCCAAGCCCANC 

GANACCGNCGACCNCGAGGGGGNNCCCGGNACCCAGCGNNNGCCCCCNAAAGAGAGGGNN 

AANN GCGC GCNNGGCGNAANCANGGNCANAGCNGNNNCCNGNGNGAAANNGNNANCCGCN 

CACNTTTTNNTTTNNCNGACGAGCCGGGNGCANAACCCCCAAANANA 

Sequence 1552 

CCGGGCAGGTACGCGGGCTGCCTGGGGATGGCAGCCGCGTGCGTTCCCTGTGCTGTCCTG 
TAGGGTTGTGGTGGCTTTCTGATATTCAGGTTGGTCCCTTTTCCTTCTTTTTGGATGGGC 
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TTTCGAGGTTGGGAGAGGAAGCTCAGGGATGGAACTGACTAMCATATGTGAGATGTTTC 

CTTGTTTGCTGCTCATGGAGGTTTCATTTCTGMGTCTTTTCTTGGGAGGGAAAAGGATG 

TGTGGATATATGAGGTGACTCTAGAACCCTCATTTTATGGATGAAGAAACTGTGATTCAC 

AGAGGGAGAGTGATTTGTCCAGTGTCACGTGGGGAGCTGGCCGGGGAGCCAGACCTCCTG 

TACCTCGGCC 

Sequence 1553 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTTCCATG 

GG CCCTG TTCCCATTGATGTATACTGCTTCCTTACTAACAGTGAGGGATGACTTTCATCA 

GTCTTTTATCACCTGAACAGTCTTCCGGCCATAATGATAGTAACTATAAGCTGATGCAGC 

TGTGGTGAAAGCTGTAAAACACCTTTTATGGAAGAAAAGAAATAAAATGTAGTTGTCAAG 

TCTAAAAAATAGTAGCAACGGGAATCATAATGAATACATGCAATGAATTTAAAATGTAAA 

AATGAATTTAAAAAGTAAAAAGGGCTCTGTGGTGTAATTTTTCTTAACTACAAGAGTCTA 

AATACACTGCTTTTCTTTAAGAGTTCATTTTAATTAGTAACGTCAMCAAAATTATTCTA 

GATAATGAGCCCTACAAA 

Sequence 1554 

ATCGACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGC 
GGGTGGCCAGATTMTTTTCCAGCCMTAGAGGAMGTTTITATATATTTTTTGCTAAGG 
TGCTGGAGAGGGGAAAGGATGCCACACTTCAGAAGCAGGAGGACGTTGCTGTGGCTGCTG 
CAGTCTTGGAGCCCCTGCTCAAGCTGGCCCTGCTGGCCGGCCTGACCATCACTG MIN G 
GCTTTGCCTATTCTCAGCTGGCTCTGGATATCTACGGAGGGACCATGCTTAGCTCAGGAT 
CCGGTCCTGTTTTGCTGCGTTCCTACTGTCTCTATGTTCTCCTGCTTGCCATCAATGGAG 
TGACAGAAGTGTTTCACATTTGCTGCCATGAGCAAAGAGGAGGTCGACAGGTACCT 
Sequence 1555 

CCGCGGTGGCCCGCCCGGGCAGGTACAGCAAAAAAGAAACTGAGAAGCCCAAACTGCTTT 

CTTGTTAACATCCACTTATCCMCCMTGTGGAMCTTCTTATACTTGGTTCCATTATGA 

AGTTGGACAATTGCTGCTATCACACCTGGCAGGTAAACCAATGCCAAGAGAGTGATGGAA 

ACCATTGGCAAGACTTTGTTGATGACCAGGATTGGM7TTTATAAAAATATTGTTGATGG 

GAAGTTGCTAAA GGGTG AATTACTTCCCTCAGAAGAGTGTAAAGAAAAGTCAGAGATGCT 

ATMTAGCAGGTATTTTAATTGGCAAGTGCCACTGTGGAAAGAGTTCCTGTGTGTGCTGA 

AGTTCTGAAGGACAGTCAAATTCATCAGCATGGGGCTGTTTGGTGCAAATGCAAAAGCAC 

AGGTCI I I I I AG 

Sequence 1556 

ATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGTCCACATCCCGAAGGCCATAGCGTC 

GAGCAAGGTCACATGGATGGCAGCACCTTACCACTCAGGCTCAGGATATTGGGATCTGTT 

GCCMAGCCACCACACATTTGCCACTCAATTCTGTGGTTTCCGCAGATGAGAAGGCTGAT 

TTGAACTGCTTCAACACAGGATCCTGCAGGACCTCCTCCTTTGCCATATGCTCCTTCAGC 

AGTTCTGTCTGCACAATCCCCGGCCACAGAGACACACAGCTGACCCCATGGCGCCGCAGC 

TCGTGGGCACAGTCAGCAGCCAGCTTGTCACACGCAGCTTTGCCCACACCATAGGGGACA 

TTGAACATATACTGCAGGCTTCCTGGGGAGGAGATGACCACGATNAGCCCCTGGCCAGCT 

GGTACCT 

Sequence 1557 

CTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCAGGTACGCGGGGGGT 

CCTGGTGCTGCGGCCACACCACCCTGGGGTGCTCATTGACAGAGCTGCCATAATGAACTT 

GAAAGGACGGGAATCACCGAGGGAAGCTGGGGCTCCCCTGCCCACAGGAGAGGATCCCCG 

TTCTTCAAGCTTCTCTGCTCAGTGTCTACTAACGACCGACATTTGCTAATGTAAATAATA 

GTAAATTATTGAGAATTCTMTTCTTTTACACAGTCTGTTTTTMTCTATTTT 

TAAAATCTATGACT 

Sequence 1558 

CCGCGGTGGCGGCCGGGGGCCATTGAGACTGCCATGGAAGACTTGAAAGGTCACGTAGCT 

GAGACTTCTGGAGAGACCATTCAAGGCTTCTGGCTCTTGACAAAGATAGACCACTGGAAC 

CAATGAGGAAGGAGAGAATTCTACTGGTCACAGACAAGACTCTCTTGATCTGCAAATACG 
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ACTTCATCATGCTGAGTTGTGTGCAGCTGCAGCGGATTCCTCTGAGCGCTGTCTATCGCA 

TCTGCCTGGGCAAGTTCACCTTCCCTGGGATGTCCCTGGACAAGAGACAAGGAGAAGGCC 

TTAGGATCTACTGGGGGAGTCCGGAGGAGCAGTCCCTTCTGTCCCGCTGGAACCCATGGT 

CCACTGAAGTTCCTTATGCTACTTTCACTGAGCATCCTATGAAATACACCAGTGAGAAAT 

TCCTTGAAATTTGCAAGTTGTCTGGGTTCATG 

Sequence 1559 

TACT T AGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCCAGGCACAGCTAA 

TTTTACTTTTTATTTTCTTGAGACAGGTCTCACTCTGTTGCCCAGGCTGGTCTTGMCTC 

TTGGCCTCAAGTGATCCTCCAGCCATGAGCCACTGTAAATGTCTGGGAATGCCAACTTGA 

AGCCCAGCTGGTCAGAAGTTCTAGAGGCCCAGACTTGCAACTGGTGTCTGAAGGGATGGC 

AGTCTTGGGGACTGAGCCCTCAACCTGAGGGATCTGATGCTATTTCCAGGCAGATAGTGT 

CAGAATTAAATTGGAGGATACCGCAGTTAGTGTCTGCTGCAGAACTGATTGCTTGCTTGC 

TGGTAGGGGGGAAATCCCCACATATTGGGGGGGTATTTGAAGTCTTTTGGTGTTGACTCT 

TACTGNGTTTNGTTGGTTTTTCATGTGAAGAGCAGAGGGAAAAGCAAGGGA 

Sequence 1560 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACTGACCCAACAACTGTGACTA 

GCTGGCCACGTCATTCAGGGCTGGTGTGGCATTTATGTGTGTGTGTGTGTGTGTGTGTTT 

TTCCTGTTTGCCCAGCAGTGCATTGTGGGTTCCAAGAGTGGGTAGTGTGTGTATGTGTGT 

GTGTCAGAGGGAGACCTGGCAGGCACCTNTTTGAGAGTAGCTGTGGTCAGAGCTGTTTGG 

TCAGTGCATTATGTTGAATGAGGTCCAGGAACCCAGAGCCACCCAGCAGACACCACTGTG 

GCTTGCCAGCTGCCAAGATGGAGAAGCATGTGCCCCTGTAGAGCGTCTTCCCAGAACCAG 

ACCCC GAGCCACTCGCTTCCTCTGTGCTGNGACAACATTGGTGCCAGGGGAGATNGTTNT 

TTTTTCAAAGGGGACCTACTGTAGCCCCTTTTAAT 

Sequence 1561 

CCACTCACTATAGGGCTGAATTGGAGCTCCCCGCGGTGGCGGCCGGAAGAGCAACCGAGA 

TGAAGGTGAAGATGCTGAGCCGGAATCCGGACAATTATGTCCGCGAAACCAAGTTGGACT 

TACAGAGAGTTCCMGAAACTATGATCCTGCTTTACATCCTTTTGAGGTCCCACTGAGAA 

TATGTMGAGCTTTAAATGCTACCAAACTGGAACGAGTATTTGCAAAACCATTCCTTGCT 

TCGCTGGATGGTCACCGTGATGGAGTCAATTGCTTGGCAAAGCATCCAGAGAAGCTGGCT 

ACTGTCCTTTCTGGGGCGTGTGATGGAGAGGTTAGAATTTGGAATCTAACTCAGCGGAAT 

TGTATCCGTTACCT 

Sequence 1562 

GGGCGNGTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTCCTGCCTTCCCCCATATA 

CTGAAGTTTGAGAGGCTGGGAAGGTGCGGAATGGGAAAAGGAGCAGCTGCTTATGTTCAG 

TTTAACATTCTCTGGGTTTCTCCATCTAGGTCTTGAGTTCATTCTCTTCCTGTCCTTTTG 

GCTTCCTTGTTTAACCTGGTCCCTGTTTCAGGAGAGAAGCCTCATCAGTGCCAAGTCTGT 

GGGGAAGACCTTCTCTCAGAGTGGAAGCAGGAATGTGCATATGAGAAAGCATCACCTGCA 

GCTGGGAGCAGCTTGGGAGTCAAGAGCAGGAGCAAACTGCTGAGCCACTAATGGGGCAGT 

AGTTTGCTTG 

Sequence 1563 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGC 

CAGCCCAATGCACAGCAGTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGA 

CTTTGTCCAGCTGCAGGTGCCTGCAGGTCACACTGGCTAACTACTTTGTGATGGGCTCTT 

CTTTCTGAGGTTCTGCCAACTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAAATA 

TTTCTTTCATTTGCTCTGAGCTTMTATTGTMTTTGTATTTGATCTGCTGGGTCTTTGG 

AGTCAGGACTGGTTTTATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGC 

CCACAGAACCTTCACGTGTATTCACAGCCTCAATGCCATAAGGAAACTCTTTTAGAAGTT 

CTGACAGCTGGTCATGTAGGTATMGACAGGTGCCTTATCACTGTGGATTTCATTTCTTG 

Sequence 1564 

CCGCGGTGGCGGCCGAGGTACAMTTGTCGTTTTTATTCCTCTTATTGGGATATCATTT^ 
MAAACTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTAACCCTACAAAGAGCCTTCC 
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TATTCCCCTCGCTGTTGGAGCAAACCTTATACCTTACTTCCAGCAAGCAAAGTGCTTTGA 

CTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAMCTGTTCTTTTGCATTTTGCCG 

CTGAGATATGGCATTGCACTGCTTATATGCCAAGCTAATTTATAGCAAGATATTGATCAA 

ATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCCA 

ACTGCTAATGCAAATTAAMCATATTTCATTTTAACATGATTTCAAAATCAGTTTTTCAT 

ACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGAA 

TGGGGTAGAAACATTGTAAATATTTACTCTTTGCAAACCCTGGTGGGTATTTTATTTTGG 

CTTCATTTCAATCATTGAAGGTATATTCTTATTGGAAATGTACCTGCCCNGGCCGGCCGC 

TCTAGAACTAGTGGAATCCCCCNGGC 

Sequence 1565 

ACTACT ATAGGG GCGAATTGGAGCTCCACCCGCGGTGGCGGCCGCCCGGGCAGGTACTTT 
TTGGTGTTTTTTGGAAAAGTTACATTTAGATCTATTCTC 

TAAAATGTTACAGGTTTCACATGATTTATTCGCAAAAAAAAAAAAAAAAAAAAGTACTGT 

CTGTGAACAGCAAGGAAAATATGATTACACCTAAGAGATGAAATTGGCATAGGCGAGAAG 

TCAGAAAAATAATCTATACAGCTTGCATGGTTGGGGAGTTAGGAGAGGCCAAGGCCACGT 

GCACGTAGAGCAAGAGGTAGAAGAGGCCCGGGGGCTAGAGCGCACCCTGGTGGATAGTGT 

GAGAATTTCACACTGGCTCAAGCCTTGAAGACCACCCCAGGGGTGCGCCTTAGCAACGCA 

CTTATGCAAGACCCCAACAACTGGCCCTTGAAAGGAGCTTTTCACTGGTGGGATGTGGCC 

CTGCTTGTATTCAGAACCATAGTTTTAACAAGCCANCATTAAATCCACAAGTCTTTGCCA 

MGCACTTTMGCCTNTTGACATTTATTGGMTTAATTTACCTGCMGGAAAGTTCATAT 

ATCTAGCTTTGGTAACCCTACATTCGGGAAAAATGTTTCCATGANATAACTAAAANCCCC 

ATGAATGATACAATCTTGGACAAAACCCAAAGNNGGCATAATTAGCATAAAACTCCAAAT 

Sequence 1566 

CCCGCGTCCGGCATCTCCCAACGTGACTGACCCGCCGACCACGACCCGCAAAGTGGTCCC 

GACGACGCTCACCACCACCAAGCCGCCAGAAACCTGTGAGAGCTTCAACAGCTGTGTTTC 

CTGTGTCAACGCCACCTTGACTAATAATATTACCTGCGTCTGGCTAGATTGCCATGAAGC 

AAATAAGACCTATTGTTCAAGTGAATTAGTAAGTAATTGTACCCAGAAGACCAGTACTGA 

CTCCTGTTCTGTAATACCTACCACCCCAGTGCCAACCAATTCTACAGCTAAGCCTACAAC 

TCGGCCTTCCTCTCCTACACCTACTCCCTCAGTTGTCACATCAGCAGGTGCAACAAATAC 

CACTGTGACTCCAACCTCACAGCCTGAGCGGAAGTCCACCTTTGATGCAGCCAGCTTCAT 

TGGAGGAATCGTNCTTGTCTTGGGTGTGCAGGCTGTMTTTTCTTTCTCTATAAATTCTG 

CAAATCTAAAAGAACG 

Sequence 1567 

TCGCCNCGCGTCCGGGCAACTGCAGTTGGAAAAAAAGATTCAACTTCAAAGCAGAGGATT 

TTTGATGAAGAACCAGCTAATGGAGTGAAGATAGAAAGGTTTACAAGGGATGATCCTTGG 

7TATCTTCATGTGAAGAAGTGGATGA7TGTAAAGACCAGTTGGAGAAGCAACAGGAAAAA 

CAAGAGATACTTTTGCAGGAAGTGGCATTCACTCAAAGGAAAGCAGTTATTCATGAGAGA 

GTCTGCAAAAGTGATGAAACTGGGGAGAAGAGTGGTCTGAATTCCAGTCTATTTTCATCC 

CCAGTTATACCCATAAGAAACCATTTTCATAAACATGTATCACATGCTAAAAAATGGCAT 

CTTAATGCTGCTGTAAACAGTCATCAGAAGATTAATGAGAATGAGACACTATATGAAAAT 

AATGGAATGTGGAAAACCCCCTCAGAGCATTCACCTTATTCAGTTTACAAGAACCTCAAA 

CA 

Sequence 1 568 

GCTCCATGCCCTTCTCTGAGACGGGGACCAGGGGATGGCAGNCATGCACCTGACAGCCTG 

GCCCNAGAAGTCGGTGACCTTTGAGGACGTGGCTGTGTACTTCACCCAGGCGGAATGGGA 

TGGCCTGTCCCCTGCACANAGGACCCTGTACAGGGATGNGATGCTGGAGAATTATGGGAA 

TGTGGCCTCCCTGGGATTTCCACTTCTCAAACCTGCTGTGATCTCACAACTGGAGGGAGG 

AAGTGAGCTGGG 

Sequence 1569 

CGCGTCCGTTTCTCCTGGCACCCTGTATTCATGGCCTTGGCGTTCTGCCTCTGCATGGCT 
GAAGCCATCCTACTCTTCTCACCTGAACACTCCCTGTTCTTCTTCTGCTCCCGAAAAGCA 
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CGGATCCGGCTCCACTGGGCAGGGCAGACCCTAGCCATCCTCTGTGCAGCTCTGGGCCTG 

GGCTTCATCATCTCCAGCAGGACCCGCAGTGAGCTGCCTCATCTGGTGTCCTGGCACAGC 

TGGGTGGGAGCCCTGACACTGCTGGCCACTGCTGTCCAGGCACTGTGTGGGCTCTGCCTC 

CTTTGTCCCCGGGCAGCCAGGGTCTCAAGGGTGGCTCGCCTCAAGCTCTACCATCTGACA 

TGTGGACTGGGTGGTCTACCTGATGGCTACAGTAACGGTGCTTCTGGGCATGTACTCAGT 

ATGGTTCCAGGCCCAGATCAAAGGTGCGGCCTGGTACCTGTGCCTG 

Sequence 1570 

CGTCCGCTAAGTTCCAATATTGAAAAATCTGTAAAAGACCTCCAGCGCTGCACAGTGTCT 

CTTGCACGGTATCGAGTTGTAGTTAAAGAAGAGATGGATGCCTCCATTAAGAAAATGAAA 

CAAGCCTTTGCTGMTTGGAGAGCTGTTTAATGGATCGAGAAGTGGCGTTGCTTGCTGAA 

ATGGACAAAGTGAAAGCTGAAGCAATGGAAATTTTGCTCAGCCGACAAAAGAAGGCTGAA 

CTTCTAAAGAAGATGACTCATGTGGCTGTTCAAATGTCAGAGCAGCAA 

Sequence 1 571 

GCAGCCGGCCATGCAGGCCGTCATCCGAACTGACATGGACCAAATTATCACCTCCAAGGA 

GCACCTTGCCAGCAAGATCCGAGCCTTCATCCTCCCCAAGGCAGAGGTGTGCGTGCGGAA 

CCATGTCCAGCCCTACATCCCATCCATCCTGGAGGCCCTGATGGTCCCCACCAGCCAGGG 

CTTCACTGAGGTGCGAGATGTCTTCTTCAAGGAGGTCACGGACATGAACCTGAACGTCAT 

CAACGAGGGCCGGCATTGACAAGCTGGGCGAGTACATGGAGAAGCTGTCCCGGCTTGGCG 

TACCACCCCCTGAAGATGCAGAGCTGCTATGAGAAGATTGGAGTTCGCTGCGACTGGACG 

GGCTGCAGC 

Sequence 1572 

CCGMCAANGTGGCCACCCAGGTTTTTAACCCAAGTCTAGTGGTCATCCTATTCTTTCCA 

CACCAACATGCCAAAAGCCTTACCTNGAAAGAAAATATAATTTGCAAGAAGCATCACAGT 

GCCGGGGTCTATATTCTCGATCAGGTTGNTAATTTTCCCATGGGI I I I I I G ACTGATAAA 

GNCATTGATCTGCTTCTGAGCCATTTCCAAATTCTGAAAGTTGGTAAGGATGGTTTCGGN 

ACTGTAAAAGTTCTTGGCATCTTCC 

Sequence 1 573 

CGCGTCCNGGNCGGAGAAGACAGTAGGGATACTGGATATGGGAGGAGCCTCTCTCCAAAT 

TGCTTATGAAGTTCCTACCTCAACCTCTGTCCTTCCTGCAAAGCAGGAAGAAGCTGCCAA 

GATCCTGCTGGCTG AGTTC AACCTGGGCTGTGATGTGCAACACACTGAACACGTGTACAG 

GGTTTATGTCACMCTTTTCTGGGTTTCGGAGGCAACTTTGCCCGGCAGCGCTACGAAGA 

CCTTGTTCTGAATGAAACTCTTAACAAAAACAGATTGCTTGGTCAGAAGACAGGTCTGAG 

TCCCGACAATCCATTTCTGGATCCCTGCCTGCCAGTGGGACTCACAGATGTGGT 

Sequence 1574 

CGCCGTCCNGTTTACTTGGAGTGTCCAAAACTGCAAGCAGTAGAGAAATAAGACAAGCTT 

TCAAGANNNTGGCATTGAAGTTACATCCTGATAAAAACCCGAATAACCCAAATGCACATG 

GCGATTTTTTAAAAATAAATAGAGCATATGAAGTACTCAAAGATGAAGATCTACGGAAAA 

AGTATGACAAATATGGAGAAAAGGGACTTGAGGATAATCAAGGTGGCCNGTATGAAAGCT 

GGMCTATTATCGTTATGATTTTGGTATTTATGATGATGATCCTGAAATCATAACATTGG 

AAAGMGAGAATTTGATGCTGCTGTTAATTCTGGAGAACTGTGGTTTGTAMTTTTTAC 

Sequence 1 575 

GAGGCGCTCAACCTACCGAGGCGCCCACAACCTGTCCGGCCTGCTGGGCTTGTCCCTGCG 

CTACAACAGCCTCTCGGAGCTGCGCGCCGGCCAGTTCACGGGGTTAATGCAGCTCACGTG 

GCTCTATCTGGATCACAATCACATCTGCTCCGTGCAGGGGGACGCCTTTCAGAAACTGCG 

CCGAGTTAAGGAACTCACGCTGAGTTCCAACCAGATCACCCAACTGCCCAACACCACCTT 

CCGGCCCATGCCCAACCTGCGCAGCGTGGACCTCTCGTACAACAAGCTGCAGGCGCTCGC 

GCCCGACCTCTTCCACGGGCTGCGGAAGCTCACCACGCTGCATATGCGGGCCAACGCCAT 

CCAGTTTGTGCCCGTGCGCATCTTCCAGGACTGCCGCAGCCTTCAAGTTTCTCGACATCG 

GATACAATCAAGC 

Sequence 1576 

GACCACGCGTCCGCGCACCGCTTCATTGAGGCTGCAAGAGCACACGGGCACCCACGTGCT 
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GGTCCACTGCAAGATGGGCGTCAGCCGCTCAGCGGCCACAGTGCTGGCCTATGCCATGAA 

GCAGTACGAATGCAGCCTGGAGCAGGCCCTGCGCCACGTGCAGGAGCTCCGGCCCATCGC 

CCGCCCCAACCCTGGCTTCCTGCGCCAGCTGCAGATCTACCAGGGCATCCTGACGGCCAG 

CCGCCAGAGCCATGTCTGGGAGCAGAAAGTGGGTGGGGGTCTCCCCAGAGGAGCACCCAG 

CCCCTGAAGTCTCTACACCATTCCCACCTCTTCCGCCAGAACCTGAGGG 

Sequence 1577 

CTACACTCAACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAG 

CTACATTCAACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGA 

GCAGCATGGGCCCCTTCTACTTGGG7TGCCAACTGATCTCCCTCAGGCCTGAGAAGGATG 

GGGCAGCCACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGC 

TGGACATACAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGTGTCACCCAACTGG 

GCTTCTATGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAGAATTTAT 

CAATCCGGGGCGAGTACCAGATAMTTTCCACATTGTCAACTGGAACCTCAGTAATCCAG 

ACCCCACATNCTCAGAGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCACCACAC 

TTTTACAAAGGCAGTCAAACTACATGACACATTCCGCTTCTGCCTGGTCACCAACTTGAC 

GATGGACTTCCGTGTTGGTCACTGTCAANGCATTGGTCTTCTTCAATTTG 

Sequence 1578 

GCGGCCGCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGNNNTNCAAT 

AGNTTTATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGC 

TTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGA 

CGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1579 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTAACCTACCTTTAAGACTGGGATNTCT 

ATTGNTAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1580 

CTCCCCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGT 

AGGCCNTTNCCCTACCAACTMCTAATGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGA 

AGCTCCTTTCTATTACTCATCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTC 

CAATAGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1581 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTA 

TTGTTNAACAATNNCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGG 

AGCTTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATG 

GCCTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 

Sequence 1 582 

AGGTACAAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGNTAGGCCNTTACCCTACCAA 

CTAACTAATGTTCCGCACCCCCATTTTTMGTGMGCTGTGMGCTCCTTTCTATTACTC 

ATCATGCGATAAATMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCT 

TAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1583 

CCGCGGTGGCGGCCGAGGTACAGCCTCTCGGCCCGGCTAMCATCATCGTCTTTTTNGGC 

CATTACCCTACCMCTAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCT 

CCTTTCTATTACTCATCATGCGATAAATAACTATATCCGGTATTAGCTATTGTTTCCAAT 

AGTTATCCCAGTCTTAAAGGTAGGTTAGGTACCTGCCCG 

Sequence 1584 

T CTTC GANCACGNTTCGGGCGGCTTTTCCCCGGGCAAGGCTTCTAAATCGGGGGGGCTTC 
CTTTTAGGGGGTTCCGAATTTAAGTGGCNTATAACGGGCANCCTTCGAACCCCCAAAAAA 
AACTTTG 
Sequence 1585 
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AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTNGGTAGGCCATTACCCTACCAAC 

TAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAMTAACTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1586 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTCGCnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGCCCAACGT 

GAACCGGTTACACTTTGCCAAGCGCCCTANCGCCCGGTCCTTTTCGCTTTTCTTTCCTT^ 

CCTTTTCTTCGGCCACGGTTCNNCCCGGCTTTTTCCCCGTCAAGGCTTNTAAATTCGGGG 

GGCTTTCCTTTTAAGGGNTCCCGATTTAAGTGGCTTTANCNGGNAACCTTNGAACCCCCA 

AAA 

Sequence 1587 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGTAGGCCATTACCCTACCAAC 

TAACTMTGTTCCGCACCCCCATTTTTAAGTGAAGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCGG 

Sequence 1588 

CGGGCAGGTACCTAACCTACCTTTAAGACTGGGATAACTATTGGAAACAATAGCTAATAC 

CGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGCTTCACAGCTTCACTTAAA 

AATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGTAATGGCCTACCAAGACGATGATGTT 

TAGCCGGGCCGAGAGGCTGTACCT 

Sequence 1589 

TACCNCGCGTCCGGGGCCCGGATGCTGGGGGCCACCAGGGCCCCGGGGATGTGCTGGTCT 

TCATGGATGCCCACTGCGAGTGCCACCCAGGTTGGCTGGAGCCCCTCCTCAGCAGAATAG 

CTGGTGACAGGAGCCGGGTGGTATCTCCGGTGATAGATGTGATTGACTGGAAGACTTTCC 

GGTATTACCCCTCGAAGGACCTGCAGCGTGGGGTGTTGGACTGGAAGCTGGATTTCCATT 

GGGAACCTTTGCCGGAGCATGTGAGGAAGGCCCTCCAGTCCCCAATAAGCCCCATCAGGA 

GCCCTGTGGTGCCCGGAGAGGTGGTGGCCATGGACAGACATTACTTCCAAAACACTGGAG 

CGTATGACCCTCTTATGTCGCTGCGGGGTGGTGAAAACCTCGAACTGTCTTTCAAGGCCT 

GGCTCTGCGGTGGCTCCGTTGAAATCCTTCCCTGCTCTCGGGTAGGGCACATCTACCGAA 

ATCAGGATGCCCCGTCCCCGTTTGACCAGGAGGCCACCTTGAGGAACAAGGTTCGCATTG 

CTGAAGACCTGGCTTGGGGTCA 

Sequence 1590 

GGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTGATATCCACATATTTTTGAG 

AAAAATTCCCAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACC 

ATAGCAATAATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACA 

GCACAGCAGCTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGC 

CTCAAATCCATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTC 

TGGTTCTTCAGCCTTGCTCCTTC 

Sequence 1591 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTGAATATCAATTTCTAAATATTTACCC 
AAAAATGTATATTTAAAG Mill AACAACTTCTTTTCAATCTTGAAAAAGTTC CCTCATT 
TTTTCTTTTAAAATCCCATTMCAATCAATGGTTCTTTTAAAATAGMTAAATATT^ 
TGTCCATTTAAAGMGMTATGGATCTTGGAAAATCATTTGAATATTTTTTCTTAAAAAT 
AGATTTTG I I I I I I I GTTAATTTGTTTCCAGAAATAACTTCATCTAATAAAGTAACTTGA 
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CCACCGTAAGATTCATATAATCTAATCAAAGATCTACCAACTGGTTGTTTTACCCTGGAT 
CCAGATTCACCMTTAMCCTAAAATTTTTCCTTGTTGGATCTAAMTTTTACCGTTATC 
AACAGCTTTGACAACCCAAANTTATTGGAAAAATA I 1 1 I 1 1 I AAGCTTTGAACTTTTAAG 
GGTGGGTTAACTTGCATTA 
Sequence 1592 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGG 

GTTTCCCTGCTATGTTTTGTCACACATCATTGCTGGGAGAGTTCAGTGCTATCCAAGTGA 

CTCCCCGGGGAGAGGACAACTGGAAATTTATGCCTGGTGTCTTCCTGGACTCCGACCTAT 

GCACCTTTTTCCTTGGATGATTTTAACCTGTCTCCTTTTGCTGCAATGAACCATAACCAT 

GAGTATATCAGCTTTTCTAATTTTTGTGAAGTCCCTCAAGTGAATTATCAAACCTAAGGG 

TAGTTTTGGGGACCCCCATCACAGAAGAGGTCAAACAGGGGGCAGG 

Sequence 1593 

CCGCGGTGGCGGCCCGCCCGGGCAGGTCTCTTGTCTAGTATACTCAAGGCAGCCTAGTAA 
ATTATTATTTATCTATACAATACTGGAAAAACTTGTAGACAAAAACATGACTTGAATTGC 
TAAAAAAAAAAAAAAAAAAAGAGGGAGAATGAAAACTTCCGGACGCGTGGGTCGAAGCTT 
GACCT 

Sequence 1594 

CCGCGGTGGCGGCCGCCCGGGCAGGTAGGCTGTCTACACTGACATCATCCAGGGCAAGCT 

GGACCAGCGAAACCAGCTGCTGGAAGTGGATTTCTGCATTGGCCGTGACATCCGAAAGAA 

GGATATCAATAATATTGTCAAGACCCTGCATGAATGGTGTGATGGCTGTGAAGCAGTTCT 

ACTGGGCATNGAGCAGCAAGTTCTGAGAGCCAACCAGTACCTT 

Sequence 1595 

CCGCGGTGGCGGCCGCCCGGGCAGG I I I I I I I I I I I I 1 1 I I CTTTCTGTTCCTTGGACTA 

GATMTCTGMATCMCTGTCTTCAGTTTTGCAGACTCTTGTGCCAGCTAAAATGTTCTG 

TTGAGCCCCAGAAGCTAATTTTCTTTTCAGTTATTATC 

TTTTTAATATAATTTCTACCTC I I I I I I ATATTCTCCATTTGGTGAGACATTCACATACT 
TTCTTCCAG I I I I I I I I AG ACGTAGTTTCCTTGAGTTCTTTGAGCATATTTAAAATAGTT 
GATTTAMGTATTTGTCTAGTTACTCCACCATCTGAGTTTCCTCAGGGAAMTTTCTATT 
GCCTCCTTTTTTCCTGTGTGTGGTCGGNCCATACGGACGCGTGGGTCGAAGACCT 
Sequence 1596 , 

ACTTNNTN I I I I I M I II I I I I I I I I I I AAGCGCCCGGCATTTTCTAAATAAAATCATTT 
TATTTGGNAAAAGGGTTTTAACAGNTATACCTTTCTAGCTAAAAGAAAAGAAATAGCGGG 
ATGTACCT 
Sequence 1597 

AGGTACGAAGAGAAAGGAATCAAAGCCTACTANCTCAAAAAATTGTCAAATTGCAAATGA 

GGACATCTAGAGAGGAAGAAAGGAAAAAAGGAACTAAAAAACAGAAACAATTAACAGTAA 

GTTCTTAACTATCAATAATTATTTTAAAAGTAAATAGATTAAATTATCTAATCAAAAGAC 

ATTGAATGGCTGAATGGATTAAAAAACAAGATCAACTATACATTGCCCATCAGAGATTCA 

TTTTAGCTTTAAGGATAAACACTGGTTGAAAGTGAAAAAAGTCAAGGCTGGGCATGGTGG 

CTCATGTCTATAATTCCAGCACTTTGGGAGGCCAAGGTGGGCAGATAATCTGAGGTCAGG 

AGTTTGA 

Sequence 1598 

CCGGGCAGGTACCACCTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 

TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCGGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAGCAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTAT 

Sequence 1599 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACACAGGACCAATGCTGCC 
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CATCCACATGGMTTTACAAACATTCTACAGCGCAAAAGGCTCCAGACTTTGATGTCAGT 

GGATGATTCTGTGGAGAGGCTGTATAACATGCTCGTGGAGACGGGGGAGCTGGAGAATAC 

TTACATCATTTACACCGCCGACCATGGTTACCATATTGGGCAGTTTGGACTGGTCAAGGG 

GAAATCCATGCCATATGACTTTGATATTCGTGTGCC I M i ll I ATTCGTGGTCCAAGTGT 

AGAACCAGGATCAATAGTCCCACAGATCGTTCTCAACATTGACTTGGCCCCCACGATCCT 

GGATATTGCTGGGCTCGACACACCTCCTGATGTGGACGGCAAGTCTGTCCTCAAACTTCT 

GGACCCAGAAAAGCCAG 

Sequence 1600 

TCNCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTCACTGTCTCATATAATCNCAGCCTCC 

TGTGTGATAGCTGGTGTCATCTCCACTTACAGATGAGGAAACTGAGGATAAGCAGGGTTG 

AATAACTTGCTCGAGATCACAGAGCCACGGGTGGNGAAACAGGATACAAACCTGGTTCTG 

TT TGACT CTAAGACCATTCATNTTTCCTCTGAAACTCAGTATTGCACAGTGTAGAAATGC 

AGTTTTTAAGACCTCCCAAAGTGACGTGCTGNGTCACTGCCCATCATTAGCTANATTGAG 

TMATTGCTGCTTAGCCCCANTTGTTTTGACAGAATCAATAGCCCTTGCTGAGGGGCCAN 

CAGCCTACGGACACAGGAGCATGCTTCATGGGCAAGACCACCATGCACACTCAGAGGGGA 

AGCCACAAGGCAACCTCCACGCCACTTAAGATTTGTAGGGCTCTGAACACATCACCAGAT 

ACAGACCACCTACTTATTTTTTNCACTGTMTANCAAAGGCAGGMTCTTTTTNCTGTAG 

GGTAAAGTTTGGGGG 

Sequence 1601 

GGCAGGTACAAGGCCCCAAAGAGGAGGAATTCCTTGTAGAGGAGCTTGTAGATGCTTCCC 

CTCCAGCGGAGAAGCAGGCCAGAGAAACCTCCGAAGCGGGCCTCCGCCACTTTGAGAGTG 

TATGAAACCGTCATGGTGCTGGGAGCCTGGGGCAGGAGGTCACAAGAGTTGCCCCCAGGG 

CTGTCGTTTAGTTCTCCAGACAACCTCCCTTCCACTCTGGTCTCCACACCCCAGCCTTCA 

CCCTGCGTCAAGTGGACAAG 

Sequence 1602 

AGGTACCAGTGGGGACTTCTGAAAGAACNNTACTNGTGTCAGTGGAAAAGCTGGCATTTT 

GGGAAATGCTGGTCTCTCTCAGTCCAGGAGTCAAGGMTATGTTGACTCTCTCTTAATTT 

TTGTAGTCTCAGAGGAAACAGACATTGATGTGGAAACAGTTGTATGCCCCATGGTGGAGG 

TGGTATCCATNGGAGCTGTGGCCTTGGTTTTTTCTGAGTCAGCTAGGACAGAGGATTGTG 

ACCCATGTCCAQAACTGGTGGTTTCCACATTAGTCGCTGCTGTGCTTGTGGAAGGATGCA 

TGGCTTCTATAGCTGTGGTGTCTTCATCTGTTGTCAGTATCTCATGTGAGGNACCTGCCC 

G 

Sequence 1603 

CCGGGCAGGTACTGTGATATCCACATATTTTTGAGAAAAATTCCCAAGCCAGGCGAATGT 

GGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATAATGGTTAGTAAGATGG 

TGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAGCTAATGATTTGGTATT 

GATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCCATGGTTGGCTCCTTCA 

GGCTGCAGTMGTTTGTCTTMGAMGTCCAGGTCTGGTTCTTCAGCCTTGCTCCTTCGC 

GAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTTTCCTCA 

Sequence 1604 

AGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACCACCTGAAGGCCCTCACACTC 

AACTTCACCATCTCCAATCTCCAGTATTCACCAGATATGGGCAAGGGCTCAGCTACATTC 

AACTCCACCGAGGGGGTCCTTCAGCACCTGCTCAGACCCTTGTTCCAGAAGAGCAGCATG 

GGCCCCTTCTACTTGGGTTGCCAACTGATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCC 

ACTGGTGTGGACACCACCTGCACCTACCACCCTGACCCTGTGGGCCCCGGGCTGGACATA 

CAGCAGCTTTACTGGGAGCTGAGTCAGCTGACCCATGGGTGTCACCCAACTGGGCTTCTA 

TTGTCCTGGACAGGGATAGCCTCTTCATCAATGGCTATGCACCCCAAAATTTATCAATCC 

GGGGGCGAGGTACCTGCCCCGGGCGGGCCGCTTAAAACTAGGNGGGATCCCCCNGGCTTG 

CAGGAATTTCGATATTCAAGCTTATCGATACCCGTCCNACCTTCGAGGGGGGGG 

Sequence 1605 

CCGGGCAGGTACCACNTGAAGACCCTCACACTCAACTTCACCATCTCCAATCTCCAGTAT 
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TCACCAGATATGGGCAAGGGCTCAGCTACATTCAACTCCACCGAGGGGGTCCTTCAGCAC 

CTGCTCAGACCCTTGTTCCAGAAGAGCAGCATGGGCCCCTTCTACTTGGGTTGCCAACTG 

ATCTCCCTCAGGCCTGAGAAGGATGGGGCAGCCACTGGTGTGGACACCACCTGCACCTAC 

CACCCTGACCCTGTGGGCCCCGGGCTGGACATACAACAGCTTTACTGGGAGCTGAGTCAG 

CTGACCCATGGTGTCACCCAACTGGGCTTCTATGTCCTGGACAGGGATAGCCTCTTCATC 

AATGGCTATGCACCCCAGAATTTATCAATCCGGGGCGAGTACCT 

Sequence 1606 

CGGCCGCCCGGGCAGGAACNNNNTTTTTTGGGGGGGGGAAAACCNAGACGGAGCCNCGCN 

CAANGGCCCAGGCGGGAGTGNAAGGGCACCAGGGGGGGCNCACCACAAANACCGCCGCCC 

GGGNGAAAGCCACNCNCCGGCCNNAGCCNCCGGAGNAACGGGGGGAACAGGGGCAGGCCA 

TNI I I I I I I I I IGNGGGGGGNGNANGGGGNGGANNCNCCAGGNAAAAANCANGCNGGCCA 

GGGGGGGGGGGAACNCCNGACCTNATGANGCACCCGCCNNGGNCNCCCAAAANGCGGGGA 

NNANAGGGGNGAGCCACCGNGCCNAGCNGACGG 

Sequence 1607 

CGAGTTACCAGAAGGAGAGATCACCACCATCGAGATCCACCGCACTAACCCGTACATCCA 

GTTAGGAATCAGCATCGTTGGCGGCAATGAGACGCCACTGATCAACATCGTNATTCAGGA 

AGTNTACCGGGATGGGGCCATCGCCAGAGATGGAAGGCTCCTTGCCGGAGACCAGATTCT 

TNAGGTCAACAACTGTGATATCATGCAACGTGTCCCATAACTACGCCCGGGCTGNCCTTT 

CCCAGCCCTGCAGNACCCTGCACCTGACAGNGCTTCGGGAGCGGCNGCTTNGGCAGTCGT 

GCAAA 

Sequence 1608 

CGAGCCTTTAGATGGCGTCTCCTCAGGGGGGCCAGATTGCGATCGCGATGAGGCTTNGGA 

ACCAGCTCCAGTCAGTGTACAAGATGGACCCGCTACGGAACGAGGAGGAGGTTCGAGTGA 

AGATCAAAGACTTGMTGAACACATTTGTTTGCTGCCTATGCGCCGGCTACTTNGNGGAT 

GCCACCACCATCACAGAGTGTCTTCATACTTTCTGCAAGAGTTGTATTGTGAAGTACCTN 

CAAACTAGCAAGTACTGCCCCATGTGCAACATTAAGATCCACGAGACACAGCCACTGCTC 

AACCTNAAACTGGACCGGGTCATGCNGGACATCGTGTATAAGCTGGTGCCTGGCTT 

Sequence 1609 

GCGTCCGACGTCCCCCAGGAGAATGGTAGACACAGATGAGGAAATTGTGGAGATGGGCAC 

AAACCGCAAGGTGAAGAAAACGAACAAACACCGAGTTGATACGGATAGTCCCCGTTCCCC 

TGAGGGCCGACCCCGTGACTCCCGATGAGCGTCCAGTTGCGCCGGGCATCCCTGGGCCTC 

CCAGCGTCTTTCCCGGAGGTTCATCGCCGCACGGCGGAAAGCGCTCTCGGTTCCGCTTTC 

CGGCCCCAGCCTCCCGGGCGCCCTCGCGCGGCGGCTAACGCTGGTCCTCGGCCGGGCGCG 

CTGACGTCATCGTGCGTCAGAGTGAGCCCGGATGGGGCGGCGGGCTTCGGGAGCGCCCGG 

GCTGATCCGAGCCGAGCGGGCCGTATCTNCTTGTCGGCGCCGCTGATTCCCGGCTCTGCG 

GAGGCCTCTAGGCAGCCGCGCAGCTTNCGTGTTTGCTGCGCCCGCACTGCGATTTACAAC 

CCTGAAGAATCTTCCTATCCCTAT 

Sequence 1610 

CGCGTCCGGCGGCGGCGGCTGAGGAGGGCCCGGCCTGCGAGAGCCTCAGTGGGAGCCGGC 

TCAGCCCTCGGCCACCATGTCGGCGCCGTCGGAGGAGGAGGAGTACTGCGCGGCTGGTGA 

TGGAGGCGCAGCCGGAGTGGCTGCGCGCCNAGGTGAAGCGGNTGTCCCACGAGCTGGCCG 

AGACCACNCGTGAGAAGATCCAGGCGGCCGAGTACGGGCTGGCGGTGCTCGAGGAGAAGC 

ACCAGCTCAAGCTGCAGTTCGAGGAGCTCGAGGTGGACTATGAGGCTATCCGCAGCGAGA 

TGGAGCANCTCAAGGAGGCCTTTGGACAAGCACACACAAACCACAAGAAGGTGG 

Sequence 1611 

CGCGTTCGAGTCTGGAGACGACGTTNCGAAATGGCACCTCGCAAAGGGGAACGGAAAAGA 

AGGAATGAACAGGTCATCAGCCTTGGACCTCAGGTGGCTGAAGGAGAGAATGTATTTGGN 

GTCTGCCACATCTTTGCATTCTTCMTGATACCTTTGTCCATGTTANTGMCTTTCTGGC 

NAGTGAGTACTTCAGAAAGGCATNAAACANGCCTCAAAGGGAC 

Sequence 1612 

CCCCGCGTCCGCCCACGCGTCCGGGCTCGGCTGCACCGGGGGGATCGCGCCTGGCAGACC 
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CCAGACCGAGCAGAGGCGACCCAGCGCGCTCGGGAGAGGCTGCACCGCCGCGCCCCCGCC 

TAGCCCTTCCGGATCCTGCGCGCANAAMGTTTCATTTGCTGTATGCCATCCTCGAGAGC 

TGTCTAGGTTAACCGTTCGCACTCTGTGTATATAACCTCGACAGTCTTGGCACCTAACGT 

GCTGTGCGTAGCTGCTCCTTTGGTTGAATCCCCAGGCCCTTGTTGGGGCACAAGGTGGCA 

GGATGTCTCAGTGGTACGAACTTCAGCAGCTTGACTCAAAATTCCTGGAGCAGGTTCACC 

AGCTTTATGATGACAGTTTTCNCATGGAAATNNGACAGTACCTGGCACAGTGGTTAGAAA 

AGC 

Sequence 1613 

GTTNAGTNGAAGTTCTCTACCATTGAATCAGTGAACTAGAAAGATCTGATTTGGCCTGGG 

ACCAGTGTTCAAGTTGGTTTGGTCTTTATTAAAAATCACAATATTCCGAAAACAAAAAAA 

CCTAGGAGATAAATGTAGAGGTATTGACTTTTCGTATCTTTTATCTTCACACTGAAACAA 

GAGCTATCCTATTTGATTATTAAAGTGAGCTATGTGTTAAGTGCCAGGACATTTCTAGCT 

TTTGTGAGAATGTGTCTACATATGAGtATAATAAACCCACATGTATACACAATTGTCTCT 

TATGTACTCCTACCTGGCAGGAGTCTTTG 

Sequence 1614 

CGCGCCGGTGGTGCGATCTCGGCTACTGCAACTTCAGCCTCCTGGATTCAGGCAACACTC 

C TGCCTCA GCCTCCCACGTGGCTGGGATTACAGGTGCCTGCCCCCATGGCTA A I M I N G 

TAi 1 1 I I I GTAGAGATGGGGTTTCACCATGTTGGCTGGGCTGGTCTCACTCTCCTGACCT 

CAAGCAATCTGCCTGTCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCC 

CCCAGCCTGAGCCI I 1 1 I I I I I I I CTAATGCATCCAAGGTTAAGGGGAAGACGCAAATAA 

CAGGACTATTCTAAAAGGAAACCTGTTTGAACTCTGTGAGATCAAGTCATCAGTCTCAGT 

AT TNCA CAGGCACAC CTTM TTTCATTGGTAAAAGATATATATATTTTGNCTATTTTTGN 

GCTTTTGGGGGCCTATTTTGNGCTTTTTTACCTTTMTGN^ 

GTGGAI I I I I I IACCA 

Sequence 1615 

TCGCCNCGCGTCCGTAGAAACTCACACTAGACACACGCAGTAGTCATACGTCTTCACACG 

GTTTAGGAGCTACTGGACCMCATTCTTGTTTTTGCTTTTG I I I I I I I AAATAATTCTAG 

TCTGGAGCTAACTGTGGAGCAGCCAAATAGTAGCTGGCATGTTGATTCAAACCATGGGCT 

GAA7TTGCTCATAGGCTGTGCATCAGACAAAAGCTTGAATA7TTGTGTTGTATGCTTGTT 

CCAACCACCGQTTGTG TGAGCAT TTTTGTGGC1TGTACAGAMGTACACTTTTAAATTGT 

CTCT TGCATCACTAAAATTTTTTTAAAATGAGCATAACAACGAAAGGCATCCAGCTGACT 

TTTTGATTCCAAGATTATTGATTGGATTGACTTTT^ 

TAAATCATATGGCGAGTCAGGGAATAAAAAAGTCAAAAAGGAAACAAATAGAAGCTTTTT 
I I I I I IAAAAA 
Sequence 1616 

GGNCGAGCGCGCCTTGCGGGGGCGGTATCCCGGCGCCCTAAGACCCCACGACCNCNNGCA 
CCGGCCGNTGCTGCNAGACCCCGGCCCGNGTCGGTCCGATGTCGCCCCCCGGNCCCGGCG 
GAAACGCCTCCCTNCTGGCCAGGCTGTTCNAGCACCCGCTT 
Sequence 1617 

TCGCCACGCGTCCGTTNAGATGCAGAMTGMMAAAMCACCTTTGTTTTATAAATATC 

MAGTACATGCTTAMGCCAAGTTTTTATCTAGTTTATTCTAGTACTTAGCTTGCCTGGA 

ATAGCTAATAMTTATTCATGTATGTGCTTTTGAAAATCCAGAGCCCTATTTTTACACAC 

TTGTGTGMGTTGGCAAACATTTTGAAAMTGGAAAAAAGTTTCTAATAATTGGGAACAA 

TTACATTMTTMTATTTTGTAAAATATTGMGCTTTTAGCCCTATGTCAATTTGTAGAT 

TAAAATAAATTAATTATAGGAAAGGAAGATAACAGTGAGAAACCAAACATTAC 

Sequence 1618 

CACGCGTCCGCCCACGCGTCCGGCGCGGCGGGAGCGGGCGGNGCGAGCGGGAGGCGGCGG 

CTCACAACTGAAGCGCTGNGGCATGGNGCGCGCCTGCCTCCAGGCCGANNNGTACCTNAT 

GTTCGCCTTCAACCTGCTNTTCTGGCTGGGAGGNNGTGTGNGGGCTGGNTGTCTGGCNTC 

CTGGCTTGGGC 

Sequence 1619 
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TCGCCACGCGTCCGGACCTGGATGTGAGGGTGAANGGGCTGTGCTGCTGGGAGCCACATT 

CCTCATTGACTACATGTTCTTTGAGAAGCGAGGAGGCAGCTGGGCCCTCTGCCATCACCA 

GTTAGAGGCCACCATGGTGTGAGGAGACCATCACCTCGACCAGAACTCCAGATGGTCACC 

TGCCCTGGCCCCTCCTCTGGGCAGCCCCTTTCCTCCATGTACACTGCAGGGGACAGAAGG 

GGGGCCCCATCCCTACCCTACTCCCTGGCCGCCTGCCCCTGTGGTTCCCAAGGAAGGGGG 

TATTGTATTGAGAGCCGCTCTCCTGCTACCTCCCACCACTGTCCAACAGTCCCTCGGCAC 

ACAGGCATATTMGCT7TCACACTTTTCCCATGCACTTTTTTCCACCCC 

Sequence 1620 

GGAGTCGACCNCGCGTCCGGGGGCTTGCTGGGATCATGGCGGAGAATCACTGCGAGCTCC 

TGTCGCCGGCCCGGGGCGGCATCGGGGCGGGGCTGGGGGGCGGCCTGTGCCGCCGCTGCA 

GCGCTGGGCTCGGCGCCCTGGCCCAGCGCCCTGGCAGCGTGTCCAAGTGGGTCCGACTCA 

ACGTCGGCGGCACCTACTTCCTCACCACTCGGCAGACCCTGTGCCGGGACCCGAAATCCT 

TCCTGTACCGCTTATGCCAGGCCGATCCCGACCTGGACTCAGACAAGGATGAAACAGGCG 

CCTATTTAATCGACAGAGACCCCACCTACTTTGGGCCTGTGCTGAACTACCTGAGACACG 

GCAAGCTGGTGATTAACAAAGACCTCNCGGAGGAAGGG 

Sequence 1621 

GTCGCCCCGCGTCCGGGGCCCGCGGGCCTCGCCTCCGCCCTCCGCCACCTCGAGCTGCGG 

TAGCAGCGACTCATGAGAGCGCGGCCGGAGGACAGATTTGATAATGGGCTGCATTAAAAG 

TAAAGAAAACAAAAGTCCAGCCATTAAATACAGACCTGAAAATACTCCAGAGCCTGTCAG 

TACAAGTGTGAGCCATTATGGAGCAGAACCCACTACAGTGTCACCATGTCCGTCATCTTC 

AGCAMGGGAAC AGCA GTTMTTTCAGCAGTCTTTCCATGACACCATTTGGAGGATCCTC 

AGGGGTAACGCCTTTTGGGAGGTGCATCTTCCTCATTTTCAGTGGTGCCAAGTTCATATC 

CTGCTGGTTTMCAGGGNGGNGGTACTATATTTTGNGGCCTTATATGATTATGAAGCTAG 

MCTCCAGAAAGACCTTTCATTTAAGAAGGGTGAAAGATTTCAAATAATTAACAATACNG 

AAGGAGATTGGTGG 

Sequence 1622 

TTCGGGAGTCGCCCCGCGTCCGCTTTTAGAAAAGGCCAATATACCTATCACACTTTGGAA 

GTAAAAATA CACACT TTCGTGTGTACCTAAAAAAAAAATCGTTGAAAATCAAGGCCAAAG 

GTAGTGCAATTTTTTCATTMGATTTAAAAAAAAGGGAATGATAGTCTTTGAAAGAAAAC 

AGTAGGCA TCCA GCACTGGACAAAACATGGGTATCAAAGATGAATAATCTTTGGAGATTC 

TGGCAGTGTTTTCCCAGAACAAGTCMGTGGAMGTGGAGAAATTATCTGTATMTTTTG 

GACACATACAATGGCAGTTTATCAMGGGTTTTGTTCTGTGGCCTGAATTTACTGGGGTC 

CTACCTACACATTGAACATGTTTTGGCTGGC I I ) N I I I I I I M I I CAACTTGCCAGTTT 

CACTTTACATGGTTAGTAATAAATGGTTTCCACGGGGTGAGTTGGGATAAMTTNTTNAA 

AACATNTTAAATTCCA 

Sequence 1623 

GGAGTCGACCNCGCGTCCGAGCCGGGCCGGGGCGATGTGGAGCGCGGGCCGCGGCGGGGC 

TGCCTGGCCGGTGCTGTTGGGGCTGCTGCTGGCGCTGTTAGTGCCGGGCGGTGGTGCCGC 

CAAGACCGGTGCGGAGCTCGTGACCTGCGGGTCGGTGCTGAAGCTGCTCAATACGCACCA 

CCGCGTGCGGCTGCACTCGCACGACATCAAATACGGATCCGGCAGCGGCCAGCAATCGGT 

GACCGGCGTAGAGGCCGTCGGACGACGCCAATAGCTACTGGCGGATCCGCGGCGGCTCGG 

AGGGCCGGGTGCCCGCGCGGGTCC 

Sequence 1624 

CGCGTCCGGGGCAGCCGCGCCCGCGGAGTTTTCCGCCCGGCGCTGACGGCTGCTGCGCCC 

GCGGCTCCCCAGTGCCCCGAGTGCCCCGCGGGCCCCGCGAGCGGGAGTGGGACCCAGCCC 

CTAGGCAGAACCCAGGCGCCGCGCCCGGGACGCCCGCGGAGAGAGCCACTCCCGCCCACG 

TCCCATTTCGCCCCTCGCGTCCGGAGTCCCCGTGGCCAGATCTAACCATGAGCTACCCTG 

GCTATCCCCCGCCCCCAGGTGGCTACCCACCAGCTGCACCAGGTGGGTGGTCCCTGGGGG 

AGGTGCTGCCTACCCTCCTCCGCCCAGCATGCCCCCCATCGGGCTGGATTAACGTGGCCA 

CCTATGCGGGGGCAAGTTCAACCAGGGACTATCTTCTCGGGAATGGCGGCCAACATTGTC 

TGGGGACATTTGGAGGGAGCCAACATGCCCAAACCTGGACCCTGGGGCCCCTGGGGGCTG 



# 
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Sequence 1625 

CACGCGTCCGGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGCGGCCTCCCGGCGGGAGC 

GAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGGCGGCTGCGGGCG 

CAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGGCGGCGGT 

CCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCGGC 

TCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAAGATGTTCCGCGAGGTTGAGGAAC 

TGATGGAGGACACGCAGCACAAATTGCGCGCGCGGTGGAAGAGATGGAGGCAGAAGAAGC 

TGCTGCTAAAGCATCATCAAGAAGTGGAACCTGGCAAACTTAC 

Sequence 1626 

CCACGCGTCCGGCCGGGGGGTGCCCCCGGGACGTAGCGCCGCGGAGAGGAAGCGGCAAAG 

GGGACCATGCGGCGCCTGACTCGTCGGCTGGTTCTGCCAGTCTTCGGGGTGCTCTGGATC 

ACGGTGCTGCTGTTCTTCTGGGTAACCAAGAGGAAGTTGGAGGTGCCCGACGGGACCTGA 

AGTGCAGACCCCTAAGCCTTCGGACGCTGACTGGGACGACCTGTGGGACCAGTTTGATGA 

GCGGCGGTATCTGAATGCCAAAAAGTGGCGCGTTGGTGACGACCCCTATAAAGCTGTATG 

CTTTCAACCAGCGGGAGAGTGAGGCGGGATCTCCAGCAATCGGGCCATCCCGGACACTCG 

CCATCTGGAGATGCACATGGCTTGGTGTATTGGACGGGACCTTCCACCCACTT 

Sequence 1627 

GCCACGCGTCCGCCGCCCGCTTGCCCGTCGGTCGCTAGCTCGCTCGGTGCGCGTCGTCCC 

GCTCCATGGCGCTCTTCGTGCGGCTGCTGGCTCTCGCCCTGGCTCTGGCCCTGGGCCCCG 

CCGCGACCCTGGCGGGTCCCGCCAAGTCGCCCTACCAGCTGGTGCTGCAGCACAAGCAGG 

CTCCGGGGCCGCCAGCACGGCCCCAACGTGTGTGCTGTGCAGAAGGTTATTGGCACTAAT 

AGGAAGTACTTCACCAACTGCAAGCAAGTGGTACCAAAGGAAAATCTGTGGCAAATCAAC 

AGTCATCAGCTACGAGTGCTGTCCTGGATATGAAAAGGTCCCTGGGAAGGANGGGGGCTT 

GTCCAAGCAAGCCCTACCACTCTCAAACCTTTACGAGACCCTGGGNAGTCGNTTGGATCC 

ACCACCACTCAAGCTGTACACCGACCGCACGGAGAAGCTGAGGCTGAATGGGGAGGGGCC 

Sequence 1628 

CCTAAGGGCAACAAGGGCGGTCCTGGCCAGCCGGGCTTTGAGGGAGAGCAGGGGACCAGA 

GGTGCACAGGGCCCAGCTGGTCCTGCTGGTCCTCCAGGGCTGATAGGAGAACAAGGCATT 

TCTGGACCTCGGGGAAGCGGAGGTGCCGCTGGTGCTCCTGGAGAACGAGGCAGAACCCGG 

TCCACTGGGAAGAAAGGGTGAGCCCGGAGAGCCAGGACCAAAAGGAGGAATCGGCAACCG 

GGGCCCTCGTGGGGAGACGGGAGATGACGGGAGAGACCGGAGTTGGCAGTGAAGGACGCA 

GAGGCAAAAAAGGAGAAAGAGGATTCCCTGGATACCCAGGACCAAAGGGTAACCCAGGTN 

AACCTGGGCTAAATGGAACAACAGGGACCCAAAGGCATTNAGAGGCCCGAAGGGGA 

Sequence 1629 

AGTCGCCCCGCGTCCGCTGTGCCTGAAGGAGACTGGTTTTGTCCAGAATGTCGACCAAAG 

CAACGTTCTAGAAGACTCTCCTCTAGACAGAGACCATCCTTGGAAAGTGATGAAGATGTG 

GAAGACAGTATGGGAGGTGAGGATGATGAAGTTGATGGCGATGAAGAAGAAGGTCAAAGT 

GAGGAGGAAGAGTATGAGGTAGAACAAGATGAAGATGACTCTCAAGAAGAGGAAGAAGTC 

AGCCTACCCAAACGAGGAAGACCACAAGTTAGATTGCCAGTTAAAACAAGAGGGAAACTT 

AGCTCTTCTTTCTCAAGTCGTGGCCAACAACAAGGAACCTGGAAGATACCCTTCAAGGAG 

TCAGCAGAGCACACCCAAAACMCTGTTTTCTTCTAAAACTGGGTAGAAGCCTAAGAAAG 

ATAAACTCTGCTCCTCCTACAGAAACAAAATCTT 

Sequence 1630 

TNCGGGCCTGGTGAGCACCGCCGAGGCGCGGGCCAGCTCTTCGAGGTTGTGCGCGGGAGT 

GGCACGGCGGGCCGGGCCGAGCGAGGGGCTAACTTCAGCGGTGGCACCGGGATCGGTTGC 

CTTGAGCCTGAAATCATGACCACCCCAGGAAAAGAGAACTTTCGCCTGAAAAGTTACAAG 

AACAAATCTCTGAATCCCGATGAGATGCGCAGGAGGAGGGAGGAAGAAGGACTGCAGTTA 

CGAAAGCAGAAAAGAGAAGAGCAGTTATTCAAGCGGAGAAATGTTGCTACAGCAGAAGAA 

GAAACAGAAGAAGAAGTTATGTCAGATGGAGGCTTTCATGAGGCTCAAGATTAATAACAT 

GGGAGATGGCCCAGGGTGGGTGTCATCACTTCTGACATGATTGAAATGATATTTTCCAAA 
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AGCCCAGAGCAACAAGCT1TCAGCMCACMGAAATTCAGGAAGCTGCTTTCAAAAAGAA 
CCTAA 

Sequence 1631 

CGGAGTCACCACGCGTCCGGGGCTGCCAAGGGAGGAGGAAGATGGCGGCGGGGGCGAGGT 

GAGGTGTTGGCAGTGGAAAGGGGTTCGGGCTCGGGGGGCGGGGGGACGCGGAGCGATGGC 

CCGCGCCGGCCGCAGGGGCGGATAAAAAGCCGTCGCGCTGCGGGAGTGGGCGGGAGGGAG 

AGGGGGTGTCCGAGGGCCACAAGAGTATGACGGGGCTGTACGAGCTGGTGTGGCGGGTGC 

TGCACGCGCTGCTCTGTCTGCACCGCACGCTCACCTCCTGGCTCCGCGTTCGGTTCGGCA 

CCTGGAACTGGGATCTGGCGGCGCTGCTGCCGCGCCGCCTCTGCGCGGTCCTAGCGCCGC 

TCGGCTTCACGCTCCGCAAGCCCCCGGCAGTCGGCAGGAACCGCCGTCACCACCGGCACC 

CGCGCGGGGGGGTCGTGCCTGG 

Sequence 1632 

CGTCCGTTTGTTTAATA I I I I I I I ICTCTCTTGAACAAAACTGAGATAATTTAGAAAACA 

GGTGCTTAATTGCAATAAAATTACTATGAAGTATATTAAAAATCACGACATTGTAAAATC 

TCACTTTAGATCATCAAAGAAAACCATTGTTACTATCTGCTTTGAGCTTAGGAAAATGTA 

CAAGAGAACAAATTAAAATTGAAAMTTGATTTCACTTAGAAAAACTTCTAGGAACAGGG 

TGAACCACTGATTTTAATTTGCCTAATTATCTTATGACAAGTATCAAATTAAGATGACAC 

TTAMGGATCCTTAGCATTTAACTTAATGATGGAGAAAGAGTGCTCAATAGGACAGTTCC 

CCAGTTAAGGGGTAATGGAGATGCCCATTTTCAGGAGGACCATTCTAAGMGATATTTTT 

GGATTCATTAAAAACATTTAAATAAAAAGCCCTTCTTCAAGATTGGGAAC 

Sequence 1633 

CGCGTCCGCCGGCCTGGTGCACGCGGCGCACCGAGGCCTCCCGCAACGCCGCCGACAAGG 

AGCGGGCGGCGGGCGGCGGCGCCGGCAGCAGCGAGGACGACGCGCAGAGCCGCCGCGACG 

AGCAGGACGACGACGACAAGGGCGACTCCAAGGAAACGCGGCTGACCCTGATGGAGGAAG 

TGCTCCTGCTGGGCCTCAAGGACCGCGAGGGTTACACATCATTTTGGAATGACTGTATAT 

CATCTGGATTACGTGGCTGTATGTTAATTGAATTAGCATTGAGAGGAAGGTTACAACTAG 

AGGCTTGTGGAATGAGACGTAAAAGTCTATTAACAAG 

Sequence 1634 

CCCCGCGTCCCGGTTGGCCGGGCGGAGGTCTTCGCTGAGGCCCGGGGCGGGGTGGCGCCA 

CCCCTGATTGCGGTGCCACGGACTGCTCCTGCTGGGCGGAGAGGACAGATTTTGCAAAGC 

GGAGGCTTGCGACGGGTCCTGCAGGGGGACAGTGAGGAAAGGGCCCGCCTCGTNTCCGCT 

CCTGGGGGACCCGCAGAAATAAGAATCAAACTCCACAATGACAACCTATTTGGAATTCAT 

TCAACAAAAATGAAGAACGAGAATGGGAGTCCCGATTTAGT 

Sequence 1635 

CCACGCGTCCGGGCGGGGCCATCCAAGCAACGCTGAAGGCCTTTTCCAGCAGCTGGGAGC 

TCCCGGATTGCGTGGCACAGCTGAGGGGCCTCTGTGATGGCTGAGCTCTCTTATGTCCTA 

TACTCACATCAGACATGTGATCATAGTCCCAGAGACAGAGTTGAGGTCTCGAAGAAAAGA 

TCCATGATCGGCTTTCTCCTGGGGCCCCTCCAATTGTTTACTGTTAGAAAGTGGGAATGG 

GGTCCCTAGCAGACTTGCCTGGAAGGAGCCTATTATAGAGGGGTTGGTTTATGTTGGGGA 

GAATTGGGCCTGMTTTCTCCACAGAAATAAGTTGCCATCCTCAGGTTGGCCCTTTCCCA 

AGCACTGTAAGTGAGTGGGGTCAGGCAAAGCCCCAAATGAGGGGTTGGTTTAGATTCTGA 

CAGTTTGCCAGCCAGGCCCCACCTCAGCGTCTGTCGAACAAACAAAGNTNGGGNGGGTTT 

N 

Sequence 1636 

CCNCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCTTCTGCAGCAAGCTCAGGAGAGCT 

GCTGTCTTCCCTCCCGCCCACCAGCAACGCACCCTCTGACCCTGCCACAACTACTGCAAA 

GGCAGACGCTGCCTCCTCACTCACTGTGGATGTGACGCCCCCCACTGCCAAGGCCCCCAC 

CACCGTTGAGGACAGAGTCGGCGACTCCACCCCAGTCAGCGAGAAGCCTGTTTCTGCGGC 

TGTGGATGCCAATGCTTCTGAGTCACCTTAACTTTGAACCATTCTTTGGAATTGGCGTGG 

TATATTTAACCACGGGAGGCGTGTCTGGAAACGCAAACTATCATTAATTTCATACTAGGT 

TTGTACCGTATCTGTAGGCATTCCTGTAAATAATTCCAAGGGGGAAAACTAAACNGGGAC 
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GTGGGG7T GTATC CTGCCAGGTTTGAGTGGGGGCTCACACCTAGGGTGAGAAGTCAGAAA 

GCGCTTGTATTTTAAACAACCAAAAAGAATTGAAAGGGTG 

Sequence 1637 

CGTCCNATAGGCTTGCACACTTTTCTAACTACATGTTTAAGTGGCAGAGTCCAGGCTGTC 

GAGTCACGGTTGGGT TTGA ATCTGACTCCACCAGTMCTTTGGTTGGAAAAAATCACTTA 

TCCTCTTTAAG CTTGA TTTTATTTATTTTATTTTATGTMGAGTGAGACAGTAGTAG 

MTAGGGTTGCTTTTAMTTAGAGTGMCATGAGGCATTTATTCGGTGCCAGACAGATAA 

CTGCCTATAACAGGATGTGATCAGCACAAGTAACAGAAAATTAGCCTGGACGGTGGCTTA 

AGCAATGGGGAATGTTTATCTCACATAGCAAAMGGTCTGTAAATAGGATGGTTTTAGAG 

TTGGGGTGGGGAAGCCAAAAATGTCATCAGGATTTCCTTGGAACCCGT 

Sequence 1638 

CGCGTCCG GATTAA TACAACTCTTAAAAAATATAGTCAATAGGTTACTAAGATATTGCTT 

AGCGTTMGTT7TTMCCGTMTTTTAATAGCTTMGATTTTMGAGAAAATATGMGAC 

TTAGAAGAGTAGCATGAGGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAGAAGGAAAG 

AAGAAGGGGAAGAAAAGAAGGGGAAGAAGATCAAAACCCACCATGCCCCAGGCTCAGCAG 

GGAGCTGCTGGATGAGAAAGGGCCTGAAGTCTTGCAGGACTCACTGGATAGAAGTTATTC 

MCTCCTTCAGGTTGTCTTGAACTGACTGACTCATGCCAGCCCTACAGAAGTGCCTTTTA 

CGTATTGGAGCAACAGCCGTGTTGGCTTGGCTGTTGACATGGATGAAATTG 

Sequence 1639 

CGCGTCCGGCTCCCCGCACCCCCTCGCACTCNCTCTGGCCGGNCCAGGGCGGCCTTCAGC 

CCAACCTTGCCCAGCCCCACGGGGGCCACGGAACCCGCTNGATCTCGCCGCCAACTGGTA 

GACA 

Sequence 1640 

GTCGCCACGCGTCCGGCGGCCGGGCCGGGGCAGCCGGGAAGCGGGTGGGGTGGTGTGTTA 
CCCAGTAGCTNCTGGGACATCGNTCGGGTACGCTCCACGCCGTCNCAGCCACTGCTGTGG 
TCGCCGGTC 
Sequence 1641 

CGTCCGCTCCTCCCGCCTGAGGCGAGTCTGGGCTCAGCCTAGAGCTCTCCGGCGGCGGCG 

CAGCTTCAGGGCAGCGCGGGCTGCAGCGGCGGCGGCGGTTAGGGCTGTGTAGGGCGAGGC 

CTCCCCCTTCCTCCTCGCCATCCTACTCCTCCCTCCTCGTCATCCTCCCCCTTCGTCCTC 

CTCGCCTTGCTCCTCCTCGTCAGGCTCGACCCAGCTGTGAGCGGCAAAGATGGGCGGCGC 

CCAGGCCGCCGCCTGCCAGGCTGTCGGGGCGTCATGGTGCCGGCGCCCATCCAAGACCTG 

GAGGCCCTGCGCGCGCTCACGGCGCTCTTCAAAGGAGCAGCGGAACCGAGAAACAGCACC 

CAGGACTATCTTCAMAGGAGTTCTGGATATCCTAAAGAAATCTTTCTATGCTGTTGAGC 

TTGCCTGCANGAGATCCATCCCAAGTGGAAAACCT 

Sequence 1642 

ACATTTATCATGGATGCTGACCGGGAGAAAGAAAGAAAGAAACGGGAGGAGCGGGAGCGT 

AAGCGGCGGAAGGAGGAGGAGGTGCAACAGCCAAAGTTGGCAGAGGAGAGACGGCGGCAG 

AATTTACAGGAGGAAAAGGAAAGGAAGTTGGAATGCCTGCCCCCTGAACCTTCCCCTGAT 

GACCCTGAAAGTGTCAAGATCATCTTCAAATTACCTAATGATTCTCGAGTAGAGAGACGA 

TTCCACTTTTCACAGTCTCTAACAGTAATCCACGACTTCTTATTCTCCTTGAAGGAAAGC 

CCAGAAMGTTTCAAGATTGMGCCMTTTTCCCAGGCGAGTGCTGCCCTGCATCCCTTC 

AGAGGAGTGGCCCAATCCCCCTACGCTACAGGAGGCCGGACTTAGCCACCAGAAGNTCTT 

TTTGTTCAGGACCTMCTGACGMTGACATTTTTTTCTTTCTGTCCCCJCCTACCCCAGT 

CCCTAAAAGAAATGGGGNAAAAAGGAAACAACAGCAGTCNTAAAAAAAAAA 

Sequence 1643 

CGCGTCCGGAGGGGCTAAGAAGGTTGTCCTTGCCTAATGCTCTGATCTGTAAGTGAATAG 

GGCAGMCAGTTCAGCCTTGAGGTTAGAATTTAGCAGGAGCTATCCTGACTTAATATCCA 

GTTGTGGGGTTTGCAAAACAAAACAGCTGTATGTAATCATTGCCACTAGTTCCATCTAGA 

ACTCCTTTCTAGTTTGTTATTTTTAAAATGTTTATACATAAAACCACCAAAATACATAGC 

TTCGACAAGATGGAAGTTTATTTCTCTCTCCCATAACAGTGCAGTGATAGTCAGCTGGTC 
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CAGGCCAGGCAAGGGGCTGGTCCATGATGTCATCAGGCACCCAGGTTCCTACTGGCTTTG 

CATGTGGCCACAGTTAGCAACAAANGGAGGCTGTAAATTT 

Sequence 1644 

CGCCACGCGTCCGGTGATGCGGACCCCGGGCGGGCGCAGGGCGCGGGCTCCGGCGCCGCC 

GCTGCGTCCTCCCCGGCCGCGGGCGAGCCGCTGCAGAGGGAGCGTCGCGCCGGGGCGGAG 

TGCGGGCTTGCGCGGCAAGTGCGCGCCGAGGTCACGAAATGGATTGGAGTGAACCGGAGA 

CCCCGAAAACGGAAGCGCAGGGAGAAGGAAGAGGTGTTTGAAAAGCTTCTTCCAGACCAG 

CTGGTCTTGCTTCTGGAGCATCTCTTGGAGCAGAAGACTCTGAGCCCCCGAACTCTGCAA 

AGCCTCCAGAGGACATACCACCTCCAGGATCAGGATGCAAGAGGTTCGCCATCGGTGGTG 

TGAACTCATTGTTAAGCACAAGTTCACGAAAGCCTACAAAAGTGTGGAGAGGTTCCTTCA 

GGGAGGATCAGGCCATGGGGTGTGTACCTCTACGGGGAGCTGATGGTTGAGTGAGGACCC 

CAGAC 

Sequence 1645 

TCGCCACGCGTCCGGGACATCGAGTNCGGGCTGGCTACGAACTCCTCGGGGGCGAAGGTG 

GCGGAGAGGGATGGGTTCCAGGACGTCCTGGCGCCCGGGGAAGGCTCGGCGGGACGGATT 

TGCGGTGCGCAGCCAGTGCCGTTCGTCCCTCAGGTGCTTGGCGTGATGATCGGGGCCGGA 

GTGGCGGTGGTGGTCACGGCCGTGCTCATCCTCCTGGTGGTGCGGAGGCTGCGAGTGCCA 

AAAACCCCAGCCCCGGATGGCCCCCGGTATCGGTTCCGGAAGAGGGACAAAGTGCTCTTC 

TATGGCCGGAAGATTATGCGGAAGGTGTCACAATCCACCTCCTCCCTCGTGGATACCTCT 

GTCTCCGCCACCTCCCGGCCACGCATGAGGAAGAAACTGAAGATGCTCAACATTGCCAAG 

AAGATCCTGCGCATCCAGAAAGAGACGCCCACGCTGCAGCGGAAGGAGCCCCCGCCCGCA 

GTGCTAGGAAGCTGAC 

Sequence 1646 

TCCGCCAAGTCCTGCGATGATGGACTCAACACCTTCCGCGACGAGGGGCCGGGTTCTGCG 

GCGCCTGCCAAACCGCATACCCAGCCTGCGGATGCTCCGGAGCTTCTTCACCGACGGGTC 

CTTGGATAGCTGGGGCACCTCTGAAGATGCTGACGCTCCTTCTAAGCGACACTCAACCTC 

TGACCTCTCAGATGCGACCTTCAGCGATATCAGGAGAGAAGGCTGGTTGTATTATAAGCA 

GATTCTCACCAAGAAGGGGAAGGCTGAGGACCGGGATGACATGCTGGGCTGGATCAGAGC 

GATCCGGGAGAACAGCAGGGCCGAGGGCGAGGACCCCGGCTGTGCCAACCAAGCTCTGAT 

CAGCAAGAAGCTTAATGATTATCGCAAAGTGAGCCATAGCTCTGGGCCCAAAGCTGATTC 

CTCCCCCAAAGGCTCTCGCGGCCTGGGGGGCCTCAAGTCTGAGTTCCTCAAGCAGAGTGC 

GGCCACGTGGCCTCANGACTCAAGACCTGCCCGCAGGGAGCAAGGATGACAGTGCTGCAG 

CCCCCAAAACCCC 

Sequence 1647 

GGTGTCGCCCCGCGTCCGGTTTCTTCCTAATTTATATTTCCGATACATANGTGTAGAACA 

GGAATTTGCAGMGCCATTTMGTTATCTTTTGAGGTAANGCTCTGATTTAGCATTTA7T 

CTGATAA AATCT AATACATCATGGGATATATATAAAGCAACTTAATTCTTGTGGTGTAGT 

CTTAATAGTTTTGAATGTTGACTGMTGTCTATAAAATTGTGAGTTTGTCTTTGTTACAT 

TCCAGTGTTTCTGCCTCTTGGCATGCTTAAAGCACGGCTTACTTCATCTGCTCCTTACAC 

ACTAAAATGCTGTTAGTGTGCTCAACTACAGAAATAGCCGCTGCTAAGTTGATGTAGATT 

TTCTACTTGAA TATTT TTATGGTTGTAGGMCCTCAGGAGGGTCAGTGTTTACTGGTTT^ 

TATATGCCTTCTTTTTCCTGTTTGAGCTTCTCTCTTTGMGGGATTCTAACAGAACAAAA 

GCTGCTGATCACCCTAAGTTGGAAACAGNAAAGGNGTAATTAATATAACTTAATGC 

Sequence 1648 

TCACCACGCGTCCGAAAGTCCGTGACATGGTTCCCCGTGGTGGCCCGTGGCAGCCCGTGG 

CATGGCGTGGCTCAGCTGTCTGTTGAAGTTGTTGCAAGGAAAAGAGGAAACATCTCGGGC 

CTAGTTCAAACCTTTGCCTCAAAGCCATCCCCCACCAGACTGCTTAGCGTCTGAGATCCG 

CGTGAAAAGTCCTCTGCCCACGAGAGCAGGGAGTTGGGGCCACGCAGAAATGGCCTCAAG 

GGGACTCTGCTCCACGTGGGGCCAGGCGTGTGACTGACGCTGTCCGACGAAGGCGGCCAC 

GGACGGACGCCAGCACACCGAAGTCACGTGCCAAGTGCCTTTGATTCGTTCCTTCTTTCT 

AAAGACGACAGTCTTTGTTGTTAGCACTGAATTATTGAAAATGTCAACCAGATTCTAGAA 
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ACTGCGGTATTCCAGTTCTTCTGACACCGGATGGGTGCTTGGGAACCGTTTGAGCCTTAT 

AGATCATTTACATTCAATTT 

Sequence 1649 

CNCCACGCGTCCGGGGATCCCTGGGGAGAGTAACAGTGGCCCCACATCCCTCCTCCTGGG 

AGACGCTGGTGCAGGGCCTCAGTGGCTTGACTCTCAGCCTAGGCACCAACCAGCCCGGGC 

CTCTGCCTGAAGCGGCACTCCAGCCACAGGAGACAGAGGAGAAGCGCCAGCGAGAGAGGC 

AGCAGGAGAGCAAAATAATGTTTCAGAGGCTGCTCAAGCAGTGGTTAGAGGAAAACTGAG 

ACGTGCACCCCCATGGGATGGAGACCCGAAGGGACTCAGACGGAGCCGCCGTGTTGGCAG 

CGCCTGGGTGTGGGCCCATTTTGGGGACCAAACAGCAAGCTGTGGTCGGATGAGTGCCAG 

GACCTGTGTACCGGGACACGTGGGGAGTCCTCCCAGCATGATGCTTGACTGACCCGAGGA 

AGGTCCTCATGTTTCGTGCCTGTCATTCTCGGATGGCTGTGAGGCATTCCTTGGCAAGGG 

ATGCTTGCGTACCAAGCGGTCCTACCGCATCTACATGGCTTCTGTGATGCATGTTGTCGC 

TTTCCCACCCNGGAT 

Sequence 1650 

CGCGTCCGAGCTTTGCAGGGAAGAACAGAGTATGGGTCATCTCAGCCCCTCATGCCTCGG 

AAGGCTACTACCGCCTCATGATGAGCCTGCTGAAGGACGATGTGTACTGTGAGCTGGCGG 

AGAGGCACATCCAACAGATTGTGCTCTTCCACCAGGCAGGTGAGGAAGGAGGCAAGGTGA 

GAAGGATCACCAGCGAGGGCCAGATCCTGGAGCAGCCCCTGGACCCTAGCCTCATCCCTA 

AGCTGATGAGCTTCCTGAAGCTGGAGAAGGGCAAGTTTGGCATGGTGCTGCTGAAGAAGA 

CGCTGCAGGTGGAGGAGCGCTATCCATATCCCGTTAGGCTGGAAGCCATGTACCGAGGTC 

ATNGACCAAGGCCCCATCCGTAGGATCGAGAAGATCAGGCAGAAGGGCTTTGTCCAGAAA 

ATGTAAGGCCTCTTGGTGTAGAGGGCCANNGTTGGTTGNCTGAGGGGGAATTGACCCGTT 

GGAAGGGGGAAGCAATGAAAGGGCCAAAG 

Sequence 1651 

CGCGTCCGGGATGCGTTGGGTCTGAAAGTCGATGAAGGACGCGATTACCTGCGATAAGCT 

TCGTGGAGTTGGAAATAAACTATGATACGGAGATTTCCGAATGGGGTAACCTAACTGAGC 

MACCTCAGTTGCATTTTGATGAATCCATAGTCAAATTAGCGAGACACGTTGCGAATTGA 

AACATCTTAGTAGCAACAGGAAAAGAAAATAAATAATGATTTCGTCAGTAGTGGCGAGCG 

AA AGCG AAAGAGCCCAAACCTGTAAAAAGGGGTTGTAGGACATNTTACATTGAGTTACAA 

MTTTTATGATAGTAGAAGAAGTTGGAAAGCTTCAACATAGAAGGTGATATTCCTGTATA 

CCGAAATCATAAAATCTCATAGATGTATCCTGAGTAGGGCGGG 

Sequence 1652 

GTCGCCNC GCGT CCGCAACATTATTGAGATTGTCGTGTATAGTCATCGAATATCAGCCAG 

TTCCTGTMTTTTGTGACACGCTCTCTGCCMGCCCACCAAGTATTTCCTTTATAGCTAA 

AAGTTCCATAGTACTAAGGAAATAAAGCAATAAAGACAGTCTCAGCAGCCAGGATTCTGG 

CTGAAGGAAATGATCCGCCACCCTGAGGGTGGTGATGGTAGTTTCTACCCATACCTCAGC 

CTCAGGCGAGTGGCTTATAGCCTCCATTCATGGTGCACTTTATTTATGGTACTAAGATAA 

AGACTGTCAATCCATTGATTTATCTCCTCCTGTCCCCCATCTAAAATACCCATGCTGCTT 

TTCTGGAGTGTTGTGGGGGGGTTACCAGCTTGATCCACTGGTGCTCTTTAAGAAGGCCCA 

AGAAAGGTCTTTGGGGCATTGCCAAAGAAAATCCCGGATTTATGTGGGAAAACCCTCACT 

TTTCTCTTTACNGGCTGGTACCAAGA 

Sequence 1653 

CCGTCCGI I II I I I I I I G AACTTACCGTAAAA I I I 1. 1 I I I I AAAAAGTGCTTGTAAAAAT 

TAAAGAGGAATAAAAGGGGGGTGAACAGCCAGTACGATAGTGCATGCCTGAAATTCCAGT 

GCTTTGGGAGGCCGAGGCAGGAGGATCGTTTGAGGCCAGTAGTTGGAGAGCAGTGTGGGC 

AACGTAGCAAGACCCCATCTCTACAAAAAATTTAAAAGTTAGCCGGGCATGGTGATTCAC 

ACCTGGAGTTCCAGCTGCTGTGGAGGCTGAGGTGGGAGGATCGCTTGAGCCCAGGAATTT 

GAGGCTGCAGTGAGCCATGATTGCACCACCGCACTTCAGCCTAGGTGACAGAGCAAGGGT 

CTACCTCAGAAAAAAAAAAAAAGGAGGAGCAAGCACGTGTTGATGGGTGGAAATCCAGCC 

AGAAATGCTGAGGCTGAAAAGATTGTTCCCAGTTTNCTGGTAGCCAGGGGAAAAGGGGAA 

ATT 
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Sequence 1654 

CGCGTCCGCTGACATTTCCTAGGAAGCTNGGAAAAGGAAAGTGAAGGAATGGTCTAAAGA 

MTGACCATTCACACTGATTTTGTCTGGACAGTCTGGCCGCAGGTTATTCATAGATTATT 

CAGCCTTTGCAGGACTTGGATTCAGGGTTTTACTCAGTCCCTTTACCTTAGGTGGAATCT 

TCTTAAATTGAAATTTTTGGTMGGAATTCATTTCACAGGTAGTGTTTCAGACTCTGAAA 

GCCCTGACTTGGTTCTTGGCTTCTACTGTACTAGTTACTAGTTACTGGTACTGTTGCCAA 

GCMTCTGCTTGGAATTTGTGGATCCCTGCTGTTCCCTAATCCCACCCCCTGCCCTGAGA 

CAGTGAATGTAGTCCGTGAAGGGAGTGCCTCTCTGGGACCCCCTGTGTTG7TACAGGCTG 

TGCA GTGCA ACAATTCCAGCAAAAATACCCTATCCCCGCACTTAGTCATTCGTGGTAACT 

AACAATTTTGAAATACTCATATAAAATGAACAGGAAAGTGGTTAGTG 

Sequence 1655 

GACCACGCGTCCGGACCCAGACCCGGCTGACCCACCTACCCGCGATCCTGCCCATGGCTG 

ACGGGCTCTTTCGGCGCAGACCCTGGGGTCTCGAGCAGATTCGCCCGGACCCCGAGTCCG 

AAGGCCTGTTTGACAAGCCTCCCCCGGAAGACCCTCCCGCTGCCCGCGGGCCCAGGTCGG 

CGTCGGCCGCGGGCAAGAAGGCTGGTCGGCGCGCGGGCGGGAGGGCGCAGGGGGGCCGCG 

CCGGGCAGCCCCCGAAGGCCGCATCGCXaCCCCCCGCCCAAGAAGGAGGCGCCTCCACTGG 

ACGAGGGCTGCTATCTCGACCATTTTCCGCACCTCTCCATCTTCATCTACGCAGCCATCG 

CCTTCTCCATCACCTCCTGCATCTTTACCTATATCCATTTACAGCTTGCCTGAGTGGCCA 

GCGCGGGACGGGGTGGGCGCAGGACCGAGCGGGGAGGGAAAGGGGAAAACGGGGGCTNGG 

CATTTTGTGTTTTAG 

Sequence 1656 

CNCTAACCCCGAAC TCTAG ATCGTCTTGCTTGTTTGTCTGAAGAAGGGAATGAAATAGAA 

AGTGGAAAAATAATATTTTCAGAGCATCTTCCCTTAAGTAAGCTACAGCAAGGCATAAAA 

TCTGGTACATACCTTCAAGGAACATTTAGAGCTAGCAGGGAAAATTACTTGGAAGCTACA 

GTATGGATTCATGGCGACAGTGAAGAAAATAAAGAGATAATCTTACAGGGACTTAAACAT 

TTAAACAGAGCTGTTCACGAAGATATTGTGGCTGTGGAGCTTCTCCCCAAGAGTCAGTGG 

GTAGCACCATCTTCTGTGGTTTTACATGATGAAGGTCAAAATGAAGAAGATGTGGAGAAA 

GAAGAAGAGACAGAACGAATGCTTAAGACTGCTGTAAGCGAGAAAATGTTGAAGCCTACA 

GGTAGGAGTTGTAGGAATAATAAAAAGGAATTGGA 

Sequence 1657 

CGTCCGCGGACGCGTGGGCGGACGCGTGGGCTGGCTGTATCTATACTTTCCTTGAGAAAA 

ATCCCATAAAGTGGATGGACCTGTGAAGAAAATGTATGCTTATGGCCTAGCCTTCATGTC 

TGGCTGATGTATCCTATAAGGCAGTAAGCCCCTTTTCTAGTCTCTGGTAAGATGCAAGAG 

CTCATATCCCCATCACTGACATTTTAGTTTGGAAATAATATTGAGACTGTGCTATGACCA 

ACCCCTGATGTTGTTTTTTCTTTTCAAACTTTTGCATATGAGTAGAG 

GTTAAGTATTTATGTCTGGGGGGATACCTTCAGGTGTCTTATCTGTTTTATGCAAGAATT 

TATGTGTTCATCTTTATTCAGTGCAAAG A I I I I I I I I rAAATTTTGTTTATAATTGGAGG 

TAACATTMGACMCTCTTNCTCCACAAGAAAACCTCTAAAATTAATATTCCTTAAGATT 

TGGTTTTCCTTTGCCTTATAATATTACCTTTTAAT^ 

AAAAG 

Sequence 1658 

GTCGCCCCGCGTCCGTTTGATATACCACTCTGATAACTCATATAAAAATATCATCATAAA 
MGCTTMTTTCATCCCTTTTATGTTGGTTTTAAAAGGT^ 

ATTGAGAACTCTTACATAGTAGAATCCATTCTATAATACATGTGTTGACAAAGCTTTAGA 

GAAAGTTTCCTATTCTCTTCCATTTCCCCTGCCCAAAGTGCTGACATAGGCAGTGATGAA 

GMTCTTTACCAAGATTTTCAGGGTGTACCTATGAMTTGCTTTAAATGCACTGCTGGTG 

TAAATAATTAGCAAGCAAAAGCGTTTCTGTGACTTCAGGTACCAGCTTAAAGAGCACTAG 

GGATGGGGAACGAATGCCAAATCAGACTCCACCTAGAGCACCAGGAAACAGCTTGTCCCT 

GGTAGGGAAATGGTGTTGCTGAAAG 

Sequence 1659 

CGACCNCGCGTCCGGCTGNTGACCCCATGCTGAGTGGCCNGTGGGGAGCGGCGCCCGGCA 
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GGCTC TTCTGGGGTCGTCTGTCCTATCCGTGGATTGTATATACTCTTCTCTGTTAAGGAG 

TTTTTCCCAAGAAGAAAAGTATTTAAAAGAAATACCAGTGAGTGCCTTAAAG7TGGAGAA 

GTAACTGCCCATGCCCAGAAATAAGGATGCCAGTGCCCAGAAGCAGTGAGATTAGTCTGT 

GTCCACAAGCAGAGGCCCCCTCGATGGGAGGGAGTGGCAGGCAGGAGAAGGTGGCGCTGC 

CAGGTGCCCGGGTCTATTGGAGGCGCCCCATCTCAGACTTCCTAACACAGCCTGTGTGGA 

AGGCAGAACAAAGAATGCATGCCCAGTCAGAAATCTGTTCTATTCTGCTCCAGGAAAATC 

GGAAACCTGTGAGTCANAGTCAGAGAAACTTACCCAGCCACGTATTCCTGT7TCATGGGT 

NCTGTAGATGTTTTGAGTCAAGGAAGGTA 

Sequence 1660 

TCGACCNCGCGTCCGGTGGGTCCCTGCCGGCCGGCGGCGGGCGCAGACAGCGGCGGGCGC 

AGGACGTGCACTATGGCTCGGGGCTCGCTGCGCCGGTTGCTGCGGCTCCTCGTGCTGGGG 

CTCTGGCTGGCGTTGCTGCGCTCCGTGGCCGGGGAGCAAGCGCCAGGCACCGCCCCCTGC 

TCCCGCGGCAGCTCCTGGAGCGCGGACCTGGACAAGTGCATGGACTGCGCGTCTTGCAGG 

GCGCGACCGCACAGCGACTTCTGCCTGGGCTGAGCTGCAGCACCTCCTGCCCCCTTCCGG 

CTG CTTTG GCCCATCCTTGGGGGCGCTCTGAGCCTGACCTTCGTGCTGGGGCTGCTTTCT 

GGCTTTTTGGTCTGGAGACGATGCCGCAGGAGAGAAGAAGTTCACCACCCCCATAGAGGA 

GACCGGCGGANAGGGCTGC 

Sequence 1661 

GGTGTCGACCNCGCGTCCGGCGCCCCGCTCGCATTGTTCGGGCGACTCTCGGAGCGCGCA 

CAGTCGGCTCGCAGCGCGGCACTACAGCGGCCCCGGCCCGGCCCCCGCCCGGCCCCGGCG 

CAGGCAGTTCAGATTAAAGAAGCTAATTGATCAAGAAATCAAGTCTCAGGAGGAGAAGGA 

GCAAGAAAAGGAGAAAAGGGTCACCACCCTGAAAGAGGAGCTGACCAAGCTGAAGTCTTT 

TGCTTTGATGGTGGTGGATGAACAGCAAAGGCTGACGGCACAGCTCACCCTTCAAAGACA 

GAAAATCCAAGAGCTGACCACAAATGCAAAGGAAACACATACCAAACTAGCCCTTGCTGA 

AGCCAGAGTTCAGGAGGAAGAGCAGAAGGCAACCAGACTAGAGAAGGAACTGNNAACGCA 

GACCACAAAAGTTTCACCAAGACCAAGACACAATTATGGCGAA 

Sequence 1662 

GACCACGCGTCCGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATT 

TTCCTGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGT 

CCTTAGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATC 

CCCGGATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTNCTGA 

CCTACGTGTACTTCGTTCTCTCACTTGGGCCTCGNATCATGGCTAATCGGAAGCCCTT 

Sequence 1663 

GTCGACCACGCGTCCGGGCTCCATCCGGGCTATCCTGCCGCCTTAGCGGCTGCTTCTCCC 

CAGGATGCGGGCAGGGGGCCTCTCTCCCACTCCCCACACACCGATTTCTGAGTAGCGATA 

GGGGCTGGAGGCTTATTTTATGGGGTAGGGGGCCGCTGGTAGGCGAAGATTGTCCGAGGG 

AGAGGGGGAGGATGAAGCCAGTGCGTGGCGGAGACTTGCCAGATGTTGATGCCTAAGAAG 

MCCGGATTGCCATTTATGMCTCCTTTTTAAGGAGGGAGTCATGGTGGCCAAGAAGGAT 

GTCCACATGCCTAAGCACCCGGAGCTGGCAGACAAGAATGTGCCCAACCTTCATGTCATG 

AAGGCCATGCAGTCTCTCAAGTCCCGAGGCTACGTGAAGGAACAGTTTGCCTGGAGACAT 

TTCTACTGGTACCTTACCAATGAGGGTATCCAGTATCTCCGTGATTACCTTCATCTG 

Sequence 1664 

CCGCG TCCGG GGGTTGGTACCCGAGCGCCTTCCCCTCACCTCAACCAGAGAAGAGCATCC 

GGTTGCTTTTTAMGCTTTTAGCCTGCCCTAGCAAGGACAAAGCATGTTAGATTAGAGAT 

GCTTCTGCTGATCGCAGGGGTTCTTATTTGAAAACATCTATGATGGGGGTGGGGTGGGAG 

GAGACAGGTTGTGGTTATGCAGGAAAATCTTGTCCTAAAAATATATGAGTTTGGGGGTAA 

GG GGTG GGATAGCCAAGCAAAATCAGTAATTATTTTAAAATGMCATATGTATTTTTATT 

AACTTTTAGTTAAATACAGATTTTACAACGAGGTCAGCATAAGCCTAAATCTATATAGAG 

GGCTMCTCAGGCATTGTCTTGTTTATTTGTAGACTGGATTAAAAACAACCTGTCCTGTT 

TTGTNAGTTCCCAGCTTCTTTCGTTTAGAATAAATTAGACCAAAAGAA 

Sequence 1665 
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GCGTNCGCTTAATGTCAATGTGGCCTGGGCTGGAGGTCTGGACCCCCCCATGGGGGATCC 

TGAGTACCTGGCTGCTTTCAGGATAGTCGTGATGCCCATCGCCCGAGAGTTCTCTCCAGA 

CCTAGTCCTGGTGTCTGCTGGATTTGATGCTGCTGAGGGTCACCCGGCCCCACTGGGTGG 

CTACCATGTTTCTGCCAAATGTTTTGGATACATGACGCAGCAACTGATGAACCTGGCAGG 

AGGCGCAGTGGTGCTGGCCTTGGAGGGTGGCCATGACCTCACAGCCATCTGTGACGCCTC 

TGAGGCCTGTGTGGCTGCTCTTCTGGGTAACAGGGTGGATCCCCTTTCAGAAGAAGGCTG 

GAAACAGAAACCCAACCTCAATGCCATCCGCTCTCTGGAGGCCGTGATCCGGGTGCACAG 

TAAGTGTGGAGATGGGACACTCGCTGAGCTCAGACTGAAGGATCTTGGT 

Sequence 1666 

CGACCNCGCGTCCGGTGTGATGATCGCTACTGCTGGAGACCGCACAGAGGAGTTCCACGG 

CCACNGCAGTGAACCTCCTGGGGAACTTGCCCCTCAAGTGTCTGGATGTTCTCCTCACCC 

TGGAGCCACATGGAGACTCCACGGAGTTCATGGGAGTGAATATGGATGTGATTCGTGCCC 

TCCTCATCTTCCTAGAGAAGCGTTTGCACAAGACACACAGGCTGAAGGAGAGTGTAGCTC 

CCGTGCTGAGCGTGCTGACTGAATGTGCCCGGATGCACCGCCCAGCCAGGAAGTTCCTGA 

AGGCCCAGGTGCTGCCCCCTCTGCGGGATGTGAGGACACGGCCTGAGGTTGGGGGAGATG 

CTGCGGAACAAGCTTGTCCGCCTCATGACACACCTGGACACAAGATGTGAAGAGGGTGGC 

TGCCGAGTTCTTGTTTG 

Sequence 1667 

NCGCGTCCGACACTATTTAGAGAGCTCCCTTCCCACCTCTCTGCCCAGCCTTGTTACCTC 

ACTTCTGCTCTGGCCATGGCTGTGMGGGCCCAGCCAGCTCCCTGTTTTGATGTTCTGTG 

CAACAGCTCCGGGGTCTTGTGACTGGAGATCCTCAACAGGCCCTGGAGCCAGGACTGGAG 

TCTTGGCAGCTGATGAGCAGCACCTTGCCGGCCAGGAGGAGCTGATGCTGACGATCTCCC 

CAACATCTGAAGGCTTAAAGAACATTGTCGTTCTTCAGCCCTCCTTGCTTCTCTCAATAC 

AATAAGACATTGCAGAAGCAAAAGGGTGGCCTCTGCTCCAGGCAAGGCAGCTGGCTCTGT 

CT GGGG GCGTCGGCCTGGGGCTTGGGTGCCACGTGCTGAGATTGCATAGTCAAAACAAGC 

CAI I n IGCCAACAATAGCTTGTGGCTCCCACATTTTTCTACCCTTGCACTNAANGGCCA 

GACCACTCT^GCATGGACCAANACCATNmCCAMCCCATGGGGCTTTTTTTNCC 

Sequence 1668 

CANGAATACTGAAAAATGAAGCCTAAAATGAAGTATTCAACCAACAAAATTTCCACAGCA 

AAGTGGAAGAACACAGCAAGCAAAGCCTTGTGTTTCAAGCTGGGAAAATCCCAACAGAAG 

GCCAMGAAGTTTGCCCCATGTACTTTATGAAGCTCCGCTCTGGCCTTATGATAAAAAAG 

GAGGCCTGTTACTTTAGGAGAGAAACCACCAAAAGGCCTTCACTGAAAACAGGTAGAAAG 

CACAAAAGACATCTGGTACTCGCTGCCTGTCAACAGCAGTCTACTGTGGAGTGCTTTGCC 

TTTGGTATATCAAGGGGTCCAGAAATATACTAGAGCACTTCATGATTCAAGTATCACAGG 

MTTTCACCTATTACAGAGTATCTTGCTTCTCTAAGCACATACAATGGATCAATCCATTA 

CTTTTGCTTTGGAGGATGGAAAGTTATGAGATATATGTTGAAGACTTGAAAAAAG 

Sequence 1669 

GTCGACCNCGCGTCCGCCCGCCCATCACTGCTGTTCCTCCAGGGCCAGCACTCGGGCGAG 

GCAGGGGAGCTGCCTTCGGTACATAATTTGAAGGGGCACTCCCTCTTGGGCACATGCCGG 

CCCTGAGTGCCTCCCTTGCCTCACTCTGATCCTGGCCCCATAATGTCCTCAGTGGAAGGT 

GATGGGGGCCGGTGCTGTGGGGAGAGTAGAAAGAGGGGTTGGCATGACTAAAAATACCAG 

TATGTGTATTAAGTATTTTGAGAATGAAATGCCAAGGAGTGCCTACTATATGCCAGCTCT 

AGGAATGGAGTAGACAGTGGACACAAGAAGGACTTACGCCCTGAGCACAGGTGCCAATGG 

TGACAAGACTGGCAAGACGTGAGGGCATGAATGGTTCATTCAGGCAGCTGCTGCAGATGT 

GGTCACCTGGTGCCATCTGCTGCTCCCTTTTCCACTTTTCTATGTCCTCCTTCCACCCCA 

A 

Sequence 1670 

CGACCNCGCGTCCGGTCTGAAGGGTCTGGCTGGTGAGCCAGGTTTTAAAGGCAGCCGAGG 
GGACCCTGGGCCCCCAGGACCACCTCCTGTCATCCTGCCAGGAATGAAAGACATTAAAGG 
AGAGAAAGGAGATGAAGGGCCTATGGGGCTGAAAGGATACCTGGGCGCAAAAGGTATCCA 
AGGAATGCCAGGCATCCCAGGGCTGTCAGGAATCCCTGGGCTGCCTGGGAGGCCCGGCCA 
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CATCAAAGGAGTCAAGGGAGACATCGGAGTCCCCGGCATCCCCGGTTTGCCAGGATTCCC 

TGGGGTGGCTGGCCCCCCTGGAATTACGGGATTCCCAGGATTCATAGGAAGCCGGGGTGA 

CAAAGGTGCCCCAGGGAGAGCAGGCCTGTATGGCGAGATTGGCNCGACTGGTGATTTCGG 

TGACATCGGGGACACTATAAATTTACCAGGAAGACCAGGCCTGAAGGGGGAGCGGNGCAC 

CACTGGAATACCAGGTCTGAAGGGATTCTTtGGAGAGAAG 

Sequence 1671 

GAGTCGACCNCGCGTCCGCAACTGGTGTCCAGCTCGGTGCACTCCAAGCGCCGTTCCCGA 

GCGGACCTCACGGCCGAGATGATCAGCGCCCCGCTGGGCGACTTCCGCCACACCATGCAC 

GTTGGCCGGGCCGGAGACGCCTTTGGGGACACCTCCTTCCTCAATAGCAAGGCTGGCGAG 

CCCGACGGCGAGTCCTTGGACGAACAGCCCTCTTCTTCATCTTCCAAACGCAGTCTCCTG 

TCCAGGAAGTTCCGGGGCAGCAAGCGGTCACAGTCGGTGACCAGGGGGGAGCGGGAGCAG 

CGTGACATGCTGGGCTCCCTGCGGGACTCGGCCCTGTTTGTCAAGAATGCCATGTCCCTG 

CCCCAGCTCAATGAGAAGGAGGCCGCGGAGAAGGGCACCAGTAAGCTGCCCAAGAGCCTG 

TCATCCAGCCCCGTGAAGAAGGCCAATGACGGGGGAGGGCGGCGATGAGGAGGCGGGCAC 

GGAGGAAGGCAGTGCCCCGTCGGAAT 

Sequence 1672 

CGCGTCCGCTCGCGGCNGGGCATCGNGTACATCCTCAGCAACCATGGGCTACGTGCGCCA 

GCTCTCCCAGGCCCTGGACACATCCAACGTGATGGTGAAGAAGCAGGTGTTTGAGCTACT 

GGCTGCCCTGTGCATCTACTCTCCCGAGGGCCACGTGCTGACCCTGGACGCCCTGGACCA 

CTACAAGACGGTGTGCAGCCAGCAGTACCGCTTCAGCATTGTCATGAACGAGCTCTCCGG 

CAGCGACAACGTGCCCTACGTGGTCACCCTGCTTAGCGTGATCAACGCCGTCATC7TGGG 

CCCCGAGGACCTGCGCGCGCGCACCCAGCTGCGGAACGAGTTTATCGGGCTGCAGCTGCT 

GGACGTCCTGGCTCGCCTGCGGTGAGTCCCCACTGTAGCGGTCCTGCCGGNTTNCCCCTC 

CTGCTCCCAAGGCCAGGCCCACCTGCCCTTTGGCTCCCAGCCACCTCACCTAAGCAGCAC 

CTTCCAGATGGCAGGGGAGGTGGC 

Sequence 1673 

GTCGACCACGCGTCCGGCCAGAGCTGAGTGGCAGCCGCCTCCCTTATGCAGGACATGTGC 

TCTCGGCTTCACCAGGGTTCTGACCGGGTCTGCTTCTGCATTCACAGCGCCTCCTGGACC 

TGAAGGCATCTGAGTGTGAGACCCTGTTCTAACTCTTAGAAGTGACATTGTAAGAGGTGG 

TGGGGACCAGCTAATTGGTCCAACCCAGCCTGAGTGCACCACCCTTTGAACAAATGTATC 

AGTGATGAAAATTTGCCTTTGCCCCGGCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGTGGGCGGATCACTTGAGGTCGGGAGTTCAGGACCAGCCTGGCCAGCGTGGCGAAACCC 

CGTCTCTACTAAACATAAAAAAATTAGTCAGGTGTGGCGGTGCGTGCCTGTGGTCCCAGC 

TATTCAGGAGGCTGAGGCACCAGAA7TGCTTGA 

Sequence 1674 

TGACGGCGGCCCGGCCGACGGGAGCCGGGGCGGGGCGGCGGNCCANCGAAGGAGCGCGCG 

GGCGGTCTGGCCCCGCCCCCTCCCCGCCCGCCTTCCCGGTGACCTTCAGGGGCCCGGGTG 

GCGGGCGCAGGCCCCTGCGGCGGCGGCGGGATGTTCGTGCAGGAGGAGAAGATCTTCGCG 

GGCAAGGTGCTGCGGCTGCACATTTGCGCGTNCGACGGCGCCTAGTGGCTGGAGGAGGCC 

ACCCNGGACACCTACNGTGGANAAANCTCAAGGAGCGCTTGCCTCAAAGCACTGTGCTCA 

TGGGGAGCTTANAAGATCCCCAAAAGTATAACCCATCATTAAATTTAATCCCACGCTGCC 

TNAANANAAGGGGTGCTTGNGTGATTGCCATGNACCATNCTTGGGAAGGAAGAAACCATT 

CCCAGGACCCAAAAGATGGGCCCTATTTNTTGGATTA 

Sequence 1675 

CACGCGTCCGGGATCCCGTACCCGGGACAGACTCGGCGCCGCTGGCTGGCCTGGCCTGGT 

CGTCGGCCTCTGCACCCCCGCCGCGGGGGTTCAGCGCGATCTCCTGCACCGTCGAGGGGG 

CACCCGCCAGCTTTGGCAAGAGCTTCGCGCAGAAATCTGGCTACTTCCTGTGCCTTAGTT . 

CTCTGGGCAGCCTAGAGAACCCGCAGGAGAACGTGGTGGCCGATATCCAGATCGTGGTGG 

ACAAGAGCCCCCTGCCGCTGGGCTTCTCCCCCGTCTGCGACCCCATGGATTCCAAGGCCT 

CTGTGTCCAAGAAGAAACGCATGTGTGTGAAGCTGTTGCCCCTGGGAGCCACGGACACGG 

CTGTGTTTGATGTCCGGCTGAGTGGGAAGACCAAGACAGTGCCTGGATACCTTCGAATAG 
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GGGACATGGGCGGCTTTGCCATTCTGGTGCAAGAAGGCCA 
Sequence 1676 

TCCCTCTGCTGATGATGGATGCCCCTAACACCTGTGCCTAACACCCCTACTGAACCCCAC 

AGCTCCAGCCTTAGTTTTTGGAGTCMGTGTTAAAGGTTTCTGGCCAGAGGAATTGGGGT 

CTTGCCATCCCTGCMTAGCCCTTTTATGGGCTCTGGGAGACAGCTTTAGGGAATAAATG 

GGGATTTTCCCCTTTTTCTACCCACTCCTTTGCTTCCTCCMGACTTACCCMCTCCTT^ 

CCCCTCAGAGAACCAAATAGCCTGAGGAAGCAGGAGAGTTCCTGGTTATGGCAGATTCTT 

GGTGATTTGGGGCTTCAAGACAGTAGGTGAGAGATGCTGTCAGGGACGTATCTTCTTCAT 

ACCAAAGTCACTGGTCCTTTCTCAGCCTCTCTCGTGCTTTTCTCCTMTGACCATATTTT 

TGCCAAAAATTGGGMTATGTTATCTGACAGACCAGAATATTTGAAGGTTTGGGCTG 

Sequence 1677 

GCGCCGCGGATATNCGGATCAACCTATGGTNTCAATATTGTNAGTTATTCAGCATAAACA 

GMTTATTTCCCMNACTTGATCTGAAATATTNNTAATGGTCNTACTNGAAACTTATATT 

CTTNCTGGGAGNGANGTNTTTATCATTTTTCCATGGAGACAGGTTCTAACTCTGTTGCCC 

AGGCTGCANTGCAGTGATGTGATCATAGCTCACTGCAGCCTGAAACTCCTGGGTGTCAAG 

TGATCTCTGGCCTCAGCCTCCCAAGTAGTTGGAACTTCAGATACGTGCCACCACAACCAG 

CTAATTTATTTTTTAGAGATGAGGTNTCGCTATGTNGCCCAGTCTGGCCNNCTAGCCNCA 

AGTGATCTGGCCATCTNAGCCTTCAGTTGGAGATGTCTGATTTATGTTAATATAAGAAAG 

CTGTTGATCGTTTATCATAAANGCATTT 

Sequence 1678 

GTCNCCNCGCGTCCGCTCCTCCGCCGGCATGCAACTCGGCGCCCGCGGTCCATGGACCGG 

AACCTCGGGCCGACGGACGGGAACCCGGGCCGCGATCGCCGCCTCCCCGCCTCAGGCTCC 

TCCTCCTCGCTCTCCGCCGCCTCCGCCGGACTCCCGCAGGCCCTGCACCGCCGCCGCCAG 

GCTAGCGGAGCTGCCCCGGGAAGCTGGGTGACGGGTTCGCGGCTGCCGCCGGACTGCGGC 

CTACTCCGCCGCCTCTCAGTGCTATTGTCCCTGGGCCTGGCCTTGAGCGGGTCCACTGGG 

GAAGGCNCGTGTGCGCCGGCTCCGCGGAAGATGCCGGACCAAGCCCTACAGCAGATGCTG 

GACAAGAAGTTGCTGGGTTTGTTTTGCTACTGATGAAGATGATAGAACAGCTGAATGGGT 

GAGGACCATGGCAGGTGCAGGAGGATCTACAAAATGGGTTCACCAGGGCCTGTCTACAAC 

GCTGGGTGGATGAAAAGCAAAGAG 

Sequence 1679 

GCGTCCGGGCCCGCGACCGAGCGTGCGGACTGGCCTCCCAAGCGTGGGGCGACAAGCTGC 

CGGAGCTGCAATGGGCCGCGGCTGGGGATTCTTGTTTGGCCTCCTGGGCGCCGtGTGGCT 

GCTCAGCTCGGGCCACGGAGAGGAGCAGCCCCCGGAGACAGCGGCACAGAGGTGCTTCTG 

CCAGGTTAGTGGTTACTTGGATGATTGTACCTGTGATGTTGAAACCATTGATAGATTTAA 

TAACTACAGGCTTTTCCCMGACTACAAAMCTTCTTGAAAGTGACTACTTTAGGTATTA 

CAAGGTAAACCTGAAGAGGCCCGTGTCCTTTCTGGAATGACATCAGCCAGTGTGGAAGAA 

GGGGACTGTGCTGTCAAACCATGTCAATCTGATGAAGTTCCTGATGGAATTAAATCTGCG 

AGCTACAAGTATTCTGAAGAAGCCAATAATCTCATTGAAGGAATGTGAACAAGCTGAACG 

ACTTGGAAGCAGTGGAT 

Sequence 1680 

GTCCGGCGTGGGGAAGGGTGGGGTGAGGGGGCGTGGCCGCAGCTAGGGCGGCGAAACTCT 

CCTCCCCTCGGCCCCACCGCGTGGGACGGCGTGAACGTGGTGTCGGAGGGATGTCAGCCT 

TCTCTGAGGCGGCGCTGGAGAAGAAGCTGTCGGAGTTGAGCAACTCGCAGCAGAGCGTGC 

AGACCTTGTCCCTGTGGCTCATTCACCACCGTAAACACTCGCGGCCCATCGTCACCGTGT 

GGGAGCGGGAGCTGCGGAAAGAGTGGAGGTGCAACAAATGAGAAATTCCAATTGGAGATT 

TTGTCMACAGGMTTTACTCCAAACCMACAGGAAGCTTACTTTTCTCTACCTAGCCAA 

TGATGTCATACAGMCAGCAAGAGGAAGGGGCCAGAGTTTACAAAAGATTTTGCACCAGT 

TATA 

Sequence 1681 

CCGGCAAAGCAGGGACTCCTGATTTATATGTCCCTCCTCCTGGCAATCCTCTCACCCCAC 
CTCCCCTGAGAACCTCAGTTCTTCCTAAATTGCTAAAGCTGAGGGGAAAGGGATGCTTTG 
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CGGCAMGGCGCTGCTGCCTCMGCTTGCTTTACATGCCTCTCTAGTTCCTCTCGCACTA 

CAGAGTGGTAGCGAACAAAAGCGTTCGCCCTAGAAGCGACCTGAATGGAAAAATCTGCAC 

ACGAACTAATGGGTTTGTCACGGAAAGTAGGGAACCGGGTCTGCAGCATTCCCTGGAGAC 

AGACTTTCTGGTTGGTTTTCAAGGGTCCAAGGCAGCCATCAGCCCGGCTGTGCCCTCCCA 

CCCTGCCTCCCACCCAGTTGATTCTCTCTTTGTGTAAGTTTAGCCCCTCTGAGGGTGGTG 

GAGTGAGAGCATCCCATCAGATATATATACGATTCATCAGTCGGCACTTAAAAAG 

Sequence 1682 

TCACCNCGCGTCCGAAAAACGCAGATGATATACCTGCAACATCNGTCATGGCTGCGCCCT 

GTGCTCAGAAGCAACCCGGGTGGAATATTGCTGGTGCAACAGTGGCAGGGCACAGTGCCA 

CTCAGTGCCTGTCAAAAGTTGCAGCGAGCCAAGGTGTTTCAACGGGGGCACCTGCCAGCA 

GGCCCTGTACTTCTCAGATTTCGTGTGCCAGTGCCCCGAAGGATTTGCTGGGAAGTGCTG 

TGAAATAGATACCAGGGCCACGTGCTACGAGGACCAGGGCATCAGCTACAGGGGCACGTG 

GAGCACAGCGGAGAGTGGCGCNCGAGTGCACCAACTGGAACAGCAGCCGCGTTGGCCCAG 

AAGGCCTACAGCGGGCGGAGGCCAGACGCCATCAGGCTGGGCCTGGGGAACCACAACTAC 

TGCAGAAACCCAAGATCGAGACTCAAAGCCCTGGTGCTACGTCTTTAAGGCGGGG 

Sequence 1683 

CCGTCCGCTCCTTGGCAAGAACGAAAGGTGTGATGAAACCTCCCTGCTCGGAAGGGTCTC 

CGTGGAGGTGTCCTCATTTCACATGCTGGGTTTTGCAAGCGAGGAAGCCAGGCAGTGGAG 

GAACTAGAGAGAGGCAGGCGTGTGTGTGGACAAGCGCTGGAGCCGCAGCCCTCAGACTGG 

CACGGGAACGCCAGCGTTGGGTGTTCAGATTCCACGCGTATGTCTGGGCTCACTCACAGC 

ATGGCCGAGTGTCTGCAGTGCTGGTCCTGACCCTTCCAGAGCAGCAGTGGACAGATGAGA 

TAAGACTGTTTCAGAAACAAAGATGGCCACAGCCTTCCTAACAAGCAGGTCATCTGGCCA 

TGTCTGTATTGTAACTGGTAAAAGGCTTCAAGTCAGATTGATGATCAAGAAAANGTCAAA 

ACCCCAGCCCMGATTGGGAMGCAGGTTNGTGGNTCCAANGCTTTTTAAAAAAATTATT 

TGAAGCTCTTCATTCCTNTTTCTGTGAGTGTGTCTTTCTCTT 

Sequence 1684 

NCCCATACTGGGGGCCCCCTTCCTGCAGGCCCCATCAGGTGCAGAGCTGTGGGTCTGGTT 

CCAAGACACTGTCACTGATGTGGATAAATCTTGGAAGGAGCTCAGTAATGTCCTCTCAGG 

GATCTTCTGCGCCTCTCTCAACTTCATCGACTCCACCAACACAGTCACTCCCACTGCCTC 

CTTCAAACCCCTGGGTCTGGCCAATGACACTGACCACTACTTTCTGCGCTATGCTGTGCT 

GCCGCGGGAGGTGGTCTGCACCGAAAACCTCACCCCCTGGAAGAAGCTCTTGCCCTGTAG 

TTCCAAGGCAGGCCTCTCTGTGCTGCTGAAGGCAGATCGCTTGTTCCACACCAGCTACCA 

CTCCCAGGCAGTGCATATCCGCCCTGTTTGCAGAAATGCACCGCTGTACTAGCATCTCCT 

GGGAGCTGAGGCAGACCTGGCAAGTTGTATTTGATGCCTTCATCACGGGGCAGGGAAAGA 

A 

Sequence 1685 

CCGCTGGTTATTACCCAGCTGGATGGTTTCCTTTTAGGCAAGAAGGAGGTCATCAGCAGG 

CTCCCAACAATAATGCCGAAGTTAACAATGATGGGCAAAATGCAAACAACTTGGAACTTG 

AAGAAATGGAGCGTCTTATGGATGATGGGCTTGAAGATGAGAGTGGAGAAGATGGAGGTG 

MGATGCCAGTGCMTTCAMGGCCTGGATTAATGGCTTCAGCTTGGTCTTTCATCACCA 

CCTTCTTTACTTCACTAATACCAGAGGGGCCTCCCCAGGTTGCCAATTGACCTGAAAAAC 

TGTGCCAGCTACMGGAGGGTCTGACTTCAGGAAAGTGGTTTAAATAACAGTGCAATTTC 

AAAAAAATTTATAACTTTCTTTTGATCATCATGTACAGAGGTG I (Mill I CTTTAGGCT 

TCTCATGCATATGAATATTTTAAGCACGAATGGACTACTAAATATCTGAGI 1 1 I I I I I I I 

I I I I I I I I I AAAGAATC 

Sequence 1686 

CGCGCCGGTTTGGCTGCCCTGCATAAGCTGCTACAAATAGAATAAAGAATTTCATACGCC 
TGTATCTATCATTTAGATGCATGGAAAAAAATGGGCTTTGCACACAATGGGTTTGGAGCT 
GAC TGGGAA CAATGGAAAAAATTACATTAGCTGTGGTTGTAAAG I I I I I I I G I I I I I GGT 
TTTGI I I I I I U I ICI I I I M C I I I I I II I I I II I I A CCATC TTGTGAAAGGTTTCTGAA 
ACTCGATAATAAAAAGCGGTTGGTGTAMTTATTCTTTTGTGTCACATTTTTAGAAGGAA 
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AAACATAAAAGAATGTATCCTTAGTACTGGTTCTTAAACAGCCCATAAAAACCCATTGGC 
CTGAAGCTTATATCTCAGGCCTATGCCCATCTTATAGTCTTGGAAGACAAAA 
Sequence 1687 

CGCGTCCGTGGGTCTCGCCCTCAACCTGTGCATGTATATGTGTGTCTTTGTGTGTGTATG 

TGTGATCTCTGCCTGCAGGACCAGCCCGGTGGCACCCTGGATCTGACCCTGATCCGTGCC 

CGCCTCCAGGAGAAGTTGTCACCTCCCTACAGCTCCCCACAGGAGTTTGCCCAGGATGTG 

GGCCGCATGTTCAAGCAATTCAACAAGTTAACTGAGGACAAGGCAGACGTGCAGTCCATC 

ATCGGCCTGCAGCGCTTCTTCGAGACGCGCATGAACGAGGCCTTCGGTGACACCAAGTTC 

TCTGCTGTGCTGGTGGAGCCCCCGCCGATGAAGCCTGCCTGGTGCTTGGCCTGAAGTTTC 

CCAAGGAAGCTGTCTGGTGGGCCCCTTGGGTGATGGGCCCCTTGGAGGGCTTGGAAGCCC 

CCCCATGGGCCAAGCCCCAGCCCTGGGCTTCTGGTTCNTCTTGTCCCTGGTCACCCCCAT 

CCCCACTCCCCCTTGGGTGGGCCTTGAACTTNCCACTTCCCTTGGGTGGGGCCC 

Sequence 1688 

AGGAGGNTTGAAGGAGTTGNNGGAGGAGGAGGATGGAGGCGAGGGCGAGCGAGCCCAGCG 

GGGTCCNGGNCGCCCCGCGGGCCAAAGTCGAGCCCTNCCGCCNNTGGGCGAGCGCGCCAG 

CCGCCCNTTTCANAACAGTTCGNCGCCACAAAANAAAAGAACGGGGGGGTGCCGAGGTTN 

CCCATTGANCTCTTAAAGTGGTGCAGGTCCCTGTTGAGTGCGCTGCACCGGGCCGTGACC 

CGCGCCCCTGTGCGTCCC 

Sequence 1689 

GGAGTCGACCACGCGTCCGCGCCGGCCGCCGGTGTCCGGACCGCTCGCCCCCGTTTGGAC 

CCGACTTCGGTTCTTCTGGGGTGTTGATGCTCCTAAAGCCCGAGAGCACGTGTCCAGACC 

CTAGCCTGTACGACGCTGACTCTGCCCGGTCCCAGAACCAAAGCCATGCCGGGGTGTGGC 

CTCTGACCCCACGCGGAGGGGACCTCGCCTTGCGGGACCCCACCTGGAACCCGACCTNCC 

AGNCTCGCAGCCGGCCTGAGCCGCCATGCGCGGGAAGTTGCTGCCGCTGGCCGGCCTATA 

CCTGGTGCAGGGCCTGCCCTACGGGCTCCAGTCCGGCCTCCTGCCAAATGCTGCTTGCGT 

GCCGGCGGCCTCTCGCTGACGCGCGTGGGGCTGGCCAAGGTTCTGTACGCTNCGTGGCTT 

GCTTCAAGCTGGCT 

Sequence 1690 

CNCCCCGCGTCCGCGGACGCGTGGGTGCTTGCTGCTGAACCTGAGCTGCAAGTTGGAATT 

GATATAATGAAGACTAGTTTTCCAGGTCGTGGTTCAATTCCAGMTTCTTTCATATTATG 

AAAAGAAAGTTT ACCA ACAAAGAATGGGAAACAATCAGAAGCTTTAAGGATGAGTGGACT 

CAGCTGGATATGTTTTATAGGMTTGGGCACTTAAGGAAAGCTTCATAAAAGCCATTGGT 

GTTGGACTAGGATTTGAATTGCAGCGGCTTGAATTTGATCTATCTCCATTAAACTTGGAT 

ATAGGCCAAGTTTATAAAGAAACACGTTTATTCCTGGATGGAGAGGAAGAAAAAGAATGG 

GCATTTGAGGAMGCAAAATAGATGAGCACCATTTTGTTGCAGTTGCTCTTAGGAAACCC 

GATGGGATCTAGACATCAGGGGATGTTCCATCTCAGGATGATTCCAAACCAACCCAGAGG 

GCAATTTACTATTCTCAACTTTAATGATTTAA 

Sequence 1691 

GACCACGCGTCCGCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCC 

GGGCCGCTGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCGGGGCGCTGTGTCCCC 

AAGCTCCCCGTCCTCGGCCAGGCGGGCACCACGGCAGGGGCTGAGCTACCCTCATGGAAG 

GGAGAGGACCGTACCGGATCTACGACCCTGGGGGCAGCGTGCCCTCAGGAGAGGCATCCG 

CAGCTTTTGAGCGCCTAGTGAAGGAGAATTCCCGGCTGAAGGAAAAAATGCAAGGGATAA 

AGATGTTAGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAAGG 

CAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTCTCCCTCCTTGGGCTCCT 

TCGACCCCCTGGCTGAGCTCACAGGAAAGGACTCAAATGTCACAGCATCTTCCACAGCCC 

C 

Sequence 1692 

ACAGAATTTAGGGGTGGGTGAAAGCACTTGNGCTTTAGCTNTTTCATATTAAATATATAT 

CTATATTTAAACATTCATGGCATAGATGATGATTTACAGACAATTTAAAAGTTCAAGTCT 

GTACTGTTACAGTTTGAGAATTTGTAGTATTACATCATTACATAAGTCATTTTAGTAACA 
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tSCCTTTGTGAAATGAACTTGTTTACTATTGGAGATAACCACACTTAATNAAGAAGAGACA 

GTGAMGTACCATCATAATTMCCTAAATTTTTTGTTATAGCAGAGTTTTCTTGTTT 

AAMMTAAMTCATCTNGAAAAGCATTTTGTACAGTTAAMTGTATMTGMGCTTTTG 

CCAACCAGACTGGTGCTTAGCAACCAAA I I I I I I M I I AAATAAAGCTTTTATGGCAGGT 

GGGTAAATAAGGTGGCCTTCCAAATATATTGGTGTCTTGATGGAGAGTTNTTTAGTTGAA 

ATGAATGTGGGTCTTTTCT 

Sequence 1693 

CGGTTAACATGGCCGTCACCGACAGCCTCAGCCGGGCTGCGACTGTCTTGGCAACTGTNT 

TGCTCTTGTCCTTCGGCAGCGTGGCCGCTAGTCATATCGAGGATCAAGCANAACAATTCT 

TTATGAAGTGGCCNATCAAACAANCTGGGCCTGTTCTTGGTGTGTACATCCCCGATTCTG 

GTATTAATTATCGACATGTTGCAAATACCCTTTCTGTTTATAGAAGTGTCAAGAGGCTAG 

GTATTCCTGACAGTCACATNTGCCCCTAATGCTTGCAGATGATATGGCCTGTAATCCCTA 

GAMTCCCAAACCAGCTACAGTGTTTTAGTTCACAANCAATNTGGAACTAAATGGTGTAT 

GG GAGAATG ATGTGGGAAGGTGNNATTATAGAAGTTTNTTGAGGTAAACNGGTGGGAGAA 

NNNTTTTTTTACCNGGGTMTTTAANCTGNGGGAGGGANTCCCCACCCTAAGTANCTTCC 

TTCGGG 

Sequence 1694 

GTCCGCAAGATGGACGCAGCTCTCTGACCTACGACACTCTCCGGTTTGCTGAGTTTGAAG 

ATTTTCCTGAGACCTCAGAGCCCGTTTGGATACTGGGTAGAAAATACAGCATTTTCACAG 

AAMGGACGAGATCTTGTCTGATGTGGCATCTANACTTTTGNTTTACATACAGGAAAAAC 

TTTCCAGCCATTGGGGGGACAGGCCCCACCTCGGACACAGGCTGGGGCTGCATGCTGCGG 

TGTGGACAGATGATCTTTGCCCAAGCCCTGGTGTGCCGGCACCTAGGCCGAGATTGGAGG 

TGGACACAAAGGAAGAGGCAGCCAGACAGCTACTTCAGCGTCCTCAACGCATTCATCGAC 

AGGAAGGACAGTTACTACTCCATTCACCAGATAGCGCAAATGGGAGTTGGCGAAGGCAAG 

TCCATAGGGCCAGGTGGTACGG 

Sequence 1695 

CCCCGCGTCCGCTCGNAGCTGTCCGCGGTCTGTTTGGCCCGAACGGCGGCGGAGGCGCTG 

ATCATGGCGACATTCATCTCGGTGCAGCTGAAAAAGACCTCAGAGGTGGACCTGGCCAAG 

CCGCTGGTGAAGTTCATCCAGCAGACTTACCCAAGCGGCGGGGGAAGAGCAGGCCCAGTA 

CTGCCGCGCGGCGGAGGAGCTCAGCAAGCTGCGCCGCGCCGCAGTCGGTCGTCCGCTGGA 

CAAGCACGAGGGCGCGCTCGAGACGCTCCTGAGATATTATGATCAGATTTGTTCTATTGA 

ACCCMATTCCCATTTTCTGAAMTCAGATCTGCTTGACATTTACCTGGAAGGATGCTTT 

CGATAAAGGTTCACTTTTTGGAGGCTCTGTAAAACTGGCTCTTGCAAGCTTAGGATATGA 

AAAGAGCTGTGTTGTTGTTCAATTGTGCAGCCTTAGCTAGCCAAATTGCAGCAGAACAAG 

AACCTGGATAATGATGAAGGGATTGAAAATCGCT 

Sequence 1696 

TTCGGGAGTCGACCCCGCGTCCGGGCCAGCCGGCTCGCCCGGGGGCCATGGCAGCAGCGG 

CTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCCTCCCGGGGAAGTCATTCATCTGAATG 

TGGGAGGCAAGAGATTCAGTACCTCTCGCCAGACTCTCACCTGGATCCCAGACTCCTTCT 

TCTCCAGTCTTCTGAGCGGACGCATCTCGACGCTGAAAGATGAGACCGGAGCAATCTTCA 

TCGACAGGGACCCTACAGTCTTCGCCCCCATCCTCAACTTCCTGCGCACCAAAGAGTTGG 

ATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCATGAAGCCCAGTTCTATGGGCTCACTC 

CTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTTGGATTCGATCTTCTTGTGGAAACGTC 

CTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCCAGTGAAGCGGCGGAACCGGCACAGC 

CTAGTGGGGCCTCA 

Sequence 1697 

CGTCCGAAGGAAGGAAGGGACGGGCTGAGTTCCCCGACGAGAGACACACCCAGATTTTCC 

TGCAGCTTGGGGAGAGGTCCTCCCAGGAGCCTTGGTCCCTCCTGGCCTGCCGGAGTCCTT 

AGCCAGGATGGAGGCTGTTGTGAACTTGTACCAAGAGGTGATGAAGCACGCAGATCCCCG 

GATCCAGGGCTACCCTCTGATGGGGTCCCCCTTGCTAATGACCTCCATTCTCCTGACCTA 

CGTGTACTTCGTTCTCTCACTTGGGCCTCGCATCATGGCTAATCGGAAGCCCTTCCAGCT 



# 
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CCGTGGCTTCATGATTGTCTACAACTTCTCACTGGTGGCACTCTCCCTCTACATTGTCTA 
TGAGTTCCTGATGTCGGGCTGGCTGAGCACCTATACCTGGCGCTGTGACCCTGTGGACTA 
TTCCAACAGCCCTGAGGCACTTAGGATGGTTCGGGTGGCCTGGCTCTTCCTCTTCTCCAA 
GTTC 

Sequence 1698 

CGCGTCCGGCCGCGCCCATGGCCCGCCGCTGCCCGGCCGCGCGGGCGGGCCCGCCACGCC 

GCTGTCGCCCACGCGCCTGTCGCGGCTGCAGGAGAAGGAGGAGCTGCGCGAGCTCAACGA 

CCGCCTGGCGCACTACATCGACCGCGTCCGCGCGCTGGAGCTGGAGAACGACCGGCTCCT 

GCTCAAGATCTCAGAGAAGGAGGAGGTGACCACGCGCGAGGTGAGTGGCATCAAGGCGCT 

GTACGAGTCGGAGCTGGCCGATGCCCGGAGAGTCCTGGATGAGACGGCTCGAGAGCGTGC 

CCGGCTGCAGATAGAGATTGGGAAGCTGAGGGCAGAGTTGGACGAGGTCAACAAGAGCGC 

CAAGAAGAGGGAGGGCGAGCTTACGGTGGCCCAGGGCCGTGTGAAGGACCTGGAGTCCCT 

GTTCCACCGGAGCCGAGGTGGAGCTGGCAGCTGCCCTCAGCGACAAGCGCGGCCTGGAGA 

Sequence 1699 

ACGCGTCCGGAAGAATCTACACTTCTTTGCACCAGAGTATGGAGAAGTCACTAATGTGAC 

AACAGCAGTGGACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGAT 

TCAGGGCAATGGAGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCT 

TCTAGAAGACCCATTACAGAGGGAGTTCATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCG 

CAGACCAACAGCCAGAGMCTTCTGTTCCACCCAGCATTGTTTGAAGTGCCCTCGCTCAA 

ACTCCTTGCGGCCCACTGCATTGTGGGACACCAACACATGATCCCAGAGAACGCTCTAGA 

GGAGATCACCAAAAACATGGATACTAGTGCCGTACTGGCTGAAATCCCCCAGGCCCTGAT 

CTGCGCTGTGGCTGTCCCTGGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTC 

Sequence 1700 

GGGAGTCGCCCCGCGTCCGGATTTCAGTTGGTGGCGTCATAGTCTCATTACAGTGTCTAT 

CTTGGCATTTACCMTTTACTTGTCATCTTTGTCCCMCTATTAGGGATATCTTTGGTTT 

TATTGGTGCATCTGCAGCTTCTATGTTGATTTTTATTCTTCCTTCTGCCTTCTATATCAA 

GTTGGTGAAGAMGAACCTATGAMTCTGTACAAMGATTGGGGCTTTGTTCTTCCTGTT 

MGTGGTGTACTGGTGATGACCGGMGCATGGCCTTGATTGTTTTGGATTGGGTACACAA 

TGCACCTGGAGGTGGCCATTAATTGGCACCACTCAAACTCAAACTCAGTCCATCTGATGC 

CAGTGTTGAGTAMCTCAACTACTATGAAATTTCACCTAATGTTTTCAGTTTCACTTCCT 

TTTGAAGTGCAGATTCCTCGCTGGTTCTTCTGAGTGCAGAATAAGTGAACI I I I I IGTTT 

TGGTTTGN 1 1 I 1 1 1 AAGAAAC 

Sequence 1701 

CCCACCGTCCGCGCGCGGCGCCTCGCCTCGGCCGGCGCCTAGCAGCCGACTTAGAACTGG 

TGCGGACCAGGGGAATCCGACTGATAAATTAAAACAAAGCATCGCGAAGGCCCGCGGCGG 

GTGNTGACGCGANTGCGATNTNCTGCCCANNGCNTCTTGAATGTCAAAGTTGAANAAANC 

CAATGAAGCGCGGGTAAACGGCGGGAAGTAACTAATGACTTCTCATTAAGGGTAGCCAAA 

NGCCCTTCGTCATCTNAATTAAGTTGGACCGCGCANTGAAATNGGATGAAACCNAGANTT 

CCCACNTGTCCCTACCTACNTAATCCAAGGCGGAAAACCACAAGCCAAAGGG 

Sequence 1702 

CGACCACGCGTCCGGACAGATTGATAGCTCTTTCTCGATTCCGTGGGTGGTGGTGCATGG 

CCGTTCTTAGTTGGTGGAGCGATTTGTCTGGTTAATTCCGATAACGAACGAGACTCTGGC 

ATGCTAACTAGTTACGCGACCCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCC 

TCTTCCTGCACGCCGGGATCTTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCC 

TGCGGGACCTGGAGAAGCTGGCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTG 

GTGTCACCGGCAACCTGGCCAGTGCCATCTTCCTGCCATACCGAGCAGAGGTGGGTCCTG 

CTGGCTCCCAGTTCGGCATCCTGGCCTGCCTCTTCGTGGAGCTCTTCCAGAGCTGGCAGA 

TCCTGGCGCGGCCCTTGGCGTGCCTTCTTCAAGCTGCTGGCTTGTGGTGCTCTTCCTCTT 

CACCTTTGGGCTGCTGCCGTGGATTGACAACTTTGC 

Sequence 1703 
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tBATCGACTTCGCCTGACGGAATCCAGGGGTCGTAATATATGTAAACTCGCGTCCGNGCTG 

CGTGCCCAGAGTAGTGGCCGAATACCTTAACGGGGCTGTGCGCGAGGAGAGCATCCACTG 

CAAGTCGGTCGAGGAGATCTTCGACGCTGGTGCAGAANCTGGCCGACCAGTCGGGCTTGG 

ACGTGATCCGCATTCGCAAGCCCTTCCACACANGACAACCACTAGCATTCAGGGCCAGTA 

GGCACCCCTTCACCAACAAGCTTGACCACGTGTCCCGCGGAACTANCGCACCCCCGAGNA 

GGGTTCAANGNATTCCTTGCACCCAGCCCAGATGCCCTGGCTTNTGGGGGGNCAAGTGAC 

CCTTTGTGNAACCCACTCATTTTTTATGGCMGGTGAGCATTNCCTAAAAACCCTTGAAA 

MTGANGGGAANAACTTTCAAGGGGTTTTTTACAGGGCCCTTGGI 1 1 I I I 1 1 AAAATCCC 

CAANATTTGGATAATAAATGGAATCCTCAAAAACACAAGTGGAGGAAGGNTCTTGAAAGG 

GC 

Sequence 1704 

TCGACCACGCGTCCGGCCGGAGAAACTTGAGCCGGCTGCCCCGCCCACGGTGCCCGAAGC 

CCCAAAGGCTGGAATTAGGGGCTAGAAGTCTGGCACCCACCGCCTGGCCAGGTGTTCGGG 

ACGCGACCAGGTGGGCGGTCGCCCCGCCCCGGGAGCGCGGCTTAATAGCTGAGAGCCCGG 

GGGCCAGGCCGNGGCTGCGGCCCAGGCAACGCCCTGAGGGTGGCCACGCTGNCAGGTGTT 

CCACTCCCCCGGGACTATGGGCAAGGGCCCGGGGCGGGGAGGGCGGCAGGTGCTGACACT 

GGAGCTGCGCCGGAGGTCGGGGAACTCGGCCTCCTAAGACTGAGGACACTCGCCTGCTGG 

GCCGGNCGAGCTGTGCGGTGCCCTCCGGGACGCAGGGGGCGCTGCAGCCACGCTGGGTCA 

GGCTCCGAAGGGCCCTCCCAACCCGGGGA 

Sequence 1705 

CGCCACGCGTCCGGAAAGATGGAGGTGTGGGGACAGGAGCTGGGTGTGCTGGGGACTGGC 

CGCGGACCCCTAACCTGTGTCTCCGGTCTCCCTCCGGGAGCGGCTCAACCCAGCCCATCG 

CTCTGGCCCCGTTCTGGCCCTGCAGGGTGGTGGTTGGGACGTTGAAATGAGCGCGCGAGT 

GGTACGTCCTCTCTCCGCGCTCACGCCCCCCTCCTCACCGTGTTTCCCGCCAGGACCATC 

AGCACGTGCCCATCGACATCCAGACCAGCAAGCTGCTCGATTGGCTGGTGGACAGAAGGC 

ACTGCAGCCTGAAATGGCAGAGTCTGGTGCTGACGATCCGCGAGAAGATCAATGCTGCCA 

TCCAGGACATGCCAGAGAGCGAAGAGATCGCCCAGCTGCTGTCTGGGTCCTACATTCACT 

ACTTTCACTGCCTAAGAATCCTGGACCTTCTCAAAGGCACAGAGGCCTTCCACGAAGAAT 

ATTTTTGGC 

Sequence 1706 

TCGCCNCGCGTCCGCTGAAGCAAGAGAATCACTTGAACCCAGGAGGTGGAGGTTGCGTGA 

GCTAAGATCGCGCCACTGCACTCCAGCCTGGGCGACAAGAGTGAAACTCCGTCTTAAAAA 

AGCCCATGGCAGGCTGGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCCA 

AGGTGGGCGGATCACAAGGTCAGGAGATCGAGACCATCCTGGCGAACACGGTGAAACCCC 

GTCTCTACTAAAAAAAATACAAAAAAATTAGCCAGGCGTGGTCGTGGGCACCTGTAGTCC 

CAGCTACTCAGGGGGCTGAGGCAGGAGAATGGCGTGAATCCGGGAGGCGGAGCTTGCAGG 

GAGCCGAGATAGTGTCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAA 

AACAAAAAGCCCGTGGCAATTAATGGTAAAGGAAACCCGGCTTTTAGTGTAAAGAGGTAA 

CATAA 

Sequence 1707 

GCGTCCGGCCTCCAGCAAAGCCCATTCAGTCAGCTCTGCAGGCTCATCTTACAAGAATAA 

TCCCTTTGCCAGCTCMTCTCCAMCATGGGGTTTCTTCTGGCAGCTCTTCCTCGGGAGG 

AACACCAGTCCAGAGTTCTGTTTCTGGGAGCCTGGTCCCTGGCATACAGCCTCCCTCCGT 

GGGACAGGCCACCAGCCGACCCGTCCCAAGTTCAGCAGGGAAAAAAATGCCTGTTTCCCA 

GAAGTTGACTCTGGTAGCCCCTCCAGGCGGTCCAAACGGAGATTCCAGTGGTGGGACCCA 

GGGGAGTGGCAAAGTTGCTGACCTCGCCGTCCCTAAAGCCCTCTGCAGTTAGTAGTGTGA 

CATCGTCTACCTCCTTGTCAAAAGGAGGCGAGTGGGGACTGTGCTGCTGGCCGGCTCCTC 

TTTGATGGCTTTACCCTACAAATCCAGCAGCCCAAAGCTGTCTGGGGCCATGAGCTCGAA 

CTTCTTG G G AAATT AT AC 

Sequence 1708 

CACGCGTCCGGGAAGCGGTGCGCTCCGTCACCACGGGAGTGCGGGAATCCGCCGTTTGCG 
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CTGAGGCAATGGCGGCAGCTGCGCCGGTGGCCGCGGACGACGATGAGCGGCGGCGGCGGC 

CGGGGGCTGCACTGGAGGACTCCCGGTCCCAGGAAGGGGCAAATGGTGAGGCCGAGTCAG 

GTGAGCTCAGCCGGCTTCGGGCTGAGCTGGCAGGCGCCCTGGCAGAAATGGAAACCATGA 

AGGCTGTGGCAGAGGTGAGCGAGAGCACGAAGGCCGAGGCTGTGGCTGCGGTGCAGCGGC 

AGTGCCAAGAGGAGGTGGCCTCGCTGCAGGCCATCCTGAAAGACTCCATCAGCAGCTATG 

AAGCCCAGATCACCGCCCTGAAGCAGGAGCGACAGCAGCAGCAGCAGGGACTGTGAGGAG 

AAGGAGCGGGAGCTGGGCCGCCTGAAGCAGCTGCTGTCCCGGGCCTACCCCCTGGA 

Sequence 1709 

CACGCGTCCGCGGACGCGTGGGTCCGCGGCGGCGTCCGGGGTCTCCAGTAGGGCTGACGC 

TCCGGTGCTCGCACAATCCCCCGCCTCGGCTGGCAACGGGCGTCCCTCCACTCCCCGAGT 

CCCCGGCAGCCGCCGCCACCCCAGCGCGCCCCGATCTGGCCCCCTGCCCCGCGAAGATGG 

CTGCCGTACGCCGGGCCCGCAGTTATTGCCGCTGCCTGGTGCGCTTCTCCGACCGAGAAC 

TCTGCTAAGCTCCGCTGCAGAGACAGGCAGGAGTAGACACCCGGACACCCAGCACCCCTC 

CTTCGGGGGGCGGTGCAGAGGGGGCACGGAGAGCCCCTCGAGCGCAGCAGGCCGCCCCGC 

CAGCATGGCAGAAGCTGAGGAAGATTGTCATTCTGATACTGTCAGAGCAGATGATGATGA 

AGAAAATGAA 

Sequence 1710 

ACGCGTCCGGCGAGTGGCCTTCCCGGTTGGCGCGCGCCCGGGGCGGCGGCGCTGGAGGAG 

CTCGAGACGGAGCCTAGTTATGTCTGGGAGGCGAACGCGGTCCGGAGGAGCCGCTCAGCG 

CTCCGGGCCAAGGGCCCCATCTCCTACTAAGCCTCTGCGGAGGTCCCAGCGGAAATCAGG 

CTCTGAACTCCCGAGCATCCTCCCTGAAATCTGGCCGAAGACACCCAGTGCGGCTGCAGT 

CAGAAAGCCCATCGTCTTAAAGAGGATCGTGGCCCATGCTGTAGAGGTCCCAGCTGTCCA 

ATCACCTCGCAGGAGCCCTAGGATTTCCTTTTTCTTGGAGAAAGAAAACGAGCCCCCTGG 

CAGGGAGCTTACTAAGGAGGACCTTTTCAAGACACACAGCGTCCCTGCCACCCCCACCAG 

CACTCCTGTGCCGAACCCTGAGGCCGAGTCCAGCTCCAAGGAAGGAGAGCTGGACGCCAG 

AGACTTGGAAATGTCTAAGAAAGTCAGGCGTTCCTACAGCCGGCTGGAG 

Sequence 1711 

CNCGCGTCCGAAGGCACAGGCGTCTTGCTCTGTTGAAGCAAGTCAGTATCCGAGAAAACT 

GCTGTTCCCTTTGTTGTGATGAGGTAGCAGACACACAATTGAAGCCATGTGGACACAGTG 

ACCTGTGCATGGATTGTGCCTTGCAGCTGGAGACCTGCCCATTGTGTCGTAAAGAAATAG 

TATCTAGAATCAGACAGATTTCTCATATTTCATGACACATGTGAAGAGGCATCGTGGACT 

TTTTTCTACTCAATTCCAGCCAATGTTGAAAAGAAAAAGAAAAAAAAAACTCTAATCAGT 

TGTACACACATTGAAACTTATAGCCATGGCCAGATTTTATGCTAAAAATGGTAGTTTGTC 

AAAGACAAAATTCTCTTAGAATCTMTCCAACTTGCCAGCCCTGAGAAAATCCCTTTTAA 

GGCCAAGGGAAAGCTGAATGCTAGCAGCCAGGCCTGTGGTACTTCCATGAGAAACCATAG 

CAGGACAATGCCCTC 

Sequence 1712 

CCACCGTCCGGGCGGCCAGAGGTGCGAGAAGGCCGAGGAGAAGGCCAAGGAGATTGCGAA 

GATGGCAGAGATGCTGGTGGAGCTGGTCCGGCGGATAGAGAAGAGCGAGTCGTCGTGAGC 

GCGGTCGGCGGTTTCCAGCCAATGGATTCTGGTCAACTGGTGGAGATTGGCTGACACCCT 

GGAGAAGCCGAAACCAGAGAGCCTTTTGTTTTCTC I III! I CCTGTCTATGCTGTGTCTC 

ACTTAACACTACGTTTTCTGCTATGGTCTGTGGTTGATGACCTCMTATGAGTTTCGATT 

GTTMCCGTGTTTTTGTTTGGGAAGTMTTTTGTTTGAAAATGCTCTCACATACAGGAAT 

TAGGGCCTAGATTGTAAGCTCTTGCAGCAGTCACATTTGTTCCCGGGCTTTGGTGGTTAT 

TTTCTAMTTTTTGAGGTGCCTTTGCTATTTCTTGTGTGACCTGATAGCTTCCCCTG 

Sequence 1713 

GCGTCCGAGCCTCTGGGGGTGGATCCTGAAAGGTGGTCCAGCCGCCTGGCCCTGCGTGGG 

ACCCTCCACCTGGCAGCAGGGTCTCGCTCTGTCACACAGGCTGGAGTGCAGTGGTGTGAT 

CTTGGCTCATCGTAACCTCCACCTCCCGGGTTCAAGTGATTCTCATGCCTCAGCCTCCCG 

AGTAGCTGGGATTACAGGTGGTGACTTCCAAGAGTGACTCCGTCGGAGGAAAATGACTCC 

CCAGTCGCTGCTGCAGACGACACTGTTCCTGCTGAGTCTGCTCTTCCTGGTCCAAGGTGC 
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CCACGGCAGGGGCCACAGGGAAGACTTTCGCTTCTGCAGCCAGCGGAACCAGACACACAG 

GAGCAGCCTCCACTACAAACCCACACCAGACCTGCGCATCTNCATCGAGAACTCCGAAGA 

GGCCCTCACAGTCCATGCCCCTTTTCCCTGCAGCCCACCCTGCTTCCCGATCCTTCCCTG 

Sequence 1714 

GTCGCCACGCGTCCGCAGAAGATTGACAAATCTGAGGGCCGCTTCCATGTCCAGAACCTT 

AGCCAGGTGGAGCAGGATGGGCGGACGGGGCATGGACTCCGCAGATCTTCCAAGTTCTGC 

TTGAAGGAGCACAAAGCCCTCAAGACGTTAGGCATCATCATGGGCACTTTCACCCTCTGC 

TGGCTGCCCTTGTTCATCGTTAACATTGTGCATGTGATCCAGGATAACCTCATCCGTAAG 

GMGTTTACATCCTCCTAAATTGGATAGGCTATGTCAATTCTGGTTTCAATCCCCTTATC 

TACTGCCGGAGCCCAGATTTCAGGATTGCCTTCCAGGAGCTTCTGTGCCTGCGCAGGTCT 

TCTTTGAAGGCCTATGGCAATGGCTACTCCAGCAACGGCAACACAGGGGAGCAGAGTGGA 

TATCACGTGGAACAGGAGAAAGAAAATAAAC 

Sequence 1715 

CCCCCGTCCGCTTTTGTTNATCTAAAGGCTTNAGTCCCAI I I I I I I ATACGTTGT ATTTT 

AAMACGTTTGAMGGAGTCTTACACCTGTATCATGAAMCTGMTCCTTTTGAAATACC 

ACTATATGMGAGAGAGATGAAATTTAGTGAACAGAATTGGAAAAGGTGCTCATAATTTC 

ACTAT GCAM CTTACCCCAGTCTCTAAAAMGTMTTTAGATTTAAAGTTCTTTGATGTA 

TTTGATTTTCTAMTCTTTATGGTTATGATTTGGMTAAAATGTGCCTAATCCTGTGTTA 

CATTCTGTTCTTAMTCTGAATGCCTTCTCATTTAATTCTGAGGAAATATCACACAAGTG 

TCTTCATTGACCTTGAAGAAATGTATATACAGTTGCCTTATAAAACAACATAAATTTAGA 

CCATMCTTTTATAGAGAAAGGGTTTTGTCAAATGTTTTCTGAAAATCTGAGTAATTCAA 

AGCATGCCTCTGCCCCCTTTAATA 

Sequence 1716 

NGTCGCCACGCGTCCGGCGCTCTCGGCCGCCGCCGCCTCTGCGTGGGCCGGCCGGGAGGG 

CCTCGGGGGACTGACTGACAGAGTTTCACTCCTGTTACCCAGGCTGGAGGACAATGATGT 

GATCTCGGGTCACCACAACCTCCGCCTCCCGGATTCAAGCGATTCTCATGCCTCAGCCTC 

CCGAGTAGCTCAGATTACAGGCATGTGCCACCACGCCCGGCTAATTTTGTATTTTCAGTC 

GAGACGGGGTTTCCCCATGTTGGTCAGGCTAGTCTTGAATTCCCGACCTCAGGTGATCTG 

TTCGCCTCGGCCTCCCAAAGTGCTGGGATTACAAGGCGTGAACCACTGCACCCGGCGAGG 

CATTTTTTACTGTCTACAGAAACTTATTGTAATTCATTTTTCCTCACTCCAAGTAGTAAG 

AATTATACCAAATTGAAAAGATATGAATGAGTATCCTAAAAAAAGAAAAAGGGA 

Sequence 1717 

CCGAGGCNCTGATAAGCCNTGGTAACGGGAAACACAGCTCTAACCTCACCTCATTCTCCA 

GGTTACAAAGGCCATGTGCCCCTTTGAATCTGGCAGAGAAAGTTTCCTCGTTGTAAGTAT 

TTGCATCTACTTCAAGCCAGATTCTTCTGCCTCTTTCTCCTTTCCAGACCCCTACTCTGT 

GCAGTGCTGACCACAGCTAGAGCCACCGCCCCATTGCTCAACCAGTATTTATTTCCCTAA 

ACGACCCTTCCTCATATTCCCTTCCCTCCACCTCTCCTTACCAAGCACCCAAAAGAGGAT 

TTAGMCTAGCAGGGTGGACATCATTCTGGTTGT1TCTACTTTTCTCTGCCTAGCACAAA 

ATTGGGAGAAAACTGGAGCCTCCATCCGCAGTCACACGTGTACAGATCTGGGNGATTTGG 

ATGTAGGCTTTTTCTAACTTCTCTCTCAGAAGCTTCTACA 

Sequence 1718 

CGGACGCGTGGGTGCCGCCGCCGCCGTCGCTGTCGTAGTCGCCGCCGCCGCTGCCGGAGA 

AAGAGCACGAGCGGGGAAGCCCCAGAGTGAMTCTAGCATCCTGCCGGCTGGTCTGCCCG 

CCCCTCCTTCCTTTTCCCCCCGGCCCCNGTCCCCTCCCNCCGCAGGTGCCATCCGTCGCC 

ATNCGCCTCTCTACCCTCNCATCCCCAGGTGAGGGGGGTGAGTTCAGGAAGCGGNNACCC 

CNAGGAACCCANCAGGGTCACCATTTGCAGCGCAACATGGCAGGAGCTGGAGGAGGGAAT 

GATATTCAGTGGCGTTTTTTTCAGGTGAAAGGAGCAGTATGATGATGATGTAGCAGNAAG 

CANGATATNATTTCTACAGTANAATTTMTCNTTTCTGGGAGAAATTCTAGCAACANGAA 

GATAAAAGGTGGGTAGAGGTTGTCATTCTTTCAACAGGGAGCAGGGAGAAC 

Sequence 1719 
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TCCGGCCGGCCAGCGTCATCACCATCGTCAAGTCCACCCCGGGCTCGGGGTCTGGCCCCG 

CCCACGGNCCGGACCCCGNCCACGGCCCGGCACNGCNGCTCCTAGTGCCCGCCCCCGCCA 

TGCCTCANCCACGCCCCACCGGCCATAGNTNCCCTCCCCCAAAGNGCNCTGAGGCGCATG 

NCTTCNTGNGACNCAANGNTCCCATTNTCTATTGCAGGCAACATGGCCATTCCCCGAAAC 

TAAAAAGCAGTTGGGGNGGGNGAAAGTACNANGTGGAAAACCCAGNNATCACCGGNANTG 

GNGGGAAAACAANGGCCNGNAGGGACACATTTCCAACTTTAAGCTGGGCNAGTGNNTGGG 

GAACCAAAAAACCTGGGGNGGNCCCNTCANTGGCANGGCCCNTGAGNNCNCAGCNCATGN 

GCATTTNCAGGGGNNGAACCAGNGGGACAAGGGGGACCTCAANAAAAATGNTGGNTGGGG 

ANGGGCCCANTCNCTGGGCNNGGGGGAAGAACCCGNATTCTACNAGCAAAAGGGGANGGG 

GGGGACNCAGCAAAGGCAATCANGGGAAGAAGGAAGATAAGGGTCNACCANCGGGAATNG 

GCAAAGAAANGGGGGAA 

Sequence 1720 

CTGANGCTCGTTTTCGTGAAATTAAGCTTCAGAGGGAAGCCCGNGAAACACAGGAGAGCG 

AGCGCAAGCCCCCACCATACAAGCACATCAAGGTGAATAAGCCTTACGGGAAAGTCCAGA 

TCTACACAGCGGATATTTCAGAANTCCCTNAGTGCAACTGCAAGCCCACAGATGAGAATC 

CTTGTGGCTTTGATTCGGAGTGTCTGAACAGGATGCTGATGTTTGAGTGCCACCCGCANG 

TGTGTCCCGCGGGCGAGTTCTGCCAGAACCAGTGCTTCACCAAGCGCCAGTNCCCAGAGA 

CCAAGATCATCAAGACAGATGGCAAAGGGTGGGGC 

Sequence 1721 

CATGGCTTCTGCGAGAAAAGTGATTTAGGCAGACGGAGGTTTTTTCTCMTCAGAGGCTT 

TCAGTAACTCTGCTGATGCACAGAGAAGAGACTTCCTCAGCCTGCAGGCTACAAGAGCCA 

ACTGTTAGTGCAAAAAAGGACTTTAATACAAATTTCTTATTCCAGAMTTTTGTTCCAGG 

TCTGGACAAGCTGAGAMTTTATCATTGTTTTTCGAGTTTTTAAGATACCCACTTT^ 

GAGAGGTATGGGTGTGTGTGCAAGGCACACACATACAGTCTTTCTGTACATGCATGCATA 

TTTATGCATGTACAGGGAAGTATCCAGACACCAAATTTTTAATAAAATGAATTCCCCAAA 

GGGGAGTCTTGACCTGAATTMGGCTGTTGTTTATAGGGAAGCCAGATATAATTGATGNT 

GAAAAANAAACTAATTTTTATACTTAATCACCGGCAGNTANCGGGGGCANGGGGGAAAAA 

GTACAGANGGGGTGT A I 1 1 I I IGG I I I I I II CT 

Sequence 1722 

TCGCCACGCGTCCGCTCTTAACACAGAGTCTGCAGCCCCTAACTGACACCCTGTCCTTCC 

TCCTAGGAAGTGCTGGACTCCCTGGTCAGCAATGTCAACATTGAGCTGCTCAATGCCCTC 

CGCTACCATATGGTGGGCAGGCGAGTCCTGACTGATGAGCTGAAACACGGCATGACCCTC 

ACCTCTATGTACCAGAATTCCAACATCCAGATCCACCACTATCCTAATGGGATTGTAACT 

GTGAACTGTGCCCGGCTGCTGAAAGCCGACCACCATGCAACCAA 

Sequence 1723 

ATCCGGTTCGCCCCNCGTCCGGGCGGCCGAGGCGGAGGCAGCGGCGGCGGGATGGCGGAC 

GCCAACAAGGCCGAGGTGCCCGGGGCCACTGGTGGCGACAGCCCGCACCTGCAGCCCGCA 

GAGCCGCCGGGCGAGCCGCGGCGAGAGCCGCACCCCGCGGAGGCGGAGAAGCAGCAGCCG 

CAGCACAGCAGCAGCTCCAATGGCGTTAAAATGGAGAATGATGAATCAGCAAAAGAAGAG 

AAATCTGACTTAAAGGAAAAATCTACAGGAAGTAAGAAGGCCAATAGATTTCATCCTTAT 

TCAAAAGACAAGAAT 

Sequence 1724 

GTCGCCNCGCGTCCGTGC I II I I I I CGACATACTGGTTTTTCT7TCTGTTTTTCTTCTCT 

TTCTTCTATTTCTTGTGGATATTATGGCTAATAACACAACAAGTTTAGGGAGTCCATGGC 

CAGAMACTTTTGGGAGGACCTTATCATGTCCTTCACTGTATCCATGGCAATCGGGCTGG 

TACTTGGAGGATTTATTTGGGCTGTGTTCATTTGTCTGTCTCGAAGAAGAAGAGCCAGTG 

CTCCCATCTCACAGTGGAGTTCAAGCAGGAGATCTAGGTCTTCTTACACCCACGGCCTCA 

ACAGAACTGGATTTTACCGCCACAGTGGCTGTGAACGTCGAAGCAACCTCAGCCTGGCCA 

GTCTCACCTTCCAGCGACAAGCTTCCCTGGAACAAGCAAATTCCTTTCCAAGAAAATCAA 

GTTTCAGAGCTTCTACTTTCCATCCC 

Sequence 1725 
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Mtntgccagcctttattgagctttacmctmcaacctgatagttggcagttaattcac 

AGTTACAGATAATGCTTTTATTTACATAAATATACCAAGTAGTACCCTCTTATTGTATTC 

ACTTCATCTATTTTCTTAGAATACTTGCAATTTCTMTGACCCCTTCCCTTTCCCTCCTG 

CTGCCCTGTCCACCCTCTTTCCCCTTCTAACATCCTTAGAGGGATGAAATCTCAGCATAT 

GTTGCAGGACACCAAAAGGAAGAAAACAATCAAGCAAATAAAATAAACAGTCAAACAAAC 

CAGGAGTTTAAAACAACAACCCCAACAACAGAAGCCTTGGCAAAGAGGAATGAGTGATCA 

GCAAGTGAACACACTCTATGTCMCTCTCCTTTTTATCCAGCTGAGATTTTATGGTAACC 

TTTAATTTAA 

Sequence 1726 

CCNCGCGTCCGGAGCCGAGAGTGTGTGGAGCAGTTACAGCTGGAAGACCGGGTCCTCTGC 

CTCCACAGTAGATGGCTGAATCCTCTATGCGGGACTGGCAAATGGCACTGTGGTCACCTT 

CAACATAAAGAACAACAAACGACTTGAGATCTTTGAATGCCATGGCCCTCGGGCAGTCAG 

CTGTCTTGCTACAGCTCAGGAAGGTGCCCGAAAACTGCTGGTCGTGGGGTCTTATGACTG 

CACAATTAGTGTACGCGATGCCCGGAATGGACTGCTCCTCAGAACTCTGGAGGGCCATAG 

CAAAACCATTCTTTGCATGAAGGTGGTGAATGATCTCGTGTTCAGTGGCTCCAGTGATCA 

GTCAGTCCATGCTCACAACATTCACACTGGTGAGCTCGTGCGGATCTATAAAGGTCACAA 

TCATGCAGTGACTGGTGGTGAATATCCTAGGAAAAGTGATGGTGACTGCTTGCCTGG 

Sequence 1727 

CNCGCGTCCGGATNMTATTTTCATCCCTGAGGTTAACMTTACCATCAAMTGTTTTGT 

GGAGACTATGTGCAAGGAACCATCTTCCCAGCTCCCAATTTCAATCCCATAATGGATGCC 

CAAATGCTAGGAGGAGCACTCCAAGGATTTGACTGTGACAAAGACATGCTGATCAACATT 

CTGACTCAGCGCTGCAATGCACAAAGGATGATGATTGCAGAGGCATACCAGAGCATGTAT 

GGCCGGGACCTGATTGGGGATATGAGGGAGCAGCTTTCGGATCACTTCAAAGATGTGATG 

GCTGGCCTCATGTACCCACCACCACTGTATGATGCTCATGAGCTCTGGCATGCCATGAAG 

GGAGTAGGCACTGATGAGAATTGCCTCATTGAAATACTAGCTTCAAGAACAAATGGAGAA 

ATTTTCCAG 

Sequence 1728 

TCGACCACGCGTCCGATCCTGGATCTGGAGAGAGAGCTCTCCAAGCAAATCAACGTGTGC 

CTCTGAGCCAGATGACGGGGTGGGACCCCGGTTAGTAAGGACCGGGCGCCCAGTGGCTAA 

GGCGGTGCCCTGGTGACCAAGGAGAGCCAGACCTGTTGCTCAGGCCGAGCTCCTGGTTGC 

CAGCGAGTTACCACGGGACCAGTCGCGTGTATGGCTGAGACTCATTCCCAGTTTCCAGGG 

CCCGGTATTTGGACACTAGTTGCCMGTCTGGGGCCTGGGGATTTTAGGGACCAGCGGTT 

GTGACCATCTTTCCTGAGCACCAAGGGCTTCCCCTTTTGTTGCCAAAAAGGTAGTTCTCG 

CGCTTGCTAGGCTGGCCTCTCTTGCCTCCCCTTGGCCGGGGC 

Sequence 1729 

TCCGAMCACCTGTCATTTTACACAAATGCGTTTTGAATGTCTGAAAGACAGCTCCTGCC 

CTTMTTTAGATGTAAACCATTTAGTTTCAAACTAACCACCTGATAAAATCTATCAACAT 

TTTATCATGAACTAGAGCAGATGTCTGTTATTTGATGTCTATGTTATTTGAGTTTACTGT 

TTMTMGTGMTTCATATCAATTAATCCTGCTAACAAATTTGACACTTAAGGTGATTCT 

GAAMTCCTTTAMCTTAAAGTAGATGGMTCTTMGTATGGGGCCTTTTAGTGTCCGTA 

AAGAAAAACTGCATGCAACAAAATATAGCAGGTCCTCACTTGTTGAGATTCATGGAAATT 

GTGACTTTAAATGAAATGACATGGCTGGGCATGGTGGCTCACACCTGTAATCAGCACTTT 

GGGAGGCCACGGCGGGTGGATCACGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 

T 

Sequence 1730 

CTGNGAGTTAAATTGGTCCAGAAACAGTTATGACCCTCTTTATACTGCCAAGAAATACGC 

AGTCCCAGCCTTGGAAGCACACTGTGTAGAATTTCTCACCAAACATCTTAGGGCAGATAA 

TGCCTTTATGTTACTTACTCAGGCTCGATTATTTGATGAACCTCAGCTTGCTAGTC7TTG 

TCTAGATACAATAGACAAAAGCACAATGGATGCAATAAGTGCAGAAGGGTTTACTGATAT 

TG ATATA GATACACTCTGTGCAGTTTTAGAGAGAGACACACTCAGTATTCGAGAAAGTCG 

ACTTTTTGGAGCTGTTGTACGCTGGGCAGAAGCAGAATGTCAGAGACAACAATTACCTGT . 
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GACTTTTGGGMTAMCAAAMGTTCTAGGAAMGCACTTTCCTTAATCCCGGTTCCCAC 

TGATGACAATTGAGG 

Sequence 1731 

ACCCCGCGTCCGAGCAGCCTCCAGTTGCCCCTACTTAGTGGCTTGCCCTCTGCCTGCCTC 

AGCTGCTGCCTGACCGGCTGGGGGAGGCACTGGCGGGAGGCCTCGGGCTCCCCTGGAAGG 

GCGCTGGGCTGGCGGGTCAGCTGGTGGTTCTTAGGTTTCCTTCTGTTTGTTAAAAGGGAC 

MTGTGGCCACTTCTCTGTGGAAAGGGAGTTGGTTGGGGGGTTGAGATGGCCCGTGTTCA 

TMCTCAGTTTCCTGTTTTGCACGATGTAAAAACCCTGTCTTTTTGCACGATACAGCCAA 

MGTATTGGCTGATTTCTTGCTGAGTGCCCTCTTAGTTGGTGTGTGAGGTCTTGGTGGGC 

TCAGGCCAGCTGTTTGCGAGTGTGGGMCTCATAGGTTCTGTCTTTGTCTCTTCCTTTCA 

CCTCATTCTGGTAGCAGCATAAAGGTTAGGCAATCACTGGGACCC 

Sequence 1732 

GCAAATCATACATTGCATTCCCCAAAGCATCTGAACGTACTTCTAGAAAACAAACCAACC 

AAAAGGGAAMTATGCATGCTTTTTGTAATTAACTGGTGCTTTGAAAATCI Mil IAAGG 

GAGMAAATCTCMCCAMGTTATGCTCATCCAGACAAGCTGACCTTTGAGTTAATTTCA 

GCACAACTCATTCTTCAGTGCCTCATGACTGAAAACAAAAAACAAAAAAAGAAAGCATCT 

TCNCMTGAAGCTTCCANATAGCACCGTTTTGCTAAMGATACATTCTCATTGTTTTCCA 

ACAGNGATGGCTTCCACATMNGGTTAMCAAACTGGGNGCTTGGAAATMTTTNTNACN 

GGTTACTTNTTCGCATTTTTTNGAACNAAGGAAANGGGAT^ 

GAATTGNGGNCAAGTTTAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 1733 

CGCGTCCGAATTTAGGAAGACCCCGGCGACCTGTTCCTCACCCCCGCTTCGCCCTCACAC 

TTTCGGGATGTCTGCGATTCCTGCTGAGGAGAGCGACCAGCTGCTGATCCGACCCCTTGG 

AGCTGGGCAAGAAGTAGGAAGATCATGTATTATTCTCGAGTTCAAAGGAAGAAAAATAAT 

GCTCGACTGTGGGATCCACCCTGGCCTAGAAGGAATGGATGCTCTTCCTTATATTGATTT 

MTTGGACCCACCTTGGMTTGNACCTCCNANTMNTNANNNCATTTTCATTTTGGAANC 

^GGGGGGGCNTNNCCCCNNGGTTTTTCNNAAAMNNCCCCTTNAAANGNNAAAAAATTT 

TTTTNTTTTTTNTCCCANAAAAAA I MINI I NAAAANNCCCNCTNNTNTNAGGGGNTNT 

TNAAAAAATTNCCCAAAAAMNNNNCCCCCCTTTNTNT^^^TTTNAAAAAAAAAAAAAAAA 

AAAANATNGNCCCCNNAA I I I II I I I I 1 1 ATATTTTTNANNAAAANAAANNTNTTTTTTN 

AAAAAANTNTATN I I I I I 1 1 I CNNTNNANNNNGTGNNNNCCCNNN I I I I 1 1 1 I 1 1 I I 

Sequence 1734 

CCACGCGTCCGCTCCCGCCAGGCGCTTTCTCGGACGCCTTGCCCAGCGGGCCGCCCGACC 

CCCTGCACCATGGACCCCGCTCGCCCCCTGGGGCTGTCGATTCTGCTGCTTTTCCTGACG 

GAGGCTGCACTGGGCGATGCTGCTCAGGAGCCAACAGGAAATAACGCGGAGATCTGTCTC 

CTGCCCCTAGACTACGGACCCTGCCGGGCCCTACTTCTCCGTTACTACTACGACAGGTAC 

ACGCAGAGCTGCCGCCAGTTCCTGTACGGGGGCTGCGAGGGCMCGCCAACAATTTTTAC 

ACCTTGGGMGGTTTGCNACNATTCTTTGTTTGGANGANTAAMAAAGGTTCCCAAAATT 

TTCCCNCTTTNIAAAAGANNNTNGNGACCNNNCACTNGGGGGGGGGCCCCAAAAAAAAATT 

TTTTTTTNTTAANNCCCCCCCGGGGNNGNAAAAAN I I I I I I 1 1 GGGGGGGGNCCCCCCCC 

NNGGGNNAAAANNNTTTTANAAAAAA I I 1 1 I I i I 1 1 1 N I 1 1 I I CCCCAAAAAAAAATTTT 

T 

Sequence 1735 

GCGTCCGAAAATACAATACACGGACTCTTTCGAGGCCCTGTAATTGGAATGAGTCCACTT 
TAAATCCTTTAACGAGGATCCATTGGAGGGCACTTCCAGAATACCTCCTCCCCCAGCGCC : 
CGCTGCCAGCCCCACACCAGGTGTGAGAACCAAGGTCTGGTGGAGGCAGCTCCAGGCACT 
GCCCAGTCCGACACAACCTGCAAAAATCCATTAGAGCCACTGCCCCCAGAGATG 
Sequence 1736 

CGAACCTCCTGGTTCCAAGTGGGAGACATGGTGCTGTCGGAGCTAGCGGCGCGCCTCAAC 
TGCGCCGAGTACAAGAACTGGGTGAAGGCGGGCCACTGCCTGCTACTGCTTGCGCAGCTG 
NCTGCAGGGTTTCGTCGNCCGCNAGGTGCTCTCCTTTCCACCCGCGGCCTACTCGCCGCA 
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GCCCCCGGC 
Sequence 1737 

ATCCTTTTGCCTAAAGATGTAAACAAAACTCAAGACAGAAGGAATCAGGGAATATGTGCT 
NTTGTGNGC ATCTT GTTTACATTTNGGGATCAANTGATGGCAAAAGAAGTAATGAGACCA 
CTNNMATTGTTTTNCANTTGNNTTTAAAANACCA^^ 

NMGTTTTMCAATTGACCTTCTTAAGNGACATTCTCTTTCAAAAAAGANANGTAAANCA 

GGNGAAATGAAGGGTGGNTGGGGAAA 

Sequence 1738 

CCGGCTCATTCCCTGAGGCCGGCCCGCGTCCCGTCAGGGCGCCGCGCGGGGTTAGCGCGG 

GGTCAGCGGAGGTCAGCGGGGGTCAGCAGCAGCGGCTCCGAGGGCGCGGCGGACGCAGGA 

TGTACACGCTGCTGTCGGGCTTGTACAAGTACATGTTTCAGAAGGACGAGTACTGCATCC 

TGATCCTGGGCCTGGACAATGCTGGGAAGACGACCTTCCTGGAGCAGTCGAAAACCCGAT 

TT 

Sequence 1739 

GTCCG NCCAC GCGTCCGATTTCTTGTGTGCTTTGAAAAAAGTTTCAGCTTGCTGTCTCTT 
TTAGTGTTTTAAAGAAGTGTTATACAAAGCATTGTTTGCAAAATATAGGGAGGATAATGG 
GAGTCC CACTT TMN TTNGG GAANTCNTTGGGNGANCTTNTTNATCCAAGGTTTANTCAA 
GCCTTCNTTTTCCAACTTTTAAAAAATTTTTTGTTMAA 

AAAAAI I I I I AAAAATATTTTANTGTTCTTGCNACCAAATTTGTTCTTNAAAAAATAATA 

AAMCTTGNTGGCAAATTCNTTTGGTCNNTTTAAAAAAAAAAAAAA 

Sequence 1740 

ACGCGTCCGCAGCCATCTTGGGATCTGGGCAAGTGAGCGAGCTCCTTCGTCACCGGGCTG 

ACTAGCCTCTCCTTTCCCTGTCCCCCTCCATCGCTGCTCTGCAGGAAGCCAGCCCCCAGG 

GCCAGTCCCGGAGGGGCTGATCCGCATCTACAGCATGAGGTTCTGCCCCTATTCTCACAG 

GACCCGCCTCGTCCTGAAGGCCAAAGACATCAGACATGAAGTGGTCAACATTAACCTGAG 

AAACAAGCCTGAATGGTACTATACAAAGCACCCTTTTGGCCACATTCCTGTCCTGGAGAC 

CAGCCAATGTCAACTGATCTATGAATCTGTTATTGCTTGTGAGTACCTGGATGATGCTTA 

TCCAGGAAGGAAGCTGTTTCCATATGACCCTTATGAACGAGCTCGCCAAAAGATGTTATT 

GGAGCT 

Sequence 1741 

CANCGCCCCGGCCCGNTCAAGCAGCNTAATAAAGCTCATANAGGCGGACNGGCATCNGGG 

TCTNGGGATCTGCACAGCGGGACTGGCATAGGGCCGTCTGGCACTGAAAACCCTAANCAA 

GAAGGTGTGAAAAGAAACTTNAGCNGANNCGACCNGAGGCATCNNCGCCAGCCAGCTCCG 

AAAGCAGAAGANGGAGGCGGTTCTGGCATGAGAAAGANACAGCTGGGCTGGCNAGGATAG 

GCCCCTCCTTCATCAAGGTACTGGTGGTGCCCCTGCACAGNANAATTNTNCCTGCCAGAG 

GCCATGCANCCTGCTTCAAGATAGGGACACTGGAACAGATACACTTGAATGAATTGGGGA 

AACACCCCAGAACTTTT 

Sequence 1742 

CACGCGTCCGTTTTTTNCAAGGGTCTATTGTTTCGATTAGTTTCCTTGCAGGAGGTAGAA 

GGTTTCCTCCATCCCGAGTMTTCATAGAGTGTTTGGCTTGNCTACCTTCCTCCTGACTG 

MGTCACCTGATACTTTTTGTTTTTTCAAGAAGGMGAGAGMCCCTGTTGCCTCAGTTA 

CTAGCAATGATACAATTCTCAAAATCTGGTC I I I 1 1 I I GTTTCTTTGAAATAGTTTCTCC 

ATGTTGTGTGACACAGCAGCCCCTGTCTTATCATAGTTGTCTTCCCTCCACCACCTGTAC 

CAGAGATGTTGGATATGTTGGAGGTGMGGTGTGCAAGGTTTTTAACTAACTGTTCTAAT 

T AMGGATT CTGCAGGAAAGMCATGGGTTTACAAAAGAGAAGCTTTTGTATTATTAGTA 

A I 1 1 I I I I ICTTTGATGMTTTATGTGCTTAGTTTGGAGAATCGAGAGTTGGCTGGGAAA 

AGATTTCTGAGGAGTTAAGGGACTCTGGTGCTGTTTGGGAA 

Sequence 1743 

GTGATTAGAAGTAAGCGNTGATGAGGCTGAAGAAAAGGAAGACAAAACTGAGTACTTGGA 

GGMCGAAGAGTMTGGGATATCCAATAMCTGAGAATGTTTCTTCACAATCTCCTTTAT 

TCTTCGTTCCTCCAAGTACTCAGTTTGGTCTTCTTTCAGGTGTAGGAACGTGATAAAGAA 
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GTAAGCGATGATGAGGCTGAAGAAAAGGAAGACAAAGAAGAAGAAAAAAAAAAAAAAAGG 
Sequence 1744 

CGATTTCTTTGTTGGACAACCCAGCTGGGGCTAGGAATGGTTCAGAAGGTTTAAGGCCGG 

MNGGGNMTGMGGGGCCCGGCGCTMCCCTCTAGGGACCTGTTTTGCTTCTGTTTAAA 

CCAAATGGGCAGTCTGTCATTACACACACCCTGNGTCTTCATATGTGGCTCGCCAGTATA 

ATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCCTGGTCGAGACGGGAGCCCTGGGGCCA 

ATGGCATTCCGGGTACACCTGGGATCCCAGGTCGGGATGGATTCAAAGGAGAAAAGGGGG 

AATGTCTGAGGGAAAGCTTTNAGGAGTCCTGGACACCCAACTACAAGCAGTGTTCATGGA 

GTTC ATTGAATTATGGCATANATCTTGGGAAAANTGCGGAGTGTACATTTACAAANATGC 

TTTTCAAATAGTTGCTNTAANANTTTTGTTCAG 

Sequence 1745 

GGACGCGTGGGTGGAAATGTAAACAAGAATAGACTGTTCATTCCTGATGGCTTTTAGTCT 

ATACTAACATATTGTTTGTCATGGCATCCGAGACTGAAAAGACCCATGCTTTACTGCAGA 

CTTGTAGCACTGMTCTCTTATTTCCAGCCTTGGGTCTGGGGGCATTTTGCCTCGTAGCT 

GACAGACTTCTTCAGTTTTCCACAATTCAGCAAAATGACTGGCTTCGTGCTCTCTCAGAT 

AATGCAGTACATTGTGTAATTGGCATGTGGTCATGGGCGGTAGTCACTGGAATCAAGAAG 

MG ACTGAC TTTGGAGAAATCATTTTAGCTGGATTTTTAGCCTCTGTTATTGATGTAGAC 

CACTTTTTTCTAGCTGGATCCATGTCTTTAAAGGCTGCTTTGACTCTCCCGCGAAGACCT 

T TCCT TCACTGTTCTACTGTGATTCCCGTTGTGGTTCTGACCCTGAAATTTACTATGCAC 

CTTTTCAAGCTCAMGACTCATGGTGCTTTCTTCTGGGATGGTATTTATATCC 

Sequence 1746 

GTGMCAGGIsrrATTGACTATGGTAAACTTATTTTATTGMGTTTTCAACCGGAAAATAGT 

MG TGGAA TATGATACAACTGTTATTTTCAGAACATATTTCTTTAGGGCTATTTAAAATA 

ACCTTTTTAAAGGGCAAAAACTTTCAATTTGAGAGAACAAATTCCTCTCCTCTGTGGGAA 

ATATTGGCTGAGATTTTGTATAGAATAAGAGACATGTATGTMCATATATTTATATTCAG 

CATAAGTCTACTGCMTCATGTACACATCTTAGCMGACGAGAGGATTTTGTTTAGTCTT 

TGTTTATGACTTCTACAGTTTCCTGTATCTAGTGTTAAGTTGTAAGGAAAAACTAAACAT 

GCAATTTAAAGGTAAACTTGATMCTATTTATGGAACATAAGCATACACCAATGGTTATT 

TCTCACAGTTTTCATGCGCATTTGTTTATTGTTTACTTGGATTAGGCTCATTAAAAACCA 

TAATGCTGGTCACAATTAGMTGCTAATATTTGGGGAAGCTATGCAGAAAATATTT 

Sequence 1747 

CNTGTGTGCCATGTATACCTAACGGGAGTCCCAGAAGACAGGAGAGAAAAAAAGAAAGAA 

ATAAAAAGAATATTTGMTTTAAAATTGCTTGAAAATGTCTCAAATTTGATGAAAAATAT 

TACTCTGCACATTCMCCCATGMCTATMGTTGTATAAAATCAAAAAGTTTCACACCAA 

GGCGTGTCATAGCCAAACTGTCAAAAGCCAAAGACACAGAATCTTGAAAGCAGTGAGAGC 

AAAGCAGACAAGGGATCCCCAATAGGATTAACAGCAGATTTCTCATCAGAAGCCATGCAA 

GCCAGAAGGCTATGGGAGACATACTCAAAATGCTGAAATAAAAGACTGTCAACAAACATT 

TCCACATCCAGCAAAAATCAAAAACGAAGGAGAAATCTGTTGCATGTGAGCTGAATAGAA 

mGTATTCTGCTGTTGTTGGATTGMGTATTCTTTAMTGTCAATTANATCAATTTG 

Sequence 1748 

CGATGGCAATACATGTACTCAGATAGTTACATCCCTATATAAAAAGTATGTTTACATTTA 

MAAATTAGTAGATMCTTCCTTTCTTTCMGTGCACMTTTCATTTTGACTTGAGTCAA 

CTTTTGTTTTGGMCAMTTAAGTMGGGAGCTGCCCMTCCTGTCTGATATTTCTTGAG 

GCTGCCCTCTATCATTTTATCTTTCCCATGGGCAGAGATGTTGTMGTGGGATTCTTAAT 

ATCACCATTCTTGGGACTGGTATACATAAGGCAGCCGTGAAACTGGAAAGTCATTTTGAT 

GACTG ATGTG ATACATCCAGAGGTAAMTGCATTTAAACATATTAAAGTATTTGCCAAAG 

ATACAATTTTCTTGCTGACATAAAAATCACACAAACAAGTCCCCCCCAAACCACAACTGT 

CTCTCAAATAGCTTAMAAAATTGAAAAACATTTTAGGATTTTCAAGTTTTCT 

AAAAAGATGTTCAAGCTATTAGAGGGAATGGTAAA 

Sequence 1749 
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GCCCNTCCCGACCCTCCAGCTCATGGTGTCTGGGGCCTGCGGCTAGACTCTTGGAACATT 

CTGGAACTCTCTCCTTTCCTGGCTGGGGCTCTGACCACAAACTCCCCTCCAGGCTGCCCC 

TGGGACATGGTGGTGATGTGGGTGCAGGAGCCAGTGTCTGTTGTCGGGACTCGCAAGTGC 

CCTCATCACAGCCACCCCCACCACGAGTGTCTCCCCAGTGCAGACTCAAGTTATGCTTGA 

AATGAAAAAGTCTATCTGGTAGTGGGTAAACGTAGACCTGGCACTGTTCCACGCGGGCGC 

CCCAAGCCTGCCACTCCTGTGTCCCTGCCTCCCTGGGCTCCCGAGATAGGCACCACTGTA 

TCCTCCAGCTCCTTCCTTCCTTCCCCCAGGAACACGGAGGCCACCGAGGGGGCTGGGCCT 

ATCAGGAGGACACAGGCTGCAGCCTGGCACCCACCCCTCCATCTTCACCCCACATGGAAG 

ACTTGTCTTCAAC 

Sequence 1750 

GCGTCCGGATTCTAAAAAAAAMAAAAAAAAAAAAAAAAATTMTTGGGGTGCCTTTTTG 
TTATAGTTTCTATTTTCTGTTTTGTAGGACMGCTGCATTTTCTGTAAATATAGGTCTGG 
ACTAAAGGATACATAAAGAATGCACAAAATGTCAACATCAGCAGAGATGCCCAGATCTAT 
TTATCTCTAAGTATAmGAAGTGATTGCTC 

TCAGTAAAGCTACCTGTAAAATTTCAGTCCAAAAAAAATAAAGCTCTCAGGGAGACATGA 
ATAAAATCAATGAACATTAGAAAATAAAATATAGATGCTTACCATTAACCTACCAACTCT 
TAATATCCTTAAATTATGTGATATATAAAGAGGACTGTTAC I 1 1 I I l ACTTTCl 1 1 I I I I 
I I I I I I 1 1 GGCTTTGCTTTATTTATTTGGAGT 
Sequence 1751 

GGGCATGCTCATAGGCACAGCTGTTGGTCAGTATGCCAATAACATCACACTTTGGATCTT 

TGCAGTCACTGCAGGCATGTTCCTCTATGTAGCCTTGGTGGATATGCTTCCAGAAATGTT 

GCATGGTGATGGTGACAATGMGAACATGGCtTTTGTCCTGTGGGGCAATTCATCCTTCA 

GMTTTAGGATTGCTCTTTGGATTTGCCATTATGCTGGTGATTGCCCTCTATGAAGATAA 

MTTGTGTTTGACATCCAGTTTTGACCTTTCCCAGTAATCACTGTTGATTACGAGAATGT 

TACCATGCAGCTTTGCATCTGTTCCTTGTACTGTATGCACATTGCTCAAAGGAAAGTCAG 

TGGCTT GCAC TACTTACAAGTTTCATAGATTTGAGCCTMCCACAAGAGGCTGGTGCTTA 

GTACTGTTTTCCCTGCACGTAGGGGTCTTTTAAAAATATAAAGCTTGTGATAAAGAGAGG 

A 

Sequence 1752 

CTGGTTCAGCAGCCGCCCACCCACCTCTGAGTCTGACCTGGAACCTGCCACAGATGGGCC 

AGCCTCCGAGACCACTACCCTCAGCCCAGAGGCCACCACCTTTAATGACACCAGAATCCC 

TGATGCAGCTGGTGGCACGGCCGGCGTGGGTACCATGCTTCTGTCCTTTGGGATCATCAC 

GGTGATAGGCCTGGCTGTGGCCTTGGTTTTGTACATCAGGAAGAAGAAGAGGCTGGAGAA 

GCTACGCCACCAGCTCATGCCCATGTACAACTTCGACCCCACGGAGGAACAAGATGAGTT 

GGAGCAGGAGCTGCTGGAGCATGGGCGGGACGCCGCCTCTGTACAGGCTGCTACTTCTGT 

GCAGGCCATGCAGGGCAAGACTACTCTGCCCTCCCAGGGCCCACTCCAGAGACCCAGCCG 

GCTTGGTGTTTACCCGATGTGGCCAATGCCATCCATGTGTGAGTGGCCTGGGACAAGC 

Sequence 1753 

GTCGCCCCGCGTCCGGTGCTCTCATGTCATCTCAGAGTTCCAGCTTATCAGAGGCATGTA 

GCAGGGAGGCTTATTCCAGCCATAACTGGGCTCTACCTCCAGCCTCCAGAAGTAATCCCC 

AACCTGCATATCCTTGGGCMCCCGAAGAATGAAAGAAGAAGCTATAAAACCCCCTTTGA 

MGCTTTCATGAAGCAGAGGAGGATGGGTCTGMCGACTTTATTCAGAAGATTGCCAATA 

ACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATCTCCCAACCTCAGG 

AGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTCTCAGCAAATCAACC 

TTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACTTCTTGAAAGTGATCG 

GAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCA 

Sequence 1754 

TCGCCCCGCGTCCGGACTGATCATAAACCATGCTGGTATTGCACCTTCTGGAACTATGGG 
CTTGAGAAAACCCCCAGGATCACTTCTCCTTGGCTTCCTTATTTTCTTGAGGCAGGTCGC 
ACGTTCTACCTGCCCAAGACGTGTGATATCAGCTTCTCAGATCCAGACGACCTCCTCAAC 
TTCAAGCTGGTCATCTTGTCCTGATGAGGGCtTCTACAAGAGTGGGAAGTTTGTGTTCAG 
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1TTTAAGGTGGGCCAGGGTTACCCGCATGATCCCCCCAAGGTGAAGTGTGAGACAATGGT 

CTATCACCCCAACATTGACCTNGNGGGCAACCGTCTNGCCTCAACATCCTTCAGGAGAGG 

ACTGGA MGCCAGTCCTTACNATANACTCCATAMTTTTATTGGGCCTTGCNGGTATTCT 

TTTTTTNTTTGNGAGCNCCAAACCCCNNGANGGACCCCCACTTGGAACCNAAGGGAGGGC 

NCGCAAGAAGGGTTCCTTGGCAAGAAACAAACCCGG 

Sequence 1755 

TCCGGCCCGCCCACCCGCCGCCAGCTACCATGGATGATGATATCGCCGCGCtCGTNTGTC 

GACAACGGCTCCGGCATGTGCANGGCCGGCTACGCGGGCGANGATGCCCCNCGGNCCGTT 

TTCCCTCATCGTGGGGCGCCCCANGCNCCAGNTGCGTGATGGTGGGCATGGGTCAGAAGG 

ATTTCTATGTGGGCGACGAGGNCCAGAGCAATGAGAGGCATNCTNACCCTGNATTACCCC 

ATCAGAGCACGGCATNTNTCACCAACTGGGACCGACATTGGAGAAAATCTGGCACCACAC 

CTTTCTACAATGAGCTGCGTGTGGCTCCCGAGGAGCACCCCGTGCTGCTTGACCGAGGCC 

CCCCTGAACCCCCAANGNCAACCCGNCGAGAANGATNACCCCAGATNCATGTTTTGAGGA 

CCCTTTTCAACACCCCCAAGCCCATGGTTACCGTTTGCCT 

Sequence 1756 

GCGNNGGGGCCGGGGCTGGGGTCGGGGCCAGGGGTCGGCCGGNGGCGGNGGACGGGCGTN 

CGGTGCCTGGGCTCTCTCCGCTCCCTGAGGCGCCGCTGGGGAGTCGAACCCTGATGATGG 

GACCTGTGAATTCAGCGGGGTGCCAAGTCGTTTTCTGTGTGGGTTGAGAGACAGGCTGNG 

CAGCCCACTGTTGCATAGGACTMCTACTACAAATCATGCTGAGACCGAGCTATTTTTGC 

TGCTTAGAGGCTTTGCAGCCTTGAGAGGAATTTTGTTGTCAGGAGAATAAAAGGAGGTTG 

TCCATNATTGACTTTAAGCAGCAATCAAGTAAAACATTGAGCTCTTNAGCTCCGCCTTTC 

TTGCTCTGAAAATTGGAAAACCAANAAGGTTTTGNTGTTATTGTGTGACCCCACCTGAAT 

TATAMCCAAGATGAACATMCCAAAGGTGGTCTNGGTGTTGTTTTTCAGCAAAAC ' 

Sequence 1757 

GGATCTACCANGTGTCTNGAGCAGCAGGAGAGAGCCTTCACGCTGGGGCTAGCAGGACTT 

CTCGGCGAGGGAGNNTTTAACTTTGGAGAACTNCTCATGCACCCTGTGCTGGAGTTCCTG 

AGGAATACTGACCGGNAGTGGCTGNTTGACACCCTCTATGCCTTCAACAGTGGCAACGTT 

GAGGCTGGTTCCAGACTCTGAAGACTGCCTGGGGCCAGCAGCCTGATTTTAGCANCTAAT 

GAAA GCCCAGCCTTCTTGAGGAAACATTTCAGTTTGTTTGTTGCCTCATTGGNAGATGAC 

TTTTCACACCGACCCTGGCCNATTCACAAGACANACTTNACTTTTTTGAAGGAAAAATTG 

CCAMAMGTTGGCCTAAAAAAATCNACAGGTTGAMTGGMGGGTGNGMGNCTTTTCT 

GGGGTGGATTGGAAANGGGCACCCTTTTTCGGNTNGGGGGGCNTGGGT 

Sequence 1758 

GAGCCGGCTGGCTCGAGTGGCCTTCGTCGTCCCTTGGCGCCCTGGGAGAGTCGCTGACGG 

GTGGACTGACGGACCGCCTGAGGACGGCCGGCCAGGGCGGTGAAAGCGCCAGCCCTATGG 

CGCGGGTCGCAGTGAGGCGGAAGGCCGAGGACGGCCGGCGGCGGCGCCCGCCCCGGCGAT 

GCGGGCCCCGCCCGTCGCCTNAGGTGCCATTTGGATTGTACTTTAGTGGCACCGATGTAC 

TCTGAGTGGAGGTCACTGCATTTGGTGATTCAGAATGATCAAGGCCATACCAGTGTGCTG 

CACAGCTATCCAAAGAGCGTTNGACGAGAGGTGGCAAATGCTGTAAGTCCGGCCTCTTGG 

GCAGGTGTTAAGTACCCCTTCAGTGGCTGGTAGTGAGAATTTGTTAAAAACTGACAAAAG 

AAGTA 

Sequence 1759 

GTCGACCACGCGTCCGGTCTCGGCGCCCGCTGCCCTCTCACCGCCCCACGCAGGATCCCG 

GCCTGGTCACCGGGCAGTGTGATGCTTCCCGACTGCCGCGGGGACAGCGAGGCACACACA 

GGGCTTGGGCCGCGCCGGAGGCCACACGGCCTGGCTGAGTTGCTCCTGGTCTCCCGCCTC 

TCCCAGGCGACCCGGAGGTAGCATTTCCCAGGAGGCACGGGCCCCCCCAGGGGGATGGGC 

ACAGCCACGCCAGATGGACGAGAAGACCAAGAAAGCAGAGGAAATGGCCCTGAGCCTCAC 

CCGAGCAGTGGCGGGCGGGGATGAACAGGTGGCAATGAAGTGTGCCATCTGGCTGGCAGA 

GCAACGGGTGCCCCTGAGTGTGCAACTGAAGCCTTGAGGTCTCCCCAACGCAGGACATCA 

GATTCCTCATGGTGCAAAATGGCCATTCCAAGCTTCCATCCAGCCATCACATCACAGGAG 

GAAGGGAAGA 
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Sequence 1760 

CCGATGTCTACAAGCTCGTCAAAGACTAGGGTGCCCTCTGCGCCTCCTTAAGGATGCAGG 

GTGAGCATCTCCTCTCCACACCTGCCTGGCACCCCTGGGGGGGTCCAGGATTGAGGATTC 

ATCTACCTGCCAGGCCTCAGGCCCAGGACCCAGGAGGCCTCCCCACCTACCCCAGCACAC 

ACACTCCCTGCCACTGTTCTGCGCT7TAATTGTGGGAGAAGAGAGGAGAGGAGGGCTCAG 

CGGTGGGGCAGCCTGTCCGGGGCGCTGACCCACCATCACCCTGCTCTGCCCAGCCTCGCG 

TGACCTCAGAGAGGTGGGGATAGGGGACACCTTCAGCCTCCAGCATGTGTGGCCACTTGT 

ACCCCCACCCACCCTTGGGGGAGCATGATGGGCAGGTTGAGGGGCAGGATGGAGACCAAG 

GGGAANTCAGTGAGCAGNAGGCCCTGGGGAAGTGTCCGGTCGGGGTTGGACTTGAGGGAC 

AGAGGGGCCCAACACTTTCTTGCCCCTTTTG 

Sequence 1761 

CCGCCAGGCCCACCATCACCACCGCCTCTGGCCGNCACCCCCATCTTNCACCTGTGCCCT 

CACCACCACACTACACAGCACACCAGCCGCTGNAGGGCTCCCATGGGCTGAGTGGGGAAG 

CAGCTTTNCCCTGGCCTCAGTTCCCAAGCTAACTGCCACGTCCACCCACGCATACACACA 

TGCCCTCCTGGACAAGGCTAACATNCCACTTAGCCAGCACCCTGCACCTGCTGACGTCCC 

CACNTCCCTTGGTTGGTGGGGACATTGCTTCTCTGGGGCTTTTTGGAATTGGGGGGCGCC 

CTTCTCTGCTCCTTTCACTGGTTAGNCTCTGGCTACCNATAAGCANGAGGCCNTGGGAAG 

GGGTTTCTCCCCTGGGCCCTTAAAAAAAGGNGGGCCCCAAAGC 

Sequence 1762 

AGTCGACCCCGCGTCCGGCAAGCGGGACCGGTAGGGiGCCGGAGCATGCGGCGGCGGCGCT 

CGGACTGTCCCATCCGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGA 

TGGTGAAAGCGGGGCCGTGAGGGGGGCAGGNGCCGGGAGCCGGACCCGCAGTAGCGGCAG 

CAGCGGCGCCGCCTCCCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATC 

GGGCCGCCGCCGCCATGGAGCTGAGAGTCGGGAACAGGTACCGGCTGGGCCGGAAGATCG 

GCAGCGGCTCCTTCGGAGACATCTATCTCGGTACCGGACATTGCTGCAGGAGAAGAGGTT 

GCCA 

Sequence 1763 

CGCGTCCGGGGTGATTTGAGCTCATCTTGTTGAGCAAGGGGAGTGAAACCACAGAAACTG 

TTAM TTGMCGTAMTAACTTTGGAAAACAGTTTTGACACTAGAACAAGGTCGCTTCTC 

TTTTCCTCTTCTCTCCCAACCTGTTAACCTAGTGCTAGATAGTCAGCTGCTGTTGCAGGT 

AGAMAGCTCACCAATGCCACATTTGTTCATTCTGTTCAACCTGTTTTGACCTCCCAGGC 

ATCTGMCAGGATGACATCATCTTATTATATCTTACTTTACACATACGTGTATGTACACA 

CACACACACAATACAAGACATTTTTCCTGTTTTAGAAAATATAGCCCTTGTTTGGATTAC 

TGCTGTCTGAGCACTAGAAATTTCTAATGGAAAGAGGCCTCTGAATGGCTAAGGGAACAT 

CTGGAAAGGMGGAAAATGAGCCTNAAGGTTTTTGGTGCTTGGI I I I I GGTTCTTTTTCT 

TTCNTGGTTCC I I I I I IGGTTT 

Sequence 1764 

AGTCGACCCCGCGTCCGCGGACGCGTGGGCGGACGCGTGGGGGCCCTCCGGGAAGATGGC 

GGCCGTGCAGGCGGCCGAGGTGAAAGTGGATGGCAGCGAGCCGAAACTGAGCAAGAAGTG 

GTGGTAATCATTAGTTCCAGGGTGCTCTGCCATGTTGACGCAAGCTGCTGTAAGGCTTGT 

TAGGGGGTCCCTGCGCAAAACCTCCTGGGCAGAGTGGGGTCACAGGGAACTGCGACTGGG 

TCMCTTGCTCCTTTCACAGCGCCTCACMGGACAAGTCATTTTCTGATCAAAGAAGTGA 

GCTGAAGAGACGCCTGAAAGCTGAGAAGAAAGTAGCAGAGAAGGAGGCCAAACAGAAAGA 

GCTCAGTGAGAAACAGCTAAGCCAAGCCACTGCTGCTGCCACCAACCACACCACTGATAA 

TGGTGTGGGTCCTGAGGAAGAGAGCCGTGGACCCAAATCAATACTACAAMTCCCGCAGT 

CAAGCAATTCATCAGCTGAAGGTCAATGGGGAA 

Sequence 1765 

TCACCCCGCGTCCGACTTGAATCCCGTCAGCTTAAAACTTGTGTAAGGGAATCCTGACTT 
TTAAMTGTGAGGGTATTTGGATCTGTGTTGAAAGTCGTATATTTTTATCTGTGCGGTGC 
TGAGTGCAGGCCACCAGCTCCTAAATAGAGGTTCCCTATATGCGCGTATGACATGGTGAA 
TAAACACAACTCTCTCCACTCAGGACATCCGGAGCGTTATGGACCGTGGTAGGTGGTCGT 
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TCTGTGTGCTTGTGAAAGTGTCCAGGCGTGTGCACAGCCAGTGCGCCCACTTCCGGGCTC 

CTTGCTCCCTGCTGTACTGMGTTTTGGATTTTGCATCCAATCCTGTGTGCCTGCCCTTC 

T 

Sequence 1766 

GTCGACCACGCGTCCGGCTCCCGAGCCTGGAGAGCTCGGACTGCGAGTCCCTGGACAGCA 

GCAACAGTGGCTTCGGGCCGGAGGAAGACACGGCTTACCTGGATGGGGTGTCGTTGCCCG 

ACTTCGAGCTGCTCAGTGACCCTGAGGATGAACACTTGTGTGCCAACCTGATGCAGCTGC 

TGCAGGAGAGCCTGGCCCAGGCGCGGCTGGGCTCTCGACGCCCTGCGCGCCTGCTGATGC 

CTAGCCAGTTGGTAAGCCAGGTGGGCAAAGAACTACTGCGCCTGGCCTACAGCGAGCCGT 

GCGGCCTGCGGGGGGCGCTGCTGGACGTCTGCGTGGAGCAGGGCAAGAGCTGCCACAGCG 

TGGGCCAGCTGGCACTCGACCCCAGCCTGGTTGCCCACCTTCCAGCTGACCCTCGTGCTG 

CGCCTGGACTCACGACTCTGGCCCAAGAT 

Sequence 1767 

CCGGAAGAGTGCTTTATTGTGAAATTATTTAAAACTGTCCTTTAAAAGAAAAAGAGGAAA 

CGATGMCAAAAACTMTCTAATTGCCAAGTTAGAATTCATNATTTAATTTACX^TCCTAT 

GCAATGATTAATGCTGCAAAATGTATGGTTATGTTACCGTATATTCACAAAAGAAATATT 

ATTACAAGGTTTCAGAGGTAGCCAATTGCATTCTTTTGGAMTTTACTGTACTGTTTCAA 

TGTGTTAAGTGCCTTGTTGTAAAGTAAMTTTTAAGTCTAGATTCATTATTTTCCTGACA 

TATA TTnTCATTATGATATCTACTGTATGCTATTGTGATAGTTTTATGAAATGCTTACA 

TTTTMTCAAATATGTAAATTTCAGAAGCTCTTTTTTCCTACCCACCAGTACCTTAATCA 

TTTGGTTTATCACATTGGATTCAAATTCAGGTTCC I Nil I G ATAAAGAGG AAATTTGTT 

T 

Sequence 1768 

CCGGTGGGATAGTAAAAGAGAAGACGCGGAGAAGAGGAGAGGACCTACAAGAACGGAGGA 

CAGGGGCGCACGATGGTCCCGGGGGGAGCGGAAACAAAGGNACGCAAAACGGAAAAGCGT 

GTGTAGGGGAGCGGAAAAGGAAGGTCACCACCTGTGGCCTGCGACCGAAATGGCGAAAAG 

TCTTT TGAAG ACAGCCTCTCTGTCTGGAAGGACAAAAATNTGCTACATCAAACAGGATTG 

GTCACTTTTA TAGTA CATCCCATGGTATGTTTATGNAGGGAAGGAAGTGNAAAANNAACA 

CCTTCAGCAGTTTTCCTGGIsn"GGNATTCCATTGNACCTNTNAGNAAGGGAAGNACAANTG 

G GGCATT TGGGNCATTTCTTTAC^^^CTTGAACCMTTCCCMGTTAGGAATGGAAATGCC 

CI 1 1 1 1 I CAGGTTGTTAATTGNATGCTACAAACTTCTTGGNAAAAAAGTTAATTTGGAAT 

TTGGGAAAAAATTTGGGAACAANAGGGGGGAAAAAGGCCTCCATTTGTNCCCGGTGGGGG 

GCCAAAAAAATACCTT7TCTTC 

Sequence 1769 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGATTTG 
TGTACAAATTGAAATGTCTGTACTGATCCTCAACCAATAAAATCTCAGTTATGAAAAAAA 
AAAAAAAAGG 
Sequence 1770 

GTCGCCCCTCAGCCGCTTCCCCTCGCCATGGAGGCGAGGCCGCCGCCGCCGCCGCGGGGC 

TCGGAGCCGCGGGCCGGGCGGCGGCCCTGAGGGCTAGTGGCGGCCCGAAACGCCGCCGCG 

GAGCCGAGGCGGAGCCGCTGTCCTCGTCCCCAGCGGTCCCGCCCAACGCCCGACTCTGTG 

ACACAACTAAAAAAAAACAAAGGTATTTATGGAATTCCACTGAGTGGTAATGGATGATGC 

AGTTCAAATAACTAAGATAAGCATGGATTCCTGGTTCATTCTTGTTCTGCTCGGCAGTGG 

TCTGATATGTGTCAGTGCCAACAATGCTACCACAGTTGCACCTTCTGTAGGAATTACAAG 

ATTAATTAACTCATCAACGGCAGAACCAGTTAAAGAAGAGGCCAAAACTTCAAATCCAAC 

TTCTTCACTAACTTCTCTTTCTGTGGCACCAACATTCAGC 

Sequence 1771 

GCGTCCGGGCCACATGGCCGCCGCCGCCGCTTGGAGCTGAAGTGCCGCCGCCGCCGGGCA 
GCCACGGGGAATCCGCCCGCATCGCCGCCCTCGCCGGCCGGGCGGCCGTGGGGCCCAGAG 
CGCCGGAGGCCAGGGCTGGGGCGGCACCGCGCAGCGGCCACGGGGTCCCGTTAGAGCAGC 
GCCCGGCGGCTATGCCGAGAGCCCGGAGCGGCCGGAGGAGCAGAGGGGCCGNCGGGAGGG 
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AGGAAACCTTTCTGCGAGTACGAGCCAACCGGCAGACCCGACTGAATGCTCGGATTGGGA 
AAATGAAACGGAGGAAGCAAGATGAAGGGCANAGGGAAGGCTCCTGCATGGCTGAGGATG 
ATGCTGTGGACATCGAGCATGAGAACAACAACCGCTTTGAGGAGTATGAGTGGTGTGGAC 
AGAAGCGGATACGGGCCACCACTCTCCTGGAAGGTGGCTTCCGAGGCTCTGGCT 
Sequence 1772 

TNGATGTGTATAT GGACT GTTNTGMGGGTTTTTTTCTTTATAGCCCANTTMGTTNTGT 

TTGGCTCGGTGCATTTTTCATTTATTTMTTAGTMTTTAAGTAACNGTGT^ 

ATCATTGTG AAGTT TCAAGATTCATTATGGGANGAGTTGATGGTNCANTNANGCATGATG 

GTTTMCAA ATTTT AACACCAAAAATGTTAAATCCTGCATAAATTCAACTGTANATAATA 

AATANGGTGTTTTCNTGTATATG ATAG N AATG C AATT AG AAGTACCTNTAGTAAANTCTT 

TTGGMTCACCAATNCTTTTTTGGCTTGAAMTTGGGAAAGMTTTCTGTTTATAATNCC 

TTTNNAATTAAACCTTGNGNGGGGGGAGGGGGAAAATAAAAAATTGCAGGAAAAACCTGC 

ATGAGNCACCTTMNAACCTTTAAMGTMGGGGGCTTNCCAATCT7TTTANTCCCNGGA 

AAACCCTGGTTGCCTN I I 1 1 IGGCA 

Sequence 1773 

CGTCCGTTTAAAGGCTCTGACTCTTGATCTTGAAAGCCGGACGCGGCACTGGCACTCGGC 

TTCAGTTTCCACTGTGACAGATGGAGGTCTCCTTTCGCCCCAGCCCAGGTGGCCAAGCCC 

ATCCTGGCCTCAGMCATGCTGAGCACATTTTGTAGGGTGGCACCTTTTTATCCMGTTA 

CTAGCTACACATCAGTGTTTAAAGAGAAAAAAGTGACCTTTC Al I I I l I I I i I ICTTGAA 

ACTTG AGG AAACAAGATACATACTACTG A 1 1 I I II 1 1 1 I I I CTTAAAACTAAATGCATGA 

CTGCAGAGCGGTAGAGGTGTATATTTTTCATACTGTGGGGCAMGTATTTGTGCTGCTTT 

TTGGAGATGGACTGGAACGTCTGGTTTCTGTCCCCGGGCCCGGCAGCTACCGTCTATTTT 

CTGTAGAAGGTGCCACAGTGAGACCTGGAG 

Sequence 1774 

CCCCGCGTCCGCTTCCGGTTGCTAACGGTTCCCAAACAGCCCCCGAAAACGCTACGTGAG 

CTGGGCCCTGGGCCAGAGGCAGAAAACGGACGGAAGAAAAGGTCTGGCCGGAGATGGGTC 

TCACTCTGTCACCCAGACTGGAGTGCAGTGAGTGGTGCGATCATAGCTTACTGCAGCCTG 

AAACTCCTGGGCTCAAGTGATCTTCTCGCCTCAGCCTCCTGAGTAGCTGGAGCTACAGGT 

GTGAGCTACCCAGCATGGCTCATTTGAGATTTCTGAGTAGAGAAGTAACATGATTAAAC 

Sequence 1775 , 

GGMCCTCCCCTAGATTTCAGAAGATGTATGGAAACGCCTAGATGCCCTGGCAGAAGTTT 

GCTGCAGGGGCGGGGCTCTCATGAAGAACCTCTGCTAAGGCAGTGTGGAAGGGAAATGTG 

GGGTTGGAGCCCCCAAACAGAGTCCTTAGTGGGGTGCTGCCTAGTGGAGCTGTGAGAAGA 

GGGCCATCATCCTCCAGACCCCAGAATGGTAGATCCACTGACAGCTTGAACTGTGCACCT 

GGAAGAGCCGCAGACACTCAATGCCAGCCCGTGAAAGCAGCCAGAAGGGAGGCTGTACCC 

TGCAAAACCACAGGGGCAGAGCTGCCCAAGACTGTGGGAACCCACCTCATGCTTCAGTGT 

MCCTGGATGTGAGACCTGGAGTCAMGGAGATCATTCTGGAGCTTTAAAGTTTGACTGC 

CATGCAGGATTTCGGACTTGCATGGGCCCTGTMCCCCTTTGTTTTGGCCMTTTCTCCC 

GTTTG GAAC GGCTGTAATTACCCAATACGTGTATCCCCATCGTATCTAGGAAGTAACTAG 

CTTGCTTTTGTTTTTACAGACTCATAGGTGGAANGGACTTGCCTTGTCTCAAATGAGACT 

TTTGGACTGTGGACTTTT 

Sequence 1776 

GCGTCCGGMCTTTATAAGAATTTATGCCGTTNrTACATGMCCGTTAAGTTTTGT ACTTG 

ACGTTTCTGTTTATTANGCTAAATTGTTCCTCAGGTGTGTGTNTATATATATACATATAT 

ATATATATATATATATGTATATATATACACATATATACGTATATATACATATATATGTAT 

ATGGAGTCTCGCTCTGTTGCCCAGGCTGGAGTGCAGTGGCACGATCCCAGCTCACTGCAA 

CCTCCGCCTCCCGGGTTCGGGCGATTCTTCTGCCTCAGCCTCCCTGGTAGCTGGGGCTGC 

AGCCATGTGCCACCMGCCCAGCTMTTTGTATTTrTGGTGGGAGACAGGGTTTCACCAT 

GTTGGTGAGGCTGGTCTGGAACTGACCTCAAGTGATCTGCCCACCTCGGCCTCCCAAGGT 

GTTGGGATTMCAGGTGTGAAGCCACCACGCCTGTCCCAGTATATTGTTTAACAAGTTTA 

TTTTTGGGTGAAAAATTCTCTTTAATGGGAAGA 
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Sequence 1777 

CCACGCGTCCGGCGCCCCCTCCCGGCCGCCATGTTGGCTGGTGTGTGGGTGTCAAACTGA 

GCCAGACGCGGCGGTGGCGGCGGCTCCGCGGGCTACGGTCGCTCCCGCCTCTCGAGCGCT 

GCCGGTGGCCGCAGCGGCGCACCCACGCCGGCCCGGAGGAGCAGAGTGCTAAGTGCTGGG 

TGCTCACTGGTGATGAGGCAGATGAAGGTTACCAAACTTGTGGACAGGAGCCTCATATCA 

GAGACGTGGACCTCACTGTAGCCTGGTCATGGCTTCCAGCTTTTCGAATCTGAGGCTCCA 

AAGGAGGAAATGACCATTCAGGGATCTTACTCCAGCTTGATTACGGAGACTGAACCTTCA 

TAGGGTGCGCACTTACCMGGACAGGAAGGTTTCTCTGTTTGAAGGGCTTTAAACTTATA 

ACAAAGAAAATAA 

Sequence 1778 

CGCGTCCGAGACAGTTCATACTGGAGACAAACCCTACAAATGTAATGAATGTGGCAAAAC 

CTTTAAACGGAACTCAAGCCTCACTGCACATCATATAATCCATGCAGGAAAGAAACCATA 

TACATGTGATGTATGTGGCMGGTCTTTTATCAGAATTCACAACTTGTAAGGCACCAGAT 

AATTCATACTGGAGAGACACCTTACAAATGTAATGAATGTGGCAAGGTCTTCTTTCAACG 

TTCACGTCTTGCAGGGCACCGGAGAATTCATACTGGAGAGAAACCCTACAAATGTAATGA 

ATGTGGCAAGGTCTTCAGTCAACATTCACATCTTGCAGTGCATCAGAGAGTTCATACTGG 

AGAGAAACCTTACAAATGTAATGAATGTGGCAAAGCCTTTAATTG 

Sequence 1779 

NCCCCGCGTCCGCTGTTGGAGCAGTAGACTTCTCACATCI I I I I GTCACTTCATCGTTTG 

ACTGGACAGTAAAGCTTTGGACAACTAAGMTMCMGCCTTTGTATTCATTTGAAGATA 

ATGCAGACTATGTTTATGATGTTATGTGGTCACCTACCCACCCAGCCCTGTTTGCCTGTG 

TGGATGGCATGGGGAGATTGGATTTGTGGAATCTCAATAATGACACAGAGGTACCAACTG 

CCAGCATTTCTGTGGAGGGTAATCCTGCTCTTAATCGTGTGAGATGGACCCATTCTGGCA 

GAGAGATTGCTGTGGGTGATTCTGAAGGACAGATTGTTATATACCGATGTGGGAGAGATT 

GCTGTTCCCCGCAATGATGAATGGGCACGGTTTGGCCGAACACTTGCAGAAATTAATGCA 

AACCGAGCTGATGCAGAGGAGGAAGCAGCTACCCGAATCCTGCTTAGTTCCTGAAAAGGG 

GAGTGTAACTAGTGGATTGGGAAAGGGTC 

Sequence 1780 

AGTAGGAACCAAGMAACTTCTTTTGCCAACTTTACAGGATATCTGGTAAACTATTACAT 

NGTCAGGCCAAACATGCTCCTTGCATTTTTGTTGGCTGAATNTGGGTACAGAGTGGTTCT 

ATACGATGGTAATAACCAACTTGNAATCAAAGGAAGNATTCCAACAGAAACAGATAGGAN 

AANGTCTTGAGAAGATATATNAAGGAATNTGTCACTTGTCACACNATGCCCGATCACCGG 

GACACAAATTCCTGCAAGAAGGGACCACACCGGACTCTATTTTCCTACAAGNGCAGGAAA 

CCTTGTTCATTCTAAGCATGGTTTCTGNTTGCCCAGGTATTCAAAAAA 

Sequence 1781 

ACCCACGCGTCCGGCTGCGTTGGGCTTGCGTGCGGCTCGCTAAGACTATGGCGTCCGGGC 
CTCATTCGACAGCTACTGCTGCCGCAGCCGNCTCATCGGCCGCNNCAAGCGCGGGCGGCT 
CCAGCTCCGGGACGACGACCACGACNACNACCACGACGGGAGGGATC 
Sequence 1782 

CCGCGTNCGTTTGTGTTGMTGGNTTGTATACTTCTTTACACAACCTATCCATTACTTAA 

GGAATCTGCTCTTATTCTTCTACAAACTGNTCNGGAACAAANTGATATCAGAAATTNGAT 

MMGMCTTCGMATGNTTGAAGGAGTNGAGGAAGNTCATTGAATTNCATGTTTGGCAA 

CTTNGCTGGAAGCAGAATCATTTGCCACTG 

Sequence 1783 

GTCGACCCCGCGTCCGGGCCCGTCTACAAGGNTTGTAGATAAAATAGAAACATACCTTCC 

TTGAAMTGCAGMTAAATTTTTTAAAGGCAGGAAGGAAGTGTTTGAACCATGTGTCAAC 

AAGCTTTACTGTCAMGCAGGCTTTTGGTATGGGAAGAAAAATACTTATAAATACTNGTN 

TTMTATTTGCTTTATTAAAATACATTTAAAATACAGCATTTTTAAATCTCTAAGCTCAA 

CTTGMGATATAAGAACAGTAAATTTGATAAAAATGAGAMTTACATTCCCATTTCTTTA 

AC AATT TGTAAATTCCAATTATCCTGAACATT TAAC ACCATTTACATATTTTATTAATCA 

CATTTTCTTAAACATTTGATAAGAGATTTAATATTTTGATCCAACTACCA 
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Sequence 1784 

GCGTCCGCGGACGCGTGGGTGTGGAGCCTGTTACCTCCGCATTTGACACGA6TCTGCTGT 

GAACCCCGCAACCTCCTCCCCACCTCCCATCTCTCCTTCCAGGCCCATCCCTGGCCCAGA 

GCAGGAGGGAGGGAGGGACAGATGGCGGTGGGTTTTTGTATCTGAATTTGCTGTCTTGAA 

CATAAAGAATCTATCTGCTGTTGAAAAAAAAAAAAAAAAAAAGGnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnTCACMATAMGCAnnnnnnnnnnnnnnnnnnnnnnnnnGTTTGTCCAAACTCATC 

MTGG TATCTTATCAATGTCTNGGATCCCCCGGGTACCCGANGCTCGAATTTAATTTCCT 

TTTTTC 

Sequence 1785 

ACGCGTCCGGCTTCGTTCTCCCGTCCCTCCCCGCGCCTTGGCGCGGGGGGTCGACTAGCC 

MGTGAGGCGGGAGGCGACTCGGACCTTTCCCTGCATTTCGTTTCGGCCAGTGCCGGGGG 

CTACCCGCCCTGGGGCCTGGGATCCTTGGGGCCCGTGAGGCCCACTCTTAGCGGCCGGGG 

CCTACCGCGGCCCGCCGCTGGCCCTCATGAGGCATAGCCTGACCAAGCTGCTGGCAGCCT 

CGGGCAGCAACTCCCCAACCCGCAGTGAGAGCCCGGAGCCGGCTGCAACTTGTTCGCTGC 

CCTCTGACCTGACCCGGGCTGCAGCGGGGGAGGAGGAGACGGCGGCGGCNCGGATCTCCC 

GGCCGCAAGCAGCAGTTTGGCGACGAAGGAGAGTTGGAAGCCGGGAGGGGGGGAAGCCCG 

CGGCGGCGTGGCCGTGCGCGCGCCCTCCCCCGAGGAGATGGAGGAGGAGGCGATCGCCAG 

CCTTCCGGGGGAAGAGACGGAAGGATATGGAC I I 1 1 1 1 I CTiSGGCTGGA 

Sequence 1786 

ACGCGTCCGGCCAGGCAGTGATGGAATCCCGGGGTCNGCAGGAGAGAAGGGTGAACCAGG 

TCTACCAGGAAGAGGATTCCCAGGGTTTCCAGGGGCCAAAGGAGACAAAGGTTCAAAGGG 

TGAGGTGGGTTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCA , 

AGGATTCATGGGTCCTCCGGGGCCCCAGGGACAGCCGGGGTTACCGGGATCCCCAGGCCA 

TGCCACGGAGGGGCCCAAAGGAGACCGCGGACCTCAGGGCCAGCCTGGCCTGCCAGGACT 

TCCGGGACCCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAGGTGACAAAGG 

AAATCCAGGCTGGCCAGGAGCACCCGGTGTCCCAGGGCCCAAGGGAGACCCTGGATTCCA 

GGGCATGCCTGGTATTGGTGGCTCTCCAGGAATCACAGGCTCTAAGGGTGATATGGGGCC 

TCCMGGAGTTCCAGGATTTCAAGGTCCAAAAGGTCTTCCTGGCCTCCAGGGAATTAAAG 

GTGATCAAGGCGATCAAAGG 

Sequence 1787 

AAGCTATTGTCACACCTTTGAAACCAGTTGACAACACTTACTACAAAGAGGCAGAAAAAG 

AAAATCTTGTGGAACAAGTCCATTCCGTCAAATGNTNGTTCTTCCCTGGAAGTTGAGGCA 

GNCATATCAAGAAAAACTCCANCCCAGCCTCAGAGANAGATCTCTTAANGCTTTCTGCTC 

AGAAGGATTTTGGAACAGAANAGAAAAGCATCATGGTAANAAATGAAAGCCAAGAGATGT 

G CCCA CTCCTGTTMTTCATTCGATTGAAAATTCATACCCCTTCTAAGAAAAAATGAAAA 

GTTTTNTAAACAACAAAAAAGAAAGCCCNNNAGGCAAGGAAAGGGCANGTGCTTCATNAA 

GNATACCTGNTGNAAAANAGGAAATGGCATCTTCCNCCAAGAAGAAAAAGCCCAANGGGT 

TTGTACCATACCTGGTGCNCTTTGTATGCNCACCCTGCCAMGGCATAMGGTTTTCTAA 

AAAAAGTTACCTTGAGNGAAGCCAANGAAGCCTGGGAGAAAANAGTANTGNAAAAAATGC 

AAGCNAAAGAAGGNTGGGTGGGAAG 

Sequence 1788 

CCCCGCGTCCGAGCAAACATAAGAAACCTGAGTCATTTTGTCATTTAGAGTATTCTGATA 

A AATC TCTTG AAAATACTGAAATCAAAAGGTTAATG A I I I I 1 1 GTTCATTCTGATTTGTC 

ATTTTATTATCTGTCTAGCAGAAAMTCAAATGGGTAMTTAGCACTTTAGACAGCCAAC 

ATAGTGAAACCCTCATCTCTACTAAAAGTTGGCAATTAAATCTGAATTTACAGATACAGA 

TAACAGTTTATCAGMATCATATTTTTCCTGAAGAAMTTTMAATTAGGAGTTGTGGGC 

CTGGTGCGGTGGCTTACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGAGCAGATCA 

CTTGAGGTCAGGAGTTCGAGACCAGCATAGCCAACATAGTGAAACCCTCATCTCTACTAA 

AAGTACAAAAATTAGCCAGGTGTGGTGGCCGTGTGCCTGTAGTCCCAGCTACTCGGGAGG 
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CTGAGGCGGGAGAATCGTTTGGAACCCG 
Sequence 1789 

GACCACGCGTCCGGTCGGCGTCTGGGCCTCGTCCCCTTCTCTCTGTCTCCCTTGCCTCCC 

CCATCACGTCCCCTGACACCGACACCCCATTGCTCCCACAGTCTCCCCAGNCTCCACTTT 

GGTCCCCAGCGCTGTCTGCCCGAGGATTTGCCTGAAGGCTGCCCCCAACTCTGCACCCGC 

CCCCCGAGGGCCACCGAGGACCATGACTAAGACAGATCCTGCCCCGATGGCCCCGCCACC 

CCGAGGAGAGGAGGAAGAAGAGGAGGAGGAGGATGAACCCGTCCCCGAGGCCCCCAGCCC 

CACCCAGGAGCGCCGGCAGAAGCCTGTTGTGCACCCCTCGGCACCTGCCCCCCTCCCTAA 

GGACTACGCTTTCACCTTCTTCGATCCCAATGACCCGGCGTGCCAGGAGATCCTGTTTGA 

CCCTCAGACCACCATCCCCGAGCTGTTTGCCATTGTGCGCCAGTGGGTGCCCCAA 

Sequence 1790 

CGGGGTCTTCTTGCTGTGAGGTCGCGTTCCCCAGTGTTACGGAGGGTCCTTGAGGCAAGG 

AGTGAAAATTGGGTCTGGGGGTTAGTCCTGGGGTGGAGGTCTGGGCACGCCGGGTCGGAC 

CCCCTNCATCTTCGGNTTTGCACACCCCGCTTTCCAGCGCGGAGTCCGCGGCGGGTAGGG 

CNGGCGTCNCGTGCGTGACGTCATCCAGCGGCGCCTNGCGAGGCTNCAGTGGCCTTGACC 

TCCCGCGGNGTGGGAGGCTGCGCGGCGATGCTGCANTTTCGTCCGGGCCGGGGCNGCGGG 

CCTGGCTT 

Sequence 1791 

GGGTTATGAGAAGAACGCTCAGAGCAGAGCACCGAAAGTGGCCACTACCAGCATGAAGAG 

CCCMCAATTCAAACTGGNAGMGTGAGAAAAACAGAATGCAGCTTTCAAGGTTCGTTTC 

AAGCAGTTGGCTTGTGGGACTCTGAGAGATGCTGCTGNCCATGACATGCGGGAATTATCA 

TGATCAACTA CCCAG CTTGGATTTCACCCAGTGGCCAACTAGCTTTGTGTGGGAGACGGC 

AAGGGTTGGATTTTTCAAAAGAGTAAACCAGACCCGTGACCAAGGTGTNAACTAAGAAGT 

GGAGTCATGCTTCACACGGNCNTATCNTGCTGGCAGCCATTCCTGGGTCTGGCTGTGGTG 

TTAATCTTCATGGGATCC 

Sequence 1792 

GTGCTGGTTTNTCTTGCAGATGCTGCTGCTAGGGGTGGTGGGAAGCAGCCGTGGGACGCG 

TGGCCGGNAGCGGNGGTGACAGCCTGGGANNNCGGGGGCTTNTCTTCCTTGTCCTCCTCC 

TNTCCTGTCTATTCCCAGNGNGGGGCGTGGCTGACACTAAAGACTNTGTANNCATCAACC 

CGAGTGCANNTTCNATGGMAATGAAGGTTGCACGTTTTCAAAAAATACCTAATGGTGAA 

MTGAGACAATGATTTGCTGTATTGNCANCCAAAAAAAGCAAAGGGNATTNCCANTCATT 

GAAGNTGCAAGCATTCTNCAAGCTGATCTTCNAAATGGGTCTAAACAAAATGTNAAGNTA 

AGTTCNTNAGGCNAGCCCTTTCATGGmNGAAATGAGNTTTCGrsmTTTATGTGAM 

TGGAGGCCNCCNTGTNGGGAMGAAANGNGTTTNTTTTCNCCAGGTTTTNAAAA 

Sequence 1793 

GTCCGTTTTACAACCTAGTAATAATGTGGATAAATGTATCTACATGACACATGTCAAGAC 

CAAAATAACTGTGAATGACACACCTTGCTGTAAATGAACTGTGCTAACCCTGACTGTGGG 

CTTGAGAACAAAGATGAACTCTAGAACTCTAGCAGCCTAACTGCTGCTTCTCAAATAACT 

GTGTGMCAGTGAGATATTACTGTTTGTTTCTAAAMTCCTACTGTGCCCAGTTTCCTTC 

ACTACATGCCCTGCATTTTTTATTTAAATATTTAGCTGTAGCGCCATCAGATATGGATGC 

CTTCTMCAATTGCTGTTTGTAAAATAAATCAGGATGGTAGAAAGTGATTATATGGAAAA 

TTGGMCCTGGATGAGACCTTTTCGTTGAATTCTGAAGAGTAATGATGTGAAAATTGATA 

CAGGGCAAGAGATGATTCTT1TGGGTTTTCTTCTACTTCATGTCCAGAAGAGTAAGAGGG 

GAAAA 

Sequence 1794 

TNGTTGCCTGCAATACTACACTTTACAMCAATGTTAACACTGTGATTCCTTCATTGTTT 
TAAGAAGTTAACCTAGGGCCGGGCATGGTGGCTCATACCTGTAATCCTAGCACTCTGGGA 
GGCCGAGGCAGGAGGATCCCTTTAGCCCAGGAGTTAAAGACCAGCCTGGGCAACATAGGG 
AGACCCTGTC1 I I I I I I I GGGCAGCGTGGTGGGGGATAAATAAATAAAAGGAAAAAAAAC 
TTAGCCTAGAATTAGAATTMTTTAATTGAATTCATCTAAAGATGTCTCTGGTGA'I I I Tl 
ATATGTTCCGCTATATAATTGATGCTTTATAGTTTTATCATAATCCAACAACTTCAGTTA 
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TATTTMTTATTGGTMNGGAGTTTAAGACTNGAAAGACTAGAGTGCTTTCTAGTCCAAA 
TAGAGGGTCANGTGAAACCAGCTTTTTGACATCMGATTTTCATTTTGAGAAGGGANAAG 
CCTGTGGGACTGGGCTTAAA 
Sequence 1795 

GTCCGGAGAAAACTGCAACTTGCCCAGNTCCTCTTCTCCACCAATAGATGCAGCATCCGC 

AGAGCCCTATGGCTTCCGTGCCTCAGTGTTATTCGATACAATGCAACACCATCTAGCCTT 

GAATAGAGATTTGTCCACACCTGGCCTGGAGAAGGACAGTGGAGGGAAGACACCTGGTGA 

CTTTAGCTATGCCTATCAAAAGCCTGAGGAAACAACCAGGTCCCCAGATGAAGAAGATTA 

TGACTTGAGTTATGAGAAGACCACCCGGACCTCAGATGTGGGTGGCTATTACTATAGAAA 

ATAGAGAGAACCACAAAATCTCCAAGTGACAGTGGCTACTCCTATGAGACCATTGGGGAA 

AACTACCAAGACCCCTGAAGGATGGTGACTATTCCTATGAA 

Sequence 1796 

CCGGCNTAGGCGGGGGGAACACGCCGCCTGCGCTCTCTTGGGACCCTAGATTTGGGGGAG 

GAGGTAACGAGAGGCGGAGAGGGTGGCTCCTCAAATATACACCCCTCCTGTCCTCCGCCA 

CCCCACCTTTGATTTCTCTTCCCTCAACCCAGCACTCCAGCCCCACCCCAGGGTCAATTT 

TTGCCCCCTTCCATCTGAGCAGTGTTACCAGGCCCCAGGGGGACCGGAGGATCGGGGGCC 

GGGTGGGGGGTCCCATGGAGTACTCCAGCACACGCAGGGGCTCCCTGCAGACAGGGGGGG 

CCTTCGCCCTGGAAGCCTGGACGCCGAGATAGACTTTCTGAGCAGCACGCTGGCCGAGCT 

GAATGGGGGGTCGGGGTCATGCCGTCACGGCGACCAGACCGACAGGCATATGAGCC 

Sequence 1797 

TCCGATTNNGCCNAGGGTTGCAGTTTGTAGACCCCTGATCTAGACCCTTAAGTAGCCTTG 

TTTGTGCCTGAAGTTTACAGATGATCCCCAACTT Al I I I I ATTTT A I Mi l l I GAGATGG 

AGTCTCTCTCTGGAGCCCAGGTTGGAGTACAGTGGCAAGATCTTGGCTCACTGCAACCTC 

CACATTCCGGGTTCAAGTAATTCTCCCGCCTCAGCTTCCTGAGTAGCTGGGATTACAGGC 

GTGTGCCACCATGCCAGCTTTTTTGTATTTTGTATGTTAGCCATGTTGGCAAGGCTGGCC 

TCAACCCCTGACCTCAAGTGATCTGCCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGG 

AGTGAGCCACCATGCCCGAACCCCAACTTATGTTTGACTTACAATGTTTGGACTTTATGG 

ATGGTGCAAATGTTATATGCATTTCAGNAGAAACTGGGCTTCAAAATTTTGMTTTTTGA 

ATCTTTTTATTTT 

Sequence 1798 * 

TCCGCTGCCGAAGTCAGTTCCTTGTGGAGCCGGAGCTGGGCGCGGATTCGCCGAGGCACC 

GAGGCACTCAGAGGAGGTGAGAGAGCGGCGGCAGACNACAGGGGACCCCGGGCCGGCGGC 

CCAGAGCCGAGCCAAGCGTGCCCGCGTGTGTCCCTGCGTGTCCGCGAGGATGCGTGTTCG 

CGGGTGTGTGCTGCGTTCACAGGTGTTTCTGCGGCAGGCGCCATGTCAGAACCGGCTGGG 

GATGTCCGTCAGAACCCATGCGGCAGCAAGGCCTGCCGCCGCCTCTTCGGCCCAGTGGAC 

AGCGAGCAGCTGAGCCGCGACTGTGATGCGCTAATGGCGGGCTGCATCCANGAGGCCCCG 

TGAGCGA TGGM CTTCGACTTTGTCACCGAGACACCACTGGGAGGGTGAACTTTTNCCTT 

GGAANCGTTTTGCGGGGGCCTTGGCCTGCCCMGCTNTACCTTTTCCAACGGGGCCCCGG 

CGAG 

Sequence 1799 

GGCGNAGCTCGNCTTCCTCCNCGCCCAAGTTCCGGCGCCGCTCTTGCGGGAGCGTGCCGC 
ATCACCCCGGGGGCCCCTACGCGAGGATCTCCGGGGCCGTTGGCAGCAGCCTG 
Sequence 1800 

TCACCCCGCGTCCGGGCGGGCGTGGGGCGGTGGGAGGTAGTGAAGAAGGGTCGGCGGCCT 

GGGGTCGGCGCCGGCGCCGGCGGCCGAGGAGGCGGTAGGAACCGCAGGGCGCTCGGGGAA 

GCAAACNGAGTGTGGAAATACNACCTGACCCCTGCAATCCAGACCACAAGCACCCTTTAT 

NAGCGGNGCTTTNAGAATATCATNAAGNGGNTTTAATAAGGAGCAGGTCCCACCCCCTGC 

TGTGGAA 

Sequence 1801 

GTCGACCCACGCGTCCGGGAGCAGAGTCGACTGGGAGCGACCGAGCGGGCCGCCGCCGCC 
GCCATGAACCCCGMTATGACTACCTGTTTMGCTGCTTTTGATTGGCGACTCAGGCGTG 
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GGCAAGTCATGCCTGCTCCTGCGGTTTGCTGATGACACGTACACAGAGAGCTACATCAGC 

ACCATCGGGGTGGACTTCAAGATCCGAACCATCGAGCTGGATGGCAAAACTATCAAACTT 

CAGATCTGGGACACAGCGGGCCAGGAACGGTTCCGGACCATCACTTCCAGCTACTACCGG 

GGGGCTCATGGCATCATCGTGGTGTATGACGTCACTGACCAGGAATCCTACGCCAACGTG 

AAAGCAGTGGCTGCAGGAGATTGACCGCTATGCCAGCCGAGAACGTCAATAAAGCTCCTG 

GTGGGGCAACAAAGAGCGGACCCTCACCACCAAAGAAGGTNNGTGGGACCAACACCACAA 

GCCMGGGAGGTTTGGCMGACTTCTTTTGGGGCATTC 

Sequence 1802 

NCCCCGCGTCCGCGGACGCGTGGGCGGAGCTGCTGTGCAGTGGAACGCGCTGGGCCGCGG 

GCAGCGTCGCCTCACGCGGAGCAGAGCTGAGCTGAAGCGGGACCCGGAGCCCGAGCAGCC 

GCCGCCATGGCAATCAAATTTCTGGAAGTCATCAAGCCCTTCTGTGTCATCCTGCCGGAA 

ATTCAGAAGCCAGAGAGGAAGATTCAGTTTAAGGAGAAAGTGCTGTGGACCGCTATCACC 

CTCTTTATCTTCTTAGTGTGCTGCCAGATTCCCCTGTTTGGGATCATGTCTTCAGATTCA 

GCTGACCCTTTCTATTGGATGAGAGTGATTCTAGCCTCTAACAGAGGCACATTGATGGAG 

CTAGGGATCTCTCCTATTGTCACGTCTGGCCTTATAATGCAACTCTTGGCTGGCGCCAAG 

ATAATTGAAGTTGGTGACACCCCAAAAGACCGAGCTCTTCTTCAACGGAGCCCAAAAGTT 

ATTTGGCATGATCA 

Sequence 1803 

CGCGTCCGCGCTTCTGTTACGGCCAGTGCAACTCTTTCTACATCCCCAGGCACATCCGGA 

AGGAGGAAGGTTCCTTTCAGTCCTGCTCCTTCTGCAAGCCCAAGAAATTCACTACCATGA 

TGGTCACACTCAACTGCCCTGAACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTG 

TGAAGCAGTGTCGTTGCATATCCATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGC 

ATGTCCTAGGAATGCAGCCCCAGGAAGTCCCAGACCTAAAACAACCAGATTCTTACTTGG 

CTTAAACCTAGAGGCCAGAAGAACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCCG 

TAGTTCGTGTGCATGAAGTGTGGATGGGTGCCTGTGGGGGTGGTTTTTAGGACACCAGAA 

GAAMCACAGTCTCTTGCTAGAGAGCACTCCCTATTTTGTAAACATATCTGCTTTAAGGG 

GGATGTACCAGAAACCCACCTTACCCC 

Sequence 1804 

CCCGCGTCCGTAGATTAAATTATGCAAGTTGCAAGAATGTAGTAACTCTGATCAGCTACA 

AGGAAAGGAGGAAAGAGTAAATGAAGAAAGTCATCTAACTGAAAAGGAATATATAGAACA 

TTGTAACACCCCTACAACTGATTCTGATTCATCTATAGCAGTTAAAGCACTACAAATAGA 

TAGCTTTGGTTTAGTTACATGCTTTCAACMGAGTCTCTTGATGTTTCTCAAATGATACT 

TGGAAAATCTCAGCAACCTGAGTCAAAAATGCAATCTGAATTTATAAAAGAAAAAAGTGC 

TACTTGTTCAAATGAGGAAAAAGGTAACTTAAACGAGGTCAGTAATAACTGAAGAGAAAG 

AAACAGATGGGAGATCACCCTATCTTCATTACTGAACCAAAACTACCNGTTCACAATATA 

CCTGGATTCGACAGCATAAAAGAAACC 

Sequence 1805 

GCGTCCGGCAGCTGAAAGGGGATTTGGGCCCGGAAGATCCGAGTCCATCCGCGGCGGGGA 

GAGGGCAAGCGGGACCGGTAGGGGCCGGAGCAGCGGCGGCGGCGCTCGGACTGTCCCATC 

CGCCCCGTATTGAGGCGCTGGGAGCGGCGGGGCGACAGGAAAGCGATGGTGAAAGCGGGG 

CCGTGAGGGGGGCGGAGCCGGGAGCCGGACCCGCAGTAGCGGCAGCAGCGGCGCCGCCTC 

CCAGAGTTCAGACCCAGGAAGCGGCCGGGAGGGCAGGAGCGAATCGGGCCGCCGCCGCCA 

TGGAGCTGAGAGTCGGGAACAGGTACCCGGCTGGGCCGGAAGATCGGCAGCGGCTC 

Sequence 1806 

GTCGCCCCGCGTCCGCACAGTTGATTCTGMTTTTTAAGGCTTTCCTAATAGGCTGATCA 

CAGAGAATAATCCATTTTGAAGGTATAAAACTGCACTGTATGTCTGTCACTTGTAGCTGA 

A CTGAT TCACATTTTGACAAAAGAGAGAAAATACAAAAATGAGTTTTGCAAATGTAATAA 

CTTTTTCTGCATATAGAACTAAATAATTGAAA AATATG GGCTATAGTTCTCAAAGGTAGA 

TAGTAAAATCACTGGCTTTTTCCAGCTGTATGTTTTTCCACTGTGCGTGTACACACACAC 

TGGAAMTAATTAGGCTGATTTTGCAGGTCTTCATTGTTAGAGATTCTGAAGTATTTACT 

GTCMTTCATAGGTTTCAGTTTATTTAGGAAATTAGTGT!TGACAGCI I I I I I I AAATTA 
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TTTCACTGMGCTGAGATTATTAGTGATCAAAGTTAAMTTTCMTATTTAATTTCTCTA 

TATATTATTAATATTAAAATGGTTT 

Sequence 1807 

GTCGACCCCGCGTCCGGTACTACATCCCCTGAAGAAACATGTGCCCAGGTGATTCGTGAA 

GCTAAGAGAACAGCACCAAGTATAGTGTATGTTCCTCATATCCACGTGTGGTGGGAAATA 

GTTG GACC GACACTTAMGCCACATTTACCACATTATTACAGAATAT^ 

CCAGTTTTACTACTTGCAACTTCTGACAAACCCCATTCCGCT7TGCCAGAAGAGGTGCAA 

GMTTGTTTATCCGTGATTATGGAGAGATTTTTAATGTCCAGTTACCGGATAAAGAAGAA 

CGGACAAAATTTTTTGAAGATTTAATTCTAAAACAAGCTGCTAAGCCTCCTATATCAAAA 

AAGMAGCAGTTTTGCAGGCTTTGGGAGGTACTCCCAGTAGCACCACCACCTG 

Sequence 1808 

CCCCGCGTCCGGCCTTTAAAAGAAGACTTGAATTCCTATGGAACAATAAAGACACAGCAG 

AAMCAGGGATTCTCCTGTTTCAGAGGAMTAAAMTGACCTGTCMCMTTTATCCATT 

ATCACCGTGACCTCTGTATCCGAAACATTGTCAAAGAAAGAAGGTGTGGTGCAAAGACTT 

CTGCTGGAACTTTCTGTGGCTGTGACCTGGTGAGCTGGCTAATTGAAGTCGGCCTTGCCT 

CCGACCGTGGTGAAGCTGTGATATACGGAGACAGGCTGGTACAAGGGGGAGTCATCCAAC 

ATATTACCMCGAGTATGAATTCCGGGATGAGTACTTGTTTTACAGATTTCTTCAAAAGA 

GTCCTGAACAGAGTCCTCCTGCTATTAATGCAAACACTCTCCAACAGGAAAGATATAAAG 

AAATTGAGCATTCATCCCCACCCTCACATTCCCCTAAGACCTAAATTATGCAGGGGAGAA 

CCCTACATGGAATCAT 

Sequence 1809 

CGCGTCCGCTGGAGTGCTGCTGAGGAGCGANGGGCCCATCTGGGGTCTCTGGAAGTCGGT 

GCCCAGGCCTGAAGGATAGCCCCCCTTGCGCTTCCCTGGGCTGCGGCCGGCCTTCTCAGA 

ACGMGGGCAGTCCTTCCACCCCGCGGCGCAGGTGACCGCTGCCATGGCTTTTCCCCATC 

GGCCGGACGCCCCTGAGCTGCCTGACTTCTCCATGCTGAAGAGGCTGGCTCGAGACCAGC 

TCATCTATCTGCTGGAGCAGCTTCCTGGAAAAAAGGATTTATTCATTGAGGCAGATCTCA 

TGAGCCCTTTGGATCGAATTGCCAATGTCTCCATCCTGAAGCAACACGAAGTAGACAAGC 

TATACAAGGTGGAGAACAAGCCAGCCCTCAGCTCCAATGAACAATTGTGCTTCTTGGTCA 

GACCCCGCATCAAGAATATGCGATACATTGCCAGTCTTGTCAATGCTGACAAATTGGCTG 

GCCGAA . 

Sequence 1810 

CGCGTCCGGTGCATCTGAGGACTGGTGGGAAGGCANGGCACAACGGGATTGACGGGCTGG 

TGCCTCACCAGTATATAGTGGTGCAGGATATGGATGATACGTTTTCAGACACTCTGAGCC 

AAAAAGCCGACAGTGAGGCCAGCAGTGGGCCAGTCACGGAAGACAAGTCCTCATCCAAGG 

ACATGAACTCCCCGACAGACCGTCATCCTGACGGCTATTTAGCCAGGCAACGAAAAAGAG 

GAGAGCCACCCCCTCCAGTAAGGCGTCCTGGCAGGACCAGTGAT 

Sequence 1811 

TCAGGAGTCGACCCCGCGTCCGGAAGGCCGATGCTGTGGGGGTGGGCGTGGAGAGAATTC 

TTCTGTGGGTCCTCTGGTGTTGAGTGGTCGGCTTGGTGTGGTGTGCGGAGGAGCTCCAGG 

CCCGTCGGCGCGGAGTGGTCTCACGTGTGAAACATGGCTACAGATTGGCTGGGAAGTATT 

GTGTCCATCAATTGTGGAGATAGCTTGGGTGTCTATCAGGGAAGAGTGTCAGCTGTGGAT 

CAGGTCAGCCAGACCATTTCTCTCACCCGGCCTTTCCATAATGGAGTGAAGTGTCTTGTT 

CCAGAAGTCACCTTCAGGGCAGGTGACATTACGGAGTTAAAAATTCTGGAGATACCAGGA 

CCTGGAGACMCCMCATTTTGGAGACCTTCATCAAACAGAATTAGGCCCCTCTGGTGCT 

GGCTGCCAAGTGGGCATCAATCAGAATGGCACAGGCAAGTTTGTCAAG 

Sequence 1812 

CCGCGTCCGCCCAGTCCNAGTGCTGGCTTTCCCTGTATCTGCCTCTGCCAGGCAACACTT 
ATCATGGCTCCCAATCAGCAGGAGCCTCCATGCTCCACTTTGAACAGCCTCTATGCTCCA 
GCAATGGGGCATTTGTGAAGAGTGACTTGATTAACTTTTCTGACCATGGGTATAATACAG 
TTGCTTCAGAGGGCAGTGGTTCTGGGTGTGA I I I I I ACACTGTAACATTGTATACAGTGT 
CATGGATAATTACTATTTTTTTCTGGTCATTAACACTCACCTACTCTAGTACTAGGATTT 
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CAGACCMGGTCCTCATGACGCCTGGATATTTTAGTATCTATATCCAATAATCTTTTCTC 

TCCTACTGMTATCCAGGCAMGATGAMTCGTTTTCTTTAAAACTGTCAAATTCTGTAA 

AACTCAGGAGCCAGTTCAAGGGAACAAGCATCTTCACAATAGATGGAATCAAGAGTTAAA 

TGTTATAGTGGCAAGCTTGTCTACTGGGCAACAGAC 

Sequence 1813 

CGCGTCCGCCGCCGCGTCCGCTCCCGGTCCCTGGCCCCTCAGCGGCATGGCGTGCGGGGC 

GACGCTGAAGCGGCCCATGGAGTTCGAGGCGGCGCTGCTGAGCCCCGGCTCCCCGAAGCG 

GCGGCGCTGCGCCCCTCTGCCCGGCCCCACTCCGGGCCTCAGGCCCCCGGACGCCGAGCC 

GCCGCCGCCGTTTCAGACGCAGACCCCACCGCAGAGTCTGCAGCAGCCCGCCCCGCCCGG 

CAGCGAGCGGCGCCTTCCMCTCCGGAGCAAATTTTTCAGAACATAAAACAAGAATATAG 

TCGTTATCMGAGGTGGAGACATTTAGAAGTTGTTCTTAATCAGAGTGAAGGC7TGTGCT 

TCGGGAAAGTCAACCTCACTCCTCAGCACTCACAGCACCTAGCTCTTCCAGGTTCCTCAT 

GGATGAAGAAGGACCAGCCCACATTTACC 

Sequence 1814 

CGCGTCCGTTGMGAATAATATTGTATGTGCATTTTATCCATTMTGTTTCATACTTTCT 

G AGAGTAT MT ACCCT TTTAAAAGATATTTGGTATACCAATACTTTTCCTGGATTGAAAA 

Ci i i I I i I AMCTTTTTAAAATTTGGGCCACTCTGTATGCATATGTTTGGTCTTGTTAAA 

GAGGMGAAAGGATGTGTGTTATACTGTACCTGTGAATGTTGATACAGTTACAATTTATT 

TGACAAGGTTGTAATTCTAGAATATGCTTAATAAAATGAAAACTGGCCATGACTACAGCC 

AGAACTGTTATGAGATTAACATTTCTATTGAGMGCTTTTGAGTAMGTACTGTATTTGT 

TCATGAAGATGACTGAGATGGTAACACTTTCGTGTAGCTTAAGGAAATGGGGCAGAATTT 

CGTAAATGCCTGTTGTGCAGATGTGTTTTCCCTGAATGCTTTCGTATTAGTGGCGACCAG 

T 

Sequence 1815 

GTCGC CCCGCGTCCGATTTAAACTGGGTC7TTATAAMGTAAAATGGCCAACATTTAATT 

ATTTTGCMAGCAACCTAAGAGCTAAAGATGTMTTTTTCTTGCNATTGTAMTCTTTTG 

TGTCTCCTGAAGACTTCCCTTAAAATTAGCTCTGAGTGAAAAATCAAAAGAGACAAANGA 

CNT IsrTTC GANTCCANNTTTMGGCTGGGGGAMNTGGGTTTTTTAGCNCMCCNTTNCAA 

MGTTTTNTTTNGGGGATTCATAACANC^lCCACCNAATTG^r^TTTTTGTTG 

CATTTCMTACTAGTTATATTNANNAGGAGTNGGTAGAGAGGAAACATTTGACTTATCTG 

GNAAAAGCAAACTTGTACTTAAGAATAAGAATAACATGGNCCATT 

Sequence 1816 

TCGACCCCGCGTCCGCTCTGCTCCTTGTCTCCNTCTNCCTTTTTCTGTCTTTGCCGGGTC 
TCTGGGTCTCTGACCCCCATCCGGCCCTCATGGCTTTGGGTCNGGAGCTNTTGAAGCAAT 
GTTCATCAT 
Sequence 1817 

CCACGCGTCCGGGGGAGCCGGACGCCAGAGTCCCCTCTCCACGCCGTGCAGCTGCGCTGG 

GGCCCCCGGCGCCGACCCCCGCTGCTGCCGCTGCTGTTGCTGCTGCTGCCGCCGCCACCC 

AGGGTCGGGGGCTTCAACTTAGACGCGGAGGCCCCAGCAGTACTCTCGGGGCCCCCGGGC 

TCCTTCTTCGGATTCTCAGTGGAGTTTTACCGGCCGGGAACAGACTGGGGTCAGTGTGCT 

GGTGGGAGCACCCAAGGCTAATACCAGCCAGCCAGGAGTGCTGCAGGGTGGTGCTGTCTA 

CCTCTGTCCTTTGGGGGTGCCAGCCCCACACAGTGCACCCCGATTGAATTTGACAGCAAA 

GGCTCTCGGCTCCTGGAGTCCTTACTGTCCAGCTCAGAGGGAGAGGAGCCTGTGGAGTAC 

AAGTCTTTGCAAGTGGTTTCGGGGCCAACAGTTCGAGC 

Sequence 1818 

TCGACCCACGCGTCCGGTGAAACACAAAACCAAGGAGTACATTAAGAAGTACATGCAGAA 

GTTTGGGGCTGTTTACAAACCCAMGAGGACACTGMTTAGAGNGACTGTTGGGCCAGGG 

TGGGAGGATGGGTGGTCAGGTAANGACAAGACTCTAGGGNAGAAGGAAANCCTGTGGGCC 

TTTNTGTCCCACCCCTGTCAGCACTGGTGCTACTGATTGATACATNACCCTGGGGGGNAA 

TTNAACCCTGCCAGMATGTCAACNTGGAANGGCCACAAAGAAGTGGAACCTCCCATCTAC 

AAAANNGAGTTTACNCTTANATTTTGTAGAAGCCTNGTTTGGCCATTGTTGCNNNTAGAN 
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AGTCCATNNANTAGGNGGCAAGGGGGCCTNTANTAAAAATGAACCCCTGGNACAGAACCT 
TGNACTTNCACATTCTTTAAMNCCCTGGGAGATGNTTTGCTTTCTNTGGGNCGNGTTGN 
TTTGNTTCAAGCTGCTACNAAGTAACTCTCAATGGCCGGCAATTATCCCCAACTCNCACN 
AAACTCCNTTTTTAACCCCTGGCANGGAATCCTTGCAAATTTAATAA I 1 1 I 1 1 AAATGGG 

Sequence 1819 

GTCCG GCTT TAGTGMTTCTTAATAGATNGTATATATAAMGTACATTTTAATAGAAAGC 

CAGGGTTTTAAGGAATTTCACATGTATMGGTGGCTCCATAGCTTTATTTGTAAGTAGGC 

TGGATAMTGGTGCTTAAATGGTAATGTACTCCACTTCTTCCTATTGGAAGATTAACATT 

ATTTACCAAGAAGGACTTAAGGGAGTAGGGGGCGCAGATTAGCATTGCTCAAGAGTATGT 

AAAAAAAAAAAAAAAAAAAANGAACCAAACCACTGGAAATAATCAAATGCAAAAAGGTAA 

CAAATTCATAACTGGAAAGCAAAGAGAAGAACAAGTATGATTTGGATGATAAAGCATTGT 

TTTMTGGTGAAAACTTCACCAGATCACTTMTGTTTCTAGGAGGTTAACTTCAAGTGGG 

CAANTGGGGGTTTTTAGGTAGGTCAGTGGCCCTAAGTTCCTAAAGCCCACAGATTAGGGA 

TCTGTTAMCTGMTGGTCTGTTGGAMGGTTTTGTTTTAACTGCTTGGGAGGCTTTCCT 

TTTAAG 

Sequence 1820 

GCGTCCGGGAAMGTTTGCCTTCCANGCCGAAGTTAACAGAATGATGAAACTTATNATCA 

ATTCATTGTATAMAATAAAGAGATTTTCCTGAGAGNACTGATTTCAAATGCTTCTGATG 

CTTTAGATAAGATAAGGCTAATATCACTGGACTGATGAAAAATTGCTCTTTNTTGGAAAT 

GGAGGAACTAACAAGTCAAAAATTAAAGTGTGATAAGNGAGAAGAACCCTGCTGCATGTC 

ACAGACACCTGGTTGTCTGGAAATGACCAGTAGAAGAGTTGGGTNTAAAAAACCTNTGGT 

ACNCATTAGTCCAAATCTGGGACAAGNCGGAGTTTTTTAAAACAAAAATTGACTTGAAAG 

CCACCAGGGAMGATGGCTCAGTTCAMCTTTTTTGGAATNTGGATTGGGCCCAGTTATG 

GTGGTCCGGTTTTNCTATTTCCCNCCTTTCCCTTGTAGCAGATTNMGGTTTATTTNGTC 

ACTTTCAAAAAACAACAAACAAACNGATTACCCCAAGCAACATCTTGGGGGAGNTCTTGA 

CTTCCAAATG 

Sequence 1821 

CGTCCGCGGGTAAAATGTTATGGTMGCATGCACANGTTTGCAGTCTACAGTTTTTTTAT 

GTA GCACAA AATAGGTGTACCTTTATMGTACATTCAATTTTATGATTTACATTTATCAT 

GT AATTTT TAAAAAAATCCATCTATCTAGGATATGTTGGATACAMGTCTGCTTTTGCTA 

TTCTT1TTGCTTAMTACTCCTATCATTTTCTGMTTACTTGGTATTTAGMCTC 

ACCACGGGGAAGAATAGAGGTATCATCAAACGTGGCAAATTTTCTTTCAGGAATAATAAA 

GAGCATGATTCCACAGCTTTTCTGGGGATGTTTGAGATTCI 1 1 I I I AGTACTAAGCAAAA 

TTCTCATCACAGGAATGTAGCCCAGGCCMTTTATMCTAAATCTCTATTTTGTTCGGAT 

GATGCTTCTAAAACAGCATTGATAGGTTAMGAAGCTTGGGTATTTTTMTTTACTTCAA 

TGATTAGCTCMTTGCTTCTGGGAGTTTTAATCCTGTGGATATGTCAT 

Sequence 1822 

GGGCGCCCCGCGTCCGCTGATCTCGGGCTCCTATTTCATTTACATTGTGTGCACACCAAC 

GTAACCAGTGGGAAAACTTTAGAGGGTACTTAAACCCCAGAAAATTCTGAAACCGGGCTC 

TTGAGC CGCTA TCCTCGGGCCTGCTCCCACCCTGTGGAGTGCACTTTCGTTTTCAATAAA 

TCTCTGCTTTTGTTGCTTCATTCTTTCCTTGCTTTGTTTGTGTGTTTGTCCAGT^ 

TTCAACACGCCAAGAACCTGGACACTCTTCACTGGTAACATATTTTGGCAAGCCAACCAG 

GAGAAAAGAATTTCTGCTTGGACACTGCATAGCTGCTGGGAAAATGAACATCAGTGTTGA 

TTTGGAAACGAATTATGCCGAGNTTGGTTCTAGATGTGGGAAGAGTCACTCTTGGAGAGA 

ACAGTANGAAAAAAATGAAGGATTGTAAACTGAGAAAAAAGCAGATGAAAGTGTCTCACG 

AGCTATGTGTGCTCTGCTCAATTCTGGG 

Sequence 1823 

CCCCGCGTCCGGTCCTTTGGCACTGTCATNTGTGTCCCTCGAGTGAGCNTCACCAGAGCT 
GCAGTAANNGNGCACCTACTACNGGCTCTGNGCTGAGTCCTTCCAGTGNGCCTCTCACTG 
AATNCTCACCCCACTGNCATGAGGTTTNCCCCATTTGACTGATGAGGGTGNAGAGCCAGG 
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GAGCCTTGNTCACTGGTTCATTGANTACAT7TACAAATATTATTNACAGAGTGGGAGAAG 
AGCCGTATAGGGNCTTMTGCCATGGTNGGGACTTTGGAATTTAATTCAAGTGATGTGGG 
AGTACCTGCCAGATGGATGGAGGGTGAAATAATGCTTAAGCCCTCAGCC 
Sequence 1824 

CGCX^CACGCGTCCGCACCGGCCCGATGAGCTGCAGCGGCTCCGGCGCGGACCCCGAGGCG 

GCTGCCGGCCTCCGCCGCCTCGGCCCCGGGCCCCGCGCCCCCGGACTNGGCTCCCGCCGC 

GCTGCCCTCCAGCACCGCCGCGGAGAACAAAGGNCAGCCCCCGNGGGGACAGGNNGGGGA 

GGACCTGGGNGCTNGGAGCAAGCATGCTGGGGGGCACAGGGACCCTTTGGCGGGAAGCCG 

GGCCGGTAGAGCNCAGCNTGAAGCAGGNTNTGAGGAGCTGGNTCCCTGNGNATNGGNNGA 

GGATGGGCCGCNGGANGCCCTCTCCGNGAAGGGGGGNGCTCAATAATCTTAACCGGGGGG 

TTTNAA 

Sequence 1825 

CGCGTCCGGCTGAAGGCTCCCCTGGGNTTNCTGGCCTCCTGGGGCAGAANGGGAGAGAAA 

GGCGATGCTGGCAACTCCATTGGAGGAGGCAGAGGGGNACCTGGCCCTCCAGGGCTCCCT 

GGGCCCCCAGGGCCAAAGGGAGAAGCAGGTGTCCGATGGCCAGGTTGGCCCCCCAGGGCA 

GCCAGGAGACAAGGGGGAGCGTGGAGCAGCTGGAGAACAGGGACCAGATGGCCCCAAGGG 

CTCCAAGGGAGAACCAGGGAAAGGAGAGATGGTGGATTACAATGGAAACATCAATGAGGC 

TCTCCAGGAGATCCGGACGCTGGCCTTGATGGGGCCTCCTGGTCTTCCTGGGCAAATTGG 

CCCACCTGGAGCTCCAGGGATTNCAGGCCAGAAGGGGGAGATTTGACTGCCANGGCCTTT 

CAGGACACGATGGGGAAA 

Sequence 1826 

GTCGACCACGCGTCCGG I I I II I I I I II I I I GAGACAGAGTTTTGCTCTTGTTGCCCAGG 

CTGGAGTGTGATGGCTCGATCTTGGCTCACCACAACCTCTGCCTCCTGGGTTCAAGCAAT 

T CTCC TG CCTCAG CCTCTTGAGTAGCTTGGTTTATAGGCGCATGCCACCATGCCTGGCTA 

ATTTTGTGTTTTTAGTAGAGACAGGGTTTCTCCATGTTGGTCAGGCTGGTCTCAAACTCC 

CAACCTCAGGTGATCTGCCCTCCTTGGCCTCCCAGAGTGCTGGGATTACAGGTGTGAGCC 

ACTGTGCCGGGCCCGTCCCCTCCTTTTTTAGGCCTGAATACAAAGTAGAAGATCACTTTC 

CTTCACTGTGCTGAGAATTTCTAGATACTACAGTTCTTACTCCTCTCTTCCCTTTGTTAT 

TCAAGTGTGACCAGGATGGCGGGAGGGGGATCTGTGTCACTGTAGGTACTGTGCCCAGGA 

AGGC 

Sequence 1827 

CGACCNCGCGTCCGGCACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTG 

AGCGCCCCGCCCCGGGGCCTGAGCTGGACGTCGCAGGCCTGCGCCCCCCGACCCCGGCTG 

GCCCCGCTTCCAGCTGCCGAGGCCTCGTCGCGCCTTCCCCGGGAACAAAAGGCGGGGTCG 

CAATGGAAGAAGAGATCGCCGCGCTGGTCATTGACAATGGCTCCGGCATGTGCAAAGCTG 

GTTTTGCTGGGGACGACGCTCCCCGAGCCGTGTTTCCTTCCATCGTCGGGCGCCCCAGAC 

ACCAGGGCGTCATGGTGGGCATGGNCCAGAAGGACTCCTACGTGGGCGAC 

Sequence 1828 

CNCCACGCGTCCGGACCGGGGAAGACGCCTCTTCGCCGCCTCCGAAAACCGAGGCAGCGA 

GCGACCCCCAGCATCCCGCGGCCTCCGAAGGGGCCGACGCCGCCGCCGCCTCGCCGCCAC 

TGCTGCGCTGCCTAGTGCTCACCGGCTTTGGAGGCTACGACAAGGTGAAGCTGCAGAGCC 

GGCCGGCAGCGCCCCCGGCCCCTGGGCCCGGCCAGCTGACGCTGCGTCTGCGGGCCTGCG 

GGCTCAACTTCGCAGACCTCATGGCTAGGCAGGGGCTGTACGACCGTCTCCCGCCGCTGC 

CTGTCACTCCGGGCATTGGAGGGCGCCGGGGTGTTNTGATCCGCAGTGGGCGAGGGGAGT 

CAGCGACCGCAAGGCAGGAGACCGGNTGATGGTGTTGAAC 

Sequence 1829 

GGTGTCGCCCCGCGTCCGCTTGT C I MM I GGGGGTGTGAA I I I I I I GCATTGTTCTGAT 
CA TATT CTTTATCATGTAATTTATGTTC I Mil I ACTAAGTATTATGTGTGGTTATTATA 
GATTTTCACAAAGATATATTGCTGGTMTATATTTTATTGTGTAGTCTTATMTTTACTT 
AACCTTCTTTCAATTGTTAGAAATTTAGGCTATTTCCAGATTTTCAGTATTGTAAATAAT 
GCTGTGATGACCAATTTTGTGAATAMATGTTTTTATGTATTTCAGATTATTCCCTTAGG 
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ATAGTCTCTCAGTGCCAAGTTGTCAAAMCATCTCTATTTTGCTTATCTTCCTGCTCTCT 

TGCTGCCTTAGGGGGTAGTAAACTGAAACATAAAGTAAACATGCATACAAATAAAAAACA 

TAAMCAAAAATAAGCAACCTGATGGTMTAGGTGAAAGTGGTAACCTGTTTTAACTTTG 

AATTCTTGCCCGGCGCGGTGGGCTCACGCCCTGTAATCCCAGCA 

Sequence 1830 

CGCGTCCGGTAAACCAGCCGGAGCGGCGCGGNAGCGGCAGGACCGCCGTGGCGCCTAGAG 

TAG CAGA CCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCCGGTGACAGCGTGAG 

AGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTAGTCGTTCTTGGCTCAGGAGGC 

GTCTGGAAAGNCTGCTTTGACTGTACCMTTTTGTTCAAGGMTTTTTGTAGAAAAATAC 

NGATCCTACCGATAGGAAGATTCTTATAGAAAGCAAGTTTGAAAGTAAGATGCACAACCA 

GTGTATGCTTTGAAAAATCTTTGGATACCTGCCAGGNAACCGGGAGCCAATTTACANCCA 

ATGGAGGGGGATTTATMCATGNAAAAAMTGGGACAAAGGGATTTTGCATTTAGTTTAA 

TTNCAATCACCAGACACAAGTTCCCACCAATTTTAAAACGAATTTTACCAAAGAACCCTG 

GAGGAGNAACCAAGAANTTCTTTNNGMGGTTTAAAAAGGAACCACCTTGAATTGGAATG 

GTTTCCCAAATG 

Sequence 1831 

CCAATTATAGACTATATAGGGGGAAGAGCACTGGATTTGGAGTCAAGAAACCTGGACACT 

TGGCTCCACACTTCCTTAGCTGGGTAACTTTGGGCAAACCGCTTGGTCTCTCAAGCCTAA 

GGTTCTTCAGCTATAAAATGGGAATAATACTTCACTAACTACCTCACAGAGTTGTGGTAA 

GAATATAATC AGATA ACTGGATAAAAACACTATATAAACTGGAAAGCGCCGTACAAATGT 

GAGAGATCAGTTTTATTATCAMTCACTGTTTTCCACTGCCTCTTGMTCGGCTTTATTC 

TAACCMCCATTACATCTTTCTCATCTTTTGGAGTATGGGTAATTGAGGCTTGGGTGTGT 

CATCAGGGACTGGGAAGTTATTTCAGCTCCCATGTAAAAGGTGGGAGAGGTGGTTTGTGG 

GNGCAG 

Sequence 1832 

GNGTCGACCNCGCGTCCGCTATTTACTACCTCCTTATGAGGAAGTGGTGAACCGACCTCC 

AACTCCTCCCCCACCATACAGTGCCTTCCAGCTACAGCAGCAGCAGCTGCTGCCTCCACA 

GTGTGGCCCTGCAGGTGGCAGTCCCCCGGGCATCGATCCCACCAGGGGATCCCAGGGGGC 

ACAGAGCAGCCCCTTGTCTGAGCCCAGCAGAAGCAGCACAAGACCCCCAAGCATCGCTGA 

CCCTGATCCCTCTGACCTACCAGTTGACCGAGCAGCCACCAAAGCCCCAGGGATGGAGCC 

CAGTGGCTCTGTGGCTGGCCTGGGGGAGCTGGACCCGGGGGCCTTCCTGGACAAAGATGC 

AG A AATGTAG GG AG G AGCTGCT 

Sequence 1833 

GCCNCGCGTCCGTGAAACGCAAAAGAAGGAGCTCGGAATATAAGAACGTCAGAACGAGTG 

ACACTMTAGTGGATAACACTAGATTTGTTGTAGACCCATCCATTTTTACTGCACAGCCA 

MTACAATGTTGGGCAGGATGTTTGGATCTGGCCGAGAACATAACTTTACACGACCCAAT 

GAGAAAGGAGAGTATGAGGTGGCAGAGGGAATTGGTTCCACTGTGTTTCGAGCGATTCTG 

GATTACTATAAAACAGGAATAATCCGTTGTCCTGATGGCATATCTATTCCTGAACTGAGA 

GAAGCATGTGACTATCTTTGTATCTCTTTTGAATATAGCACTATTAAATGTAGAGATCTC 

AGTGCCCTMTGCATGAGTTATCAMTGATGGTGCTCGTAGACMTTTGAATTTTATCTG 

GAAGAAATGATCCTCCCTCTCATGGTAGCTAGTGCCCAGAGTGGGGAACGG 

Sequence 1834 

CCNCGCGTCCGCTTTAACCACCATAAGAAATCAGAAAACGCAAGATAAAGTTCAGCACAC 

AGTATGTATGGATTGCAGTAGCTACAGTACATACTGTTATCGCTGTGATGATTTTGTGGT 

TAATGACACCAAGCTGGGACTGGTACAGAAAGTCAGAGAACACTTACAGAACTTGGAAAA 

CTCAGCTTTCACAGCTGACAGGCATMGAAMGAAAACTTTTGGAAAACTCAACACTAAA 

CAGCAAGTTATTAAMGTMATGGAAGCACCACTGCCATTTGTGCCACAGGCCTTCGGAA 

TTTGGGGAACACATGTTTCATGAATGCCATCCTTCAGTCACTCAGTAACATTGAGCAGTT 

TTGGCTGTTATTTCAAAGAACTGCCCCGCCGTGGAGTTAAGGAATGGGAAAACAGCAGGA 

AGGCGGACATACCACACCAGGAGCCAAGGGGATAACAATGTGTCTTTGGTAGAAAG 

Sequence 1835 
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TTACTGGGCACCCANCTCCATGTGCANGACTTTTCCCAACACAGCCTTGGCCAGTCAGAT 
GGTGTGNCAGGGCCNNAGGTTTNCGTANCCTCTTGGGTGATAGAAAGGGGCCCAGGCCCT 
GGGCTGGGGCTCATAANGGACTCAAANGAGGCACCTTGCCC 
Sequence 1836 

TCGCCCGGGGGCCATGGCAGCAGCGGCTACTGCAGCCGAGGGGGTCCCCAGTCGGGGGCC 

TCCCGGGGAAGTCATTCATCTGAATGTGGGAGGCAAGAGATTCAGTACCTCTCGCCAGAC 

TCTCACCTGGATCCCAGACTCCTTCTTCTCCAGTCTTCTGAGCGGACGCATCTCGACGCT 

GAAAGATGAGACCGGAGCAATCTTCATCGACAGGGACCCTACAGTCTTCGCCCCCATCCT 

CAACTTCCTGCGCACCAAAGAGTTGGATCCCAGGGGTGTCCACGGTTCCAGCCTCCTCCA 

TGAAGCCCAGTTCTATGGGCTCACTCCTCTGGTTCGTCGCCTGCAGCTTCGAGAGGAGTT 

GGATCGATCTTCTTGTGGAAACGTCCTCTTCAATGGTTACCTGCCGCCACCAGTGTTCCC 

AGTGAAGCGGCGGAACCGGCACAGCCTAGTGGGGCCTCAGCAGCTAGGAGGACGGNCAGC 

CCCTGTCCGACGGAGCAACACGATGCCCCC 

Sequence 1837 

CGCGTCCGTTCTAGATCGCGAGCGGCCGCCCl I I I I I I I 1 1 I I I I ATCTTTCTG 1 1 MI C 

CACTAAAAGCTCCGTTTTTCCATCTTCCCATTACTCTCCCTTTCCTGTGNACACTCCTGA 

AG ACAGG CATCCTCATAAGGTGTTCTGMTTMCTTTAGGGNGTTCTCCAGGTACTTTGC 

ATCTTTTTATATTTCTTGTAMTTGNTMTTTCTAAGCTCCATGATTAAAGAGAATTCAC 

CACTAAAAAAAAAAAAAAAGG 

Sequence 1838 

CGCGTCCGGGCTGGCGAGCGCCGNCGCCGGCGGAGACCGACNCTNGGCCAGAGCCNGCCC 

GCGGCGCCCGGGCCTGGCCGGCTGCTTCCCGCCTCAGCNNGGCGCCCCNGCCTCCGTGCG 

CCGCAGACTTTGCCTAGGCGGGCNGAAGCTGAACAAGAGGTCCCTCGGCCCTGCACGGTC 

CGGCCGCGGTCCGGAGCCGANGCGCATGAGATTCCCCAGGAAGCCATCACACTCCTTCCC 

ACTGTGGTTTNGGGAGCATGAAGGCGTTG 

Sequence 1839 

GCCCCGCGTCCGTTTTATTTTGCACTTTTATGGGTGACAG I I I I I ACGCATAACCTTTGA 

TAAAATACACTCAAGTGACTTGG ACTTAGATGCTTATCCTTACGTCCTTGGTACC I I i I I 

TGTATTAACA MCACTGCMTTTATAGATTACATTTGTAGGAAGTTATGCTTTTTTCTGG 

TTT TTGTTT TACTTTCMCC TAGGT TATMGACTGTTATTCTATAGCTCCAACTTAAGGT 

GCCTTTTTMTTCCCTACAGTTTTATGGGTGTTATCAGTGCTGGAGAATCATGTAGTTAA 

TCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCAGCCTACATTCACTTCTAAA 

GTCTATGTMTGGTGGTCATTTTTTCCCTT7TAGAATACATTAMTGGTTGATTTGGGGA 

GGAMACTTATTCTGMTATTMCGGGTGGTGAAAAGGGGACAGTTTTTACCCTTAAAGT 

GCAAAAGTGGAACATACAAAATMGACTAMTTTTTNAGAGGTAACTCAAGTAATTTC 

Sequence 1840 

GTCCGGCCAGCTGATGCCGGGAGCTAACTACCGCGCCGGGGCCGGGGCCGGGGCCGGGGC 

CCGACGTCCCCGCGGGGCCCGGGACCGNGAGGAGGACGGCGGGGGCCTGGAGCCCGCGGC 

CGTGGCCCGCGACCTATTGAGGGGCACATCTAACATGTCATTTGAGGAGCTGTTGGAATT 

GCAGAGCCAAGTGGGGACTAANACNTACAAACAATTGGTAGCTGGAAATAGTCCTAAGAA 

ACAAGCTTCTAGACCACCTATCCAAAATGCATGTGTTGCANATAAGCACAGGCCTCTGGA 

MTGTCANNCAAGATCCGAGTNCCATTTTTACGTCAGGTTGTTCCCATTAGTAAAAAGGT 

AGC 

Sequence 1841 

GCACCCGCTGNGAGAGGCGGTAGCGGCGGCGGCGGCGGTGGTATCGGCGGCAGCTGTGAG 

GGGGTTCCGGGAAGATGGTGCTGATCAAGGAATNCCGGGTGGTTTTGCCATGTTCTGTTC 

AGGAGTATCAGGTAGGGCNAGCTTTACTCTGTTTGCAAGAAGCTTTGTAATGAATTGAGA 

CTGGGTGGGTGGAGAAGGAATTGAAGNTCTTAAAGAATGAACCTTATGAGAAGGATGGAG 

AAAAGGGACAGTTTACGCACAAAATCTATCACCTAAAGAGCAAAGTTGCCTGCATTCTGT 

GAGGATGATTGCTCCCGAGGGCTCCCTTGGTGTTTCATGAGAAAAGCCCTGNAAATGCCG 

TACCCCTACTGTAGNAACAATTNTNMCCGAAATGGAATATATTGAAANGAATGATTTTC 
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CTTCAATTNAAAANTCCGAAAACAATNGGCANCCAAAACCCATNACTTTGGGGAACCATN 
TAGGAAAAAANNGTACCNATNGGGTTTTTAGGATTCCCAAAACCACNATTGGGAAAAAAC 
CTTG 

Sequence 1842 

CCCGCGTCCGGCGGAGAAGACTCAAAAGAGTGTGAAGATTGCTCCTGGAGCAGTTGTATG 

TGTAGAAAGTGAAATCAGAGGAGATGTAACTATCGGACCTCGGACAGTGATCCACCCTAA 

AGCAAGAATTATTGCGGAAGCCGGGCCAATAGTGATTGGCGAAGGGAACCTAATAGAAGA 

ACAGGCCCTTATCATAAATTGCTTACCCAGATAATATCACTCCTGACACTTGAAGATCCA 

GTAACCMNMCCCTATGATTCATTGGCACCAATAAATGTTGTTTGGAAGTTGGGCTGGT 

TTATTCCCAAAGGCCCATTGAAAGATGGGGGAGGATAAAATAAATGGTCATTTGAAATCA 

MAAGTCMTTATTGTTAGGCCAGAMATGGTAAATATTTGGACCAAGTTGGCTTGCATT 

CATTTGGGNGGGCTTTGGTTGCMCCTTAMNTACATTTTGTAAGTTCATCCCCTGGAGA 

ATACCNGGTGGAATCCTTAAT 

Sequence 1843 

CGACCNCGCGTCCGGGGGATCTGTGCCTGGCATGGGGACGAGTTCTGGCCTCCTTAGGGT 

ACGGGGAGAGCTTGGACTTTGGTCCTGACGTGGTGGACGACACACCTTCGAAGAGTGGAC 

GTTACCTCAGTTGTCTGTTGTTAGAGTTTMTCGATCACTCCTCTGTTTTGTTGTGTTCT 

TTCCCAGAAATMCTTTACCAAAGGAAAGCTATTTTGCGAACTATCTTCTCCAGCGGAGA 

TGGCCMTGTGCTTTGTAACAGAGCCAGACTGGTTTCCTATCTCCCAGGATTTTGCTCTT 

TAGTTAAMGGGTTGTCAATCCCAMGCCTTTTCGACTGCAGGATCATCAGGTTCGGATG 

AGTCTCATGTGGCTGCTGCACCTCCAGATATATGCTCTCGAACAGTGTGGCCTGATGAAA 

CTATGGGACCCTTTGGAC 

Sequence 1844 

GGGACAGAGCCCCCGATCCGCCAGCACCACCTGAGGATTNNGAAACCGCCCCAGCGATGG 

AAGAGGGNCAGGAGCTGGAGAGGAAAGCAATAGANGAACTGCTTAAGGAGGCAAAACGTG 

GGAAAACTAGAGCTGAAACAATGGGACCCATGGGTTGCTTATTACAGGGACAAGATACAA 

ACTAAAATCAGCCAAAATAAGACACAAAGATTAAAGCCA 

Sequence 1845 

CGTNCGGGACCCGNCAACATGGGCCGCGTTCGCACCAAAACCCGTGAAGAAGGCGGCCCG 

GGTCATCATAGAAAAGTACTACACGCGCCTGGGCAACGACTTNCACACGAACAAGCGCGT 

GTGCGAGGAGATCGCCATTATCCCCAGCAAAAAGCTCCNCAACAAGATAGCAGGTTTATG 

TCACGCATCTTGATGAAGCGAATTTCAGAGAAGGCCCANTTAAGGAGGTATCTTCCATCA 

AAGCTGCAGGNAGGAGTGANGAGAGNAAAGGAGTAGACAATTATGTTTCCTGAAGGTCTC 

AGCCTTGG GATCAA GNGAGAATTATTNGAAAGNTAGATCCCTGNACAACTAAAGGAAATG 

CTTGAAGNCTTTTTGGACCTTCGNCAGGTTCTTGTCCAACCCTTTCAAGGTTCACTTCAG 

GCCTACAAGTTGGGGATTGAMTTTTTCAAAAACGCCNTCGGGGGAACCCTGTTTTGAAA 

ATTTTTTTTCTTG NT AG NTG CCTTGT ATTAATTTTT 

Sequence 1846 

GTCGACCCCGCGTCCGCAGCCTGGCCTGTGAGACCCTCGTGGACAACAACCTGCGGGTCA 

CCAACTGGAACCGCAAGCTGGGCTGCAAGTGCCAGTACAAGCACATTGTGGACTGGTGTG 

GCTGCTCCCCCAACGACTTCAAGCCACAGGACTTCCTCCGGCTGCAGCAAGTCTCCAGAC 

CCACCTTCTTCGCCCGGAAGTTCGAGTCGACTGTGAACCAGGAGGTGCTGGAAATCCTGG 

ACTTCCACCTGTATGGCAGCTACCCCCCCGGCACGCCAGCCCTCAAGGCCTACTGGGAGA 

ACACCTACGACGCGGCTGATGGCCCCAGTGGGCTCAGTGATGTCATGCTCACTGCTTACA 

CAGCCTTCGCCCGCCTCAAGCCTGCACCATGCCGCCACTGCTGCACCCCCAATGGGCACC 

CCACTCTGCAGGTTTGAGCCCAGGGGCTTTGCCGTCCAAGCGTGCACCTGTATTTCTATG 

ACGACCATTTC 

Sequence 1847 

TCCGCAAGAGTTATGCTTAAGACCAGCCAGCCTTGATAGTGGCAGAACATCCACTAGCAA 

TAGCAATAATAATGCTTCACTACATGAAGTCAAAGCAGGTGCAGTTAATAACCAAAGCAG 

GCCACAAAGCCACAGCAGTGGAGAATTTAGCGTGCTTCATGACCATGAGGCTTGGTCCAG 
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CAGTGGTAGCAGTCCAATCCAGTACTTGAAAAGACAGACCAGATCAAGCCCAGTGCTCCA 

GCACAAAATATCTGAAACACTGGAGAGTCGACATCACAAGATCAAAACTGGTTCCCCTGG 

MGTGAAAGTTGTTACTCTACMCAGTTTTTGGAAGAAAGCAATAAGCTTACCTCAAGTA 

CAGATAMGTCCTCMGTCAAGAGAATCTTTTAGATGAAGTAATGAAAAGTTTGGCTGNC 

TCTTCTGACTTTTTGGGAAAAG 

Sequence 1848 

GCGTCCGCGCCGGGCCGCTCTAGCCGGTGAGGCCGGCGGGCTCTCTGTGGCTGCGGCTGG 

GAAACCGCGCGGAGGAGGTGCCCGGCCGGGGACCAGCCCTGGTCCAGCGCCTCCCTCTCT 

CAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCTACCGTGCGCAGCTGCAGCA 

GGTGGAGCTGGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCAGGCTGACCTGCGCCAGCT 

GCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAGCCTGGTGTCTGTCAGGAA 

GAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCCGCCAGGAAGATGCTGA 

Sequence 1849 

AGTCACCACGCGTCCGGGTTGCACTCTTCCTATAGCCCAGAGGGCGAGAGGGCCTGTGGC 

CTGGGGGAAGGAGGACGAGGTTCTGCCTGGATCCCAGCAGTAGGACGCTGTGCCATTTGG 

GAACAAAGGAATAGTCTGCCTGGAATCCCTGCAGATCTTGGGGCCGGAGGCCAGTCCAAC 

CCTTGGAGCAGGAAGAAACGCAAAGTTGTCAAGAACCAAGTCXaAGCTGCCTCAGAGCCGG 

CCCGCAGTAGCTGCAGACTCCGCCCGCGACGTGTGCGCGCTTCTCTGGGCCAGAGCGAGC 

CTGTTTTGTGCTCGGGTTAAGAGATTTGTCCCAGCTATACCATGGGCCCGCACTCGGNAA 

AGCTGGCTTGCGTGGCCGCTGGTGTGGTTATCGGGGCTGGTGCCTGCTACTGTGTATACA 

GACTGCTTGGGGAAG 

Sequence 1850 

TCGACCNCGCGTCCGCTCAGGAACCTTNGAGAAGATNAGNNCCCCACTTAGATTNTTAAG 

GAGTAAAAAGGGCTGAGTTATGCCTTTMGNGCTGTCMGAATTCACTTGGGTTTGGGAC 

ATTTGCTGGTGTAATGCTAGATGCCCACAGCANCATAATATTGNNCTTTGTCAAAGGTNG 

GTAAATNCTNTGNTTNTCANCANCCCTTTCCCCA 

Sequence 1851 

AGTCACCACGCGTCCGCGGCTGGTGGTGGGCTCGGGCCGCTCGCCTTGCCCGTCTTCGCT 

TCCGGAGGTCGCTACTGCCGCCTCAGCGGCCCCGGAGCGGGGGCGCCCGGGGGTCCTTCG 

CCCCCGGCCAGGGTCCCCGCGCCGGGGCTTCGCCGCCCCCAGTGTCCGAGCTGGATCGTG 

CGGACGCCTGGCTCCTCCGAAAAGCGCACGAGACAGCCTTCCTCTCCTGGTTCCGCAATG 

GCCTCCTGGCATCGGGCATCGGGGTCATCTCCTTCATGCAGAGTGACATGGGTCGGGAAG 

CAGCATATGACCATCCCCCGACCTTGGCCCTGTTGCTACCCTGCCTCTNCCGCAGGCTTC 

TTNCTGCTGGGCGGCCTGTGCGTGGTGTGGGGCAAGCGCCTTGTACCCGTGGGCCTGGCG 

GCGCTTNGAGGACCCATGCAGCTGACCTGGGGGGCCGGCCTGGG 

Sequence 1852 

GCGTCCGCTCGCTGCAGCCCCGCCTGGGCCACGGCACCCTCGAGCGCCAGCCCCGCGCCC 

CACCCGGGAGCAGCGAGCCACCGGCGCGCTCCCCAGGAGCCCCTGCAGGCGCCGGCCCTG 

GTCCAGCGCCTCCCTCTCTCAGCATGGACGAGGAGAGCCTGGAGTCGGCCTTGCAGACCT 

ACCGTGCGCAGCTGCAGCAGGTGGAGCTGCCTTGGGCGCCGGCCTGGATTCGTCTGAGCA 

GGCTGACCTGCGCCAGCTGCAGGGGGACCTGAAGGAGCTCATCGAGCTCACCGAGGCCAG 

CCTGGTGTCTGTCAGGAAGAGCAGGTTGTTGGCCGCGCTGGACGAAGAGCGCCCGGGCCG 

CCAGGAAGATGCTGAGTACCAGGCTTTCCGGGAGGCCATCACTGAGGCGGTGGAGGCACC 

AGCAGCGGCCCGTG 

Sequence 1853 

GCGCCCCGCGTCCGGAAATTGACCCCTAGAGAAAATCCCATTAACTTGTTAAATTAGTGG 

MTTMCMCAMTAAAGCATGTTTGAGACCTGGCAAAMTTCCTCTGGTAGTATTTATA 

AATAGAGCTGCATGCCTCTAGTATGAAAACCGTATCAGTTGCAAGTGCCACTTCTACAAG 

TTACTCAGTTTACTCTTTGTATCAGTAACTTTAAAGGTTGGATGATCCTTGCTGGTTAAA 

GCTAAATCTCAACCTAGCAACTAAATGAAAATATTTAGAATCATCAGAATCTGAACAGAC 

TAAAATTATCAGCGATMGCAGAATCAAGCAGGGTATAAGTTTTATCTCAATTATTTGAA 
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ATTGACTGGAGTTTTCTTAAAGTGTAAAGCTGAAATTTGCTAACCATG1 I I I IGATGAAC 
CCACAGTGCAGCATTGGGTTGGGGTTTTAGATTTGAATGACTCTCTGCTATAATTATCAT 
GACTTTGAAA 
Sequence 1854 

TGTCGACCCCGCGTCCGAACCTTTGGGAGACTCCAAGACAGCATGCTCCGAGGTCGGCGG 

GGGTCTGGGTGGCCATGGAGGAGCCCCCTGTGCGAGAAGAGGAAGAGGAGGAGGGAGAGG 

AGGACNAGGAGAGGGACGAGGTTGGGCCCGAGGGGGCGCTGGGCAAGAGCCCCTTCCAGC 

TGACCGCCGAGGACGTGTATGACATCTCCTACCTGTTGGGCCGNGAGCTTATGGCCCTGG 

GCAGCGACCCCCGGGTGACGCAGCTGCAGTTCAAAGTCGTCCGCGTCCTGGAGATGCTGG 

AGGCGCTGGTGAATGAGGGCAGCCTGGCGCTGGAGGAGCTGAAGATGGATAGGGACCACC 

TNANGAAGGAGGTGGAGGGGCTGCGGAGACAGAGCCCTCCGGCCAGCGGGGAGGTTGAAC 

CTGGGCCCAAACAAAATGGTGGTT 

Sequence 1855 

TGTCTGCCGAACAACTGAAAACTATGCAGGTTATATCATTCCACCAGCACCACCAAGACC 

TGATTTTTTGATGCTTCAAGGGGAGAAAACTACATGAAGCTTGGNGAAGNGAGAAGGGTC 

AAGTG ACCGA ANNGCAAGNAATTTCACAAANAGAATGAAACNAGGCAACTGGAAAGCTCG 

AAATAhnTTTGGCAATTATTTCAGAGGANAGCACCAGATTTGCCGTATTGCATNGGAANT 

TGNTACCCNGTAGTTCCNTGTTGGANATGTCAACANTCCCCTANTNTTTTGNACGAAACT 

AAGGMTTNTAGAAATTTTTGCAATNGTTCCTTTACTTGGGGGAAAATTATTNNAATTCC 

AAAGGAATCTTTNAGAGNTTGNNTNGCCGTAANGGCAAGAGAGAGNCNATNNTNGAAACT 

NAGGAAGCGGAAACCTTTTGNAAANMCTTTTCI I I I M A 

Sequence 1856 

GAGTCGACCNCGCGTCCGGCGGCGAGCGGGACTGGCCATTGGAGTGCTCCGCTGCGGAGG 

GAGGGGACCCCGACTCGAGTAAGTTTGCGAGAGCACTACGCAGTCAGTCGGGGGCAGCAG 

CAAGATGCGAAGCGAGCCGTACAGATCCCGGGCTCTCCGAACGCAACTTCGCCCTGCTTG 

AGCGAGGCTGCGGTTTCCGAGGCCCTCTCCAGCCAAGGAAAAGCTACACAAAAAGCCTGG 

ATCACTCATNGAACCACCCCTGAAGCCAGTGAAGGCTCTCTCGCCTCGCCCTCTAGCGTT 

CGTNTTGGAGTAGCGCCACCCCGGCTTCCTGGGGACACAGGTTTGGCACCATGGGGCCCA 

CCAGCGNCCCGCTGGGCAAGGCCCACCGCAGGCTCGGTCTNTGACTACGNCAACTATGAT • 

ATTCATTCGTCCGGCATTACAACTACACGGGGAAAGCTGAATATCAG 

Sequence 1857 

CGTACGCCGAGCGCCGCTCCGGCTGCACCGCGCTCGCTCCGAGTTTCAGGCTCGTGCTAA 

GCTAGCGCCGTCGTCGTCTCCCTTCAGTCGCCATCATGATTATCTACCGGGACCTCATCA 

GCCACGATGAGATGTTCTCCGACATCTACAAGATCCGGGAGATCGCGGACGGGTTGTGCC 

TGGAGGTGGAGGGGAAGATGGTCAGTAGGACAGAAAGGTAACATTGATTGACTCGCTTCA 

ATTGGGNGGGAAATGCCTCCGCTGAAAGGCCCCCGAGGGCGAAAGGGTACCCGAAAGNCA 

CCANGTAATCACTTGG GTGT TCCGMTATTGGTTNATTGAAANCCATTCAACCCTTGCNA 

GGGMAMCAAAGGTTTTTTCNAACAAAAAAAGAAAAGCCCTTACAANGGAAAAGGTTAC 

CATTCANANMGAATTTAACATTTGGAAMTTTCAAATTCTAAATAGGGGGGAAAAACCT 

TTTGGAAAGGAAACCAGGANGTACCCCAGGANAAAGGAAGGTAAAAAA 

Sequence 1858 

GNGTCGACCCCGCGTCCGGTGGAGGGTCAGGAGCTGCCCCGGATCCTCTCCATGTAGTTG 

CGAAGCTCCTCAGGGTCCTTCAGCCCCATGTCCTCACACACCCAGCGGATGTCCTCCTCG 

CCTGCCACAAGGATGGACTGCACAGCAGGGGCCCCTACAGGCTCCTCAGGTGACTGGGCT 

GGAGGGGCTGGCGCAAATGTCACAAACTCTACTCGCTTCCGCCGCCCCCCAGCCTCCTTT 

CGGGCCAGGGTGCTTGAGGAGCTGGTGGTGCCCCCAGGAGGGCCAGGGGCCAGGGTAGGG 

GCCTCCCCTCCTCCCCCACTCTCACAGGGGCAGCTCCCCTCCCCCTTGGGCGGGCCAGGG 

GACTGCCGGTCCAGCTGGCGGCTCAGTTCCTCCTGGTCAGTGCCCAGCCAGACCCAGTTG 

TGGGGCTGGGGGAGGTTGGGG 

Sequence 1859 

GCGCCCCGCGTCCGGCGGCGGCGGCGGCGGCAGCGGCAGCCAGAGGACTCCCAGCGGCTG 
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GAGCAGAAGTGTTAGCCGGCCAGAGCTCCCAGACCCCTACCCACAGCCAGGCGGGACGCG 

CACAGTCCCTCCACGCGGAAAGAAGTACCTTCGCCGGTCACCGGCTCCTGCAGGGTTGCA 

NATATATACAGAGCTTCATAATCAGCCCAAGACCACATAGAGCAAACATGAATGATATTT 

CCCAAAAGGCTGAGATTAAAGAAATGCTTGCTTCTGATGATGAGGANGATGTATCTTCTA 

AAGTNGAAAAGGCTTATGTTCCAAAA 

Sequence 1860 

CGACCCCGCGTCCGACCCACTGAAGACGTCTGCGTGAGAATAGAGACCACCGAGGCCGAC 

TCGCGGGCCGTTGCACCCACCGCCAAGGACAAAAGGAGCCCAGCGCTACTAGCTGCACCC 

GATTCCTCCCANTGCTTANCATGAAGAAGGCCGAAATGGGACGATTCAGTATTTCCCCGG 

ATGAAGACAGCAGCAGCNTACAGTTTCCAACAGCGACTTCAACTACTTCCTACCCTCACC 

AAGNCAAGCTTGCTCTGAAAAGCCCANTTATGCCANAATGTAGGATCCTGAAAACCAAGA 

AACI I I I I I ACTTG AAATT 

Sequence 1861 

GCGTACGGCCTGTTGGGCTGTCTGGGGGGTGGCCATTTAGGGATCGTGGGGACGGGGTCC 

ACCCCANNCAAGAAAGAACAGGCCCGTCCACAGGCCCGGCTCTGGGCCACAGTGCCCCGG 

AAGCAGGTGTGTCCAGAGTCANGCTTGAATGGCTCTCCCCACAACCACCCAGCNAGGCGC 

TGGTGCNTCCTTCTGCCTCATGGGACCAGTCCAGCTTNCAGCCGCTCTGGGCTCGAGGGT 

NGGtACTGACCACTTTCCTTCTTGAGNTGGGAGCATTCTCTTGGGGGAGNCTCTTCCAGT 

G GGCA CCTGCCTGGGACNCTTGCCCACCGGTTTTCTTGTAAAAATCAGGAATACCGGTGG 

CTTTTAGTAAAAGGCAAGACCANAGNCGCCTTNCGTTGGGCAGGGGAAAAGCCAAGCGTG 

CCGGNGGGNAAGGTCACTGGAAAAAGGTGGCTTGCCCTAAGGGGGAAGTTTGGGAAAATA 

GTCCCCCTGTTCCAAGMNTGCCTTTGMTTTTTAAAAACA1TTTTGGCT 

Sequence 1862 

CAATNTACAACGCCATGTNCACCCANATGTTCCAGACTAAGCGCTGNTTTCGACTGGCCC 

CCACCTTNAGCAACCTGCTCCTGCAGCCNACCACCAACCCTCATACCTCGGNCAGCCACA 

GGCCTTGCGGTCAATGGGGATGTAGACAAGCCTTCAGAGCCAGCCTCTGAGGAGGGCTCT 

GAGTNGGAGGGGAGTGAGTCCAGTGGACGCTCCTGTCNGAATGAGCGCAGCATCCANGAG 

AAGCTTNAGGTCCTGATGGCCNAAGGNNTGCTNCCTTGCTGTGAAAGTCTTNCTGGACTG 

GCTTCNGACCAAC 

Sequence 1863 

NGGAGTCGACCCACGCGTCCGGCCGCCAGAACACAGGTGTCGTGAAAACTACCCCTAAAA 

GCCAAAATGGGAAAGGAAAAGACTCATATCAACATTGTTCGTCATTGGACACGTAGATTC 

GGGCAAGTCCACCACTACTGGCCATCTGATCTATAAATGCGGTGGCATCCGACAAAAGAA 

CCATTGAAAAATTTGAGAAGGAGGCTGCTGANATGGGAAAGGGCTCCTTCAAGNTATGCC 

TGGGTCTTGGATAAACTTGAAAGCNTGAGCCGTGAAACCGTTGGGTATCACCATCTGGAT 

ATTCTTCCTTTGTGGGGAMTTTTGGAGNACCAGGCMGTTACCTATTGGTGGACTTATT 

CATTTGGATGGCCNCCMGGGACCACCAGGAGGACCTTTTATCAAAAAAAACANTGATTA 

CAGGNGGACATC CTNNA GGCCTGGACTGGTGGCTGGTCCTGNATTGGTTGCTGGCTGGGT 

GGTTTGGATGGAATTTTGAAGGCCTGGGNTATNCTTCCCAAAAGAAAT 

Sequence 1864 

GCGTCCGATGGCGTGNTGTCTCACAGAAAGTTCTCCGCTCCCAGACATGGGTCCCTCGGC 

TTCCTGCCTCGGAAGCGCAGCAGCAGGCATCGTGGGAAGGTGAAGAGCTTCCCTAAGGAT 

GACCCATCCAAGCCGGTNCACCTCACAAGCCTTCCTGGGATACAAGGCTGGCATGACTCA 

CATCGTGCGGGAAGTCGACAGGCCGGGATCCAAGGTGAACAAGAAGGAGGTGGTGGAGGC 

CTGTGACCATTTGTAAGAGACACCACCCATGGTTGGGTTGTGNGGCAATTGGTGGGCCTA 

CCGTTGGGAMACCCCTCGAGGCNCTCCGGTACCTTCAAGACTTGTCTTTTGCTGGAGCA 

CAATCAGTTGMTGAMTGGCMGAAGGGCGGTTTNTTATTAAAGTAATTTGGCCATTAA 

AAATTCTAANGAAAGGAAAGGGCCTTTACCAAAGTTACCTGCAAGGAAAATGG 

Sequence 1865 

CCGGCCGGGCTGGGTCCCAGCACNTGACCCCAGCTGCACTGCTGTACAGGGCTGCNCGTC 
ACCCGAAGTCAGAAACGTGGCATCTCATCGGAAGAGGAGGAAGGAGAGGTAGACAGTGAA 
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GTAGAGCTGACATCNAGCCAGAGGTTGGCCTCAGAGCCTGAACATGCGCCAGTCACTATC 

TACCTTCAGCTCAGNAGAATCCATCAGATGGGGAGGAAGGCACAGCTAGTGAACCCTTCC 

CCCAGTGGCCACACCTGGAAGTTGGCTAGCACCAACACNTGATGAGTCCGGGCNCAGNAT 

GAAGCNGNGTTCTGATGAACATGTTGACTCCCAAGGGGCTCAAGAAAATCCCCACNTGGG 

ACNCCACTCTCCTTCAGGAGGGTCATACCCTGGGCCCTGGAACCCAGCTCCCTGGCCCA 

Sequence 1866 

TTGCGCCCCCGCCGGTGAGCGCGGGGAGCGCCGCAAGCCCAACGCCGGGGGCAGCCCCGN 

TCCGGTGCGCCGCCGGCCGGAGGCCTCGCCGGTGCAGAAAAAGGAGAAGAAGGACAAGGA 

GCGGGAAAACGAGAAGGAGAAGAGTGCCCTAGCCCGGGAGCGCAGCCTCAAGAAGCGCCA 

GTCGCTGCCCGCCTCCCCACGTGCCCGCCTCTCTGCCAGCACCGCCTCTGAGCTCAGCCC 

CAAATCCAAGGCCAGGCCATCCTCTCCCTCCACATCCTGGCACAGGCCTGCCTCCCCCTG 

CCCCAGCCCAGGGCCAGGCCACACTCTGCCTCCAAAGCCACCGTCCCCCCGAGGCACCAC 

TGCATCCCCCAAGGGGCGGGTTCGGAGGAAGGAGGAGGCAAAGGAGAGCCCCAGCGCCGC 

AGGGCCCGAGGACAAGAGCCAGAGCAAGCGCAGGGCCAGTAACGAGAAGGAGTCAGCAGG 

CCCAGCCTTACCGGCACCTTTGGCGGNGCCTTGGCCAACCCAAGCCCGGCCCAAAAGGAG 

CAAGCCCCCGNGGAGACCCCTTACAGACCTGGTTTCTTTGACTTAACCCCAAGCCCTTGT 

TCCCCGGTGGACCCTAGCAAANCAATGGGCCGGNACCACAAGANCGAGAAGAAGCCCCTT 

GGNTNTTGGCTTGAAAGCGGCGCCAGGNCCCGGGAACAACCGGAAGCCCCANGAGCAAGA 

ACCGANGCTTTANGCAAAAAAGGACAAAGCCAATTGCAAAAGGACAACTTGCACCGGAAG 

GCCCAGGC 

Sequence 1867 

CCCCGCGTCCGTTTMGAGTTGCATATTTTACTTTATTTTTATTAAATTAAA^ 

TCTGGCAGCGATTCCAGAACAGGGTAAGGAGGTTCCTCACAGGGGTCAGAGAAGAGCGGA 

GAAAGACAGACTGACGGAGACTGAGACACAGGAGAGAAAGGACAAGGTTAAGGGAGAACT 

GTATCTGATGAACACACACAGCCGGCTCCATGGCGGGTGACGGGGAGCTCACATCAGCCC 

AATTTCTCCTCCCCGGCACCCGAAGTTCAGCGGTGGAGCAGTATGTGGGGGCGGTTAGGA 

ATCAAGAGACCCTCCCTTCCCCACCCTAGGTCCTTTCTCGGCTTGGTCGTGGAGCACAGC 

ACATACCAGAAAAAGCCAAGGGCAATGGAGGGGCAGGGAAACCGGGAGTATATGTACACG 

GGGAGGGGAGAACAGAGCCTTGGAGGTCGGCCTCTGCCAGAAGGGAAGTGGCTCACACTT 

GCATTTGNAACACTTGGCCAGTGGGGGATGGGGGAMGGAATTGCCCCTTCCTTTTGG 

Sequence 1868 

CCCNCGCGTCCGCACACCCTTCCTGTACTCAGTCCTCAGTTTGCCTGGTGAGAGAGCAGC 

CTCCTCCCGTGTGCTCTGCCAGCTGGACCCAGACTGGCCATATTACCAGTGAGACCAAAA 

AGATGGAGGTGGGGAGGTAGCTCTGAGGTCTGGGAAACCATTCCAGCTCCTGCCAGTTTT 

MCTTGTGTTTMTTCCTGGCACAGTTGTCCTGGAMTGCCTFnTCTCTTGCCTGGGAA 

CCACTAGAAGGGGATGTTGTCTGTGTTGGCCAGGGCCATGCAMTTCAACATCTTGTTTC 

TGCCCTTCCCCCGTGTAGCTGAGGCTAGGTGTTGGCATTACCCAGTGCTTGTTCTTCAGA 

GAGCAAAAGCACTGCTCGTCATGTCTGAAATTTAGTGAGTGAGCTCACCCACTAGGCTGG 

TGTTTCCTGCCCGTGGCTGCACATTGGAAGCACCGGGGCACTTTGAGAACTACAGATGCC 

TGGGTCCCAGAGCATCTAAGGTGCTCTAGGGTGTGTCCAGGACACAAGCCCTGGTTGAGG 

ACCACTGCTATATTGTATGGCCTCTTTTAAAMAGTTMTTTTACTTGGAAATGATTTCA 

AAGCTACAGMAAGTTGCAAGAATAAAMCTGTACAMTGAGGCTTAAATATTCTTTGGC 

CAAATCACCTATTAACATTTCGTTCCAAAAAAAAAAAAAA 

Sequence 1869 

GGGCAGCGCCTCCGACATGAAGGCTGAGCTGTCGCAACTTATTAGCGACCTGGGCGAGCT 

CAGCTTCGGCAACGACGTGCGCACCCTGCAGGCCGACTTGCGGGTGACGCGCCTGCTGTC 

AGGCGACAGCACGGGCAGCGAGAGCTCCATCGAGGGCGGGGGCCCTGACGCCACCTCCGC 

CACCGCCGGGGACTCGTCCCGCCAGGCCGACGGCGCCAGTGCAGACGAGCCCCACTCGGG 

CTGAGCTCCTCCGCGCGTCGCCGGCGCTCCACCGTGGCTACCCATCCGTGGTCCCGACAA 

CCTCCCTGTCCCTTGCCCGCCCCCAGGAAGGGGGAAATGGGGCATTTGGGGCCCAGACCT 

ACACTTGGAGCCCAGGTCCAAGCGTTCCCCGACCGCTTTCCCTACTNCCGGNCCCCGCTC 
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CCGCCCCMGAACTTTTGGCTTTTTTGCGCGTGGGGATGCGGGGAGATTTGAGANGGGNA 

AACCCCCGCCAGGAAGGAAGAGAAGAGGCACCCCTTTGGGAATGCCGGTGAAGGGAAGGT 

TGGCTGAAGTTCCTMGTTTMGGCCTMGGTGCCTTGGNCAGGTTTCCTGTTTGTGGGG 

GAMCTTGGGNCTTGAGGAGGANGGGGTTAMTTTCTTCTTCCAACCCCTGGGAAGCGGG 

CCTTGCCTTGGANNTGAAATTTAANTNAAAAAAAAAAAAA 

Sequence 1870 

TCTTCATCATGGGTGCCAGCATCCACCTGGTGGGTGACTCTGTCAACCACCGCCTGCTCT 

TCAGTGGCTACNAGCACCACCTGTCTGTCCGTGAGAACCCCATCATCAAGAATCTCAAGC 

CGGAGACGCTGATCGACTCCTTTGAGCTGCTCTACTATTATGATGAGTACCTGGGTCACT 

GCATGTGGTACATCCCCTTCTTCCTCATCCTCTTCATGTACTTCAGCGGCTGCTTTACTG 

CCTCTAAAGCTGAGAGCTTGATTCCAGGGCCTGCCCTGCTCCTGGTGGCACCCAGTGGCC 

TGTACTACTGGTACCTGGTCACCGAGGGCCAGATCTTCATCCTCTTCATCTTCACCTTCT 

TCGCCATGCTGGCCCTCGTCCTGCACCAGAAGCGCAAGCGCCTCTTCCTGGACAGCAACG 

GCCTCTTCCTCTTCTCCTCCTTCGCACTGACCCTCTTGCTTGTGGCGCTCTGGGTCGCCT 

GGCTGTGGAATGACCCTGTTCTCAAGAAGAAAGTACCCGGGTGTCATCTACGTCCTGAGC 

CCTG 

Sequence 1871 

CCGCGTCCGGTTTGTTCGTTATAGGTTTAATAAGTCTATTGAGGAAGACCTACTCCTGTG 

TGAATCTTTGCAMGTMTGCTACCGGTGMGAAATATTCMCTGNATCAACAGTTTTAT 

GCAGAAACATGAMTTGAATGGGAAAAATGTGTTGATGTTTGTAGTGATGCTTCTAGGGC 

AGTGGATGGGAAAATTGCCGAAGCTGTCACCTTAATAAAATATGTGGCTCCCGAAAGCAC 

CAGTAGTCACTGCCTATTATACAGACATGCACTGGCAGTTAAAATAATGCCTACATCTCT 

AAAAAATGTGCTAGACCAGGCAGTACAAATCATCAATTATATTAAAGCTCGACCACATCA 

ATCCAGACTATTAAAMTTTTATGTGAGGAAATGGGTGCTCAGCACACAGCACTTCTTCT 

AAATACAG AGGT G AGGTG GCTTTCTCGAGGTAMGTTCTTGTMGACTTTTTGAACTTCG 

TCGTGMCTTTTGGTTTTCATGGATTCTGGCTTTTCGACTATCTTGATTGGTTAACAAAA 

TTCATCTTGGCTGCTMGACTTGCATATCTTGCAGATATTTTTACTAAATTAAAATGAAA 

GTTMTTTGTCAATGCCMGGNAAAAATGTGACCCGTTTTACNAGTATTTGATAAAATGT 

CGTCATTGGTAAGAAAATTGGAAT 

Sequence 1872 , 

TCGACCCACGCGTCCGCGGACGCGTGGGTGTGACTGGCACCCAATGCCATGCCTTTATGG 

TCAC TTGGTAGTATAMGGCATGGCATTTGTTGTGACTGGCACCCAATGTTTGATTTTTT 

TTTTAAAACTATCCAATTAAAATTAAGGTCTGGGAGTGTTCTGTTTCCCATTCTTTAATA 

CTCACCTCCTCCCAGACTTTCTACACCTGTTGCACCTCAGGCAGAGGATGTTCTGGACCT 

CCCCCTCTTGGTCCCTACTAGAGACCTCTCAACAGATCTGTGGGCCCAGTCATTGGGTTT 

TATCAGTGCTTMTGTGMCTAAGTTTTTTACTTCCACAGAATACAAGCCACTACCTTCT 

GACCTCCCCACCCCCCACCMCCCCCATCTTTTAATATGCTGTGGGGCATAGAACTCCGG 

MTGACCAGCATGATATTTTCAGAGTCTTGTCCCCGGGGTATTAGCACCTCTTTTTGAAC 

AGGGAATTGATTCAAGATTGGACATGGTCTCCTCTGATTATCAGGTACTGGGGCTGAGGG 

CATTAAAAATAGTAAGCCTCCCTCTCGTCCCCTGCCTCAAGAAATTGCCTCTTATTTATC 

AACATC I I 1 1 ICCTC 

Sequence 1873 

CCNCGCGTCCGGTGTTCCTCCCTGAGAATTAGTGGTCAGACATTGCAGAGGGCATCAGAA 

GCGGGTAGATGAAATAGCGAAAGGAAACAGGCTAGCAGACCAAAGAGCTAAGTCAGCAGT 

AAGAAGGCCCCAAGGTCCCAAAACACTTGAGGCCCCTCTGATTTGGGAGGGCTACATAAG 

GGAAATAAAGCCTCAGTATTCCCCTACAGAGATAGAATGGGCCACCTCTCGAGGTATACT 

TTTCAACCCTCAGGATGGTTACAATCAGAAGATTGCAAAGTACGCTTGCCAGCCTCCAGC 

CMTGGMGATTCTTAAAATCCTCCACTGAGCCTTTCACTTAGGAAAGCATAAGACATCA 

GTGCATCCAAAGATTGTTCTCAGGAGAAAATCTACTAAAAATGGTCAAATAGGTTGTTAA 

TACTCGTGAAACCCTCTTAAAAATAATCCCTTTAACAGATGACTTCTTCCCCACCACAAT 

CAAAGGA 
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Sequence 1874 

ACGCGTCCGATGACCAGGCTGCCCCGCCTCCTGGTTCTGCCCAAGTTCTCCCTGGAGACT 

GAAGTCGACCTCAGGAAGCCCCTAGAGAACCTGGGAATGACCGACATGTTCAGACAGTTT 

CAGGCTGACTTCACGAGTCTTTCAGACCAAGAGCCTCTCCACGTCGCGCAGGCGCTGCAG 

AAAGTGAAGATCGAGGTGAACGAGAGTGGCACGGTGGCCTCCTCATCCACAGCTGTCATA 

GTCTCAGCCCGCATGGCCCCCGAGGAGATCATCATGGACAGACCCTTCCTCTTTGTGGTC 

CGGCACAACCCCACAGGTGAGCCTGGAACCCATCACGTTCCACATCCTCCCACCCATTCT 

TTCTCTCAGGAACTAGTCCCGACAGATGCAGACATCCCTCTATCCCTGAGAGGGCTCTGG 

GCAGGGAACCCATAACCCTACCCTGCTTCCTGTCCCAAGAGGAGGC 

Sequence 1875 

AGTCGACCCACGCGTCCGCCCACGCGTCCGCCTTCTTCTGGGCACTGACTGCCCTTCTGG 

TCGCTTCAGCTGCTGCCTTCCAGGGTCTTCTGCTGCTGTTGCCGCCACCACCATCTGTAC 

CCACAGGGGAGTTAGGATCAGGCCTCCAGGTGGGAGCCCCAGGAGCAGAGGAAGAGGTGG 

AAGAGTCCTCACCACTGCAAGAGCCACCAAGCCAGGCAGCAGGCACCACCCCTGGTCCAG 

ACCCTAAGGCCTATCAGCTTCTATCAGCCCGCAGTGCCTGCCTGCTGGGCCTGTTGGCCG 

CCACCAACGCGCTGACCAATGGCGTGCTGCCTGCCGTGCAGAGCTTTTCCTGCTTACCCT 

ACGGGCGTCTGGCCTACCACCTGGCTGTGGTGCTGGGCAGTGCTGCCAATCCCCTGGCCT 

GCTTCCTGGCCATGGGTGTGCTGTGCAGGTACACAAGGACCCCCAGCCCCTGTGCGGGTG 

GAACTCA 

Sequence 1876 

TCGACCNCGCGTCCGGTCTTCGCAGGTGGCCCTCGGGCCCGCAGCCGCTGGGTAAGGGTG 

ATGCCTAGCCTGGCTTATTGCACCTTCCTTTTGGCGGTTGGCTTGGNGCGAATCTTCATC 

TTAGCACATTTCCCTCACCAGGTGCTGGCTGGCCTAATAACTGGCGCTGTCCTGGGCTGG 

CTGATGACTCCCCGAGTGCCTATGGAGCGGGAGCTAAGCTTCTATGGGTTGACTGCACTG 

GCCCTCATGCTAGGCACCAGCCTCATCTATTGGACCCTCTTTACACTGGGCCTGGATCTT 

TCTTGGTCCATCAGCCTAGCCTTCAAGTGGTGTGAGCGGCCTGAGTGGATACACGTGGAT 

AGCCGGCCCTTTGCCTCCCTGAGCCGTGACTCAGGGGCTGCCCTGGGCCTGGGCATTGCC 

TTGCACTCTCCCTGCTATGCCCAGGTGCCGTCGGGCACAGCTGGGGAA 

Sequence 1877 

ACCCCGCGTCGGCCCTTAAGAGACAATGATTGAGAAAGAGCCATGTGGCTTGGCTCTAGA 

MCGTCATTATCATTAGGACCATCAGATTTTAGATTAAGCTGCTATTGAATTAATAAAAT 

CCCAATGAAGCAGAGTTATAGGGATAGATTTATAGCTGGCAGAGTGGTATCAAAGGAGAA 

MACAGT GAAA AAGCCMTTTCACTGGTTCGTTCMTCCAGCTTGTTGCTAATATTAGTT 

ACCCTTGTTTTAATGACAGAGAGTGGCTGGAATCTGTAGCTAGGGGAGGGGCAACACTGT 

TAGATGTGAGGAAAGGAAGTGCCAAAAATGCCTGGACAGATGGCTTGTCCCAAGGCCAGG 

ACACACACTTTAAAATCCAACATTCACCTAAGCAAGTAATTCTTAAAGATCTTACAGAAA 

CGCAGAGTCAATTCAGGTTTATAAAGGAAGGCTTNAGGGGAGAGAGGAAGGCCTGGGGGG 

CCTGGACGAAAGAGGCCTAGGACCTGAAGAGACTCCAGCGAGTCTTCGGGAAGC 

Sequence 1878 

AGTCGACCACGCGTCCGCAGCATCGTCCGAGACTTCCCAGACTCCCGGCCAGCCATCGAG 

GACCTCAAGTACTGCCTGGAGAGGACGGACCAGAGGCAGCAGCTGCTCGTGTCCCTCAAG 

GCTGCCCTGGAGACTCGGCTCCTGCATCCAGGCGTCAACACGTGTGACATCATCACCCTC 

TATATCTCTGCCATCAAGGCGCTGCGCGTGCTGGACCCTTCCATGGTCATCCTGGAGGTG 

GCCTGTGAGCCTATCCGCCGCTACCTGAGGACGCGGGAGGACACAGTGCGGCAGATTGTG 

GCTGGGCTGACGGGGGACTCGGACGGGACAGGGGACTGGGCTGTTGAAGCTGTCCAAGAC 

CGACCCGGCGAGCCTGGAGACAGGCCAGGACAGTGAGGGATGACTCAGGCGAGCCAGAGG 

ACCTGGGGTCCCGGACCCTGTGGATGCCCGATCAGGGGAAAGTCGAGCTCCAGCC 

Sequence 1879 

GTCCGCACAATTGAAMCTGGGAAAAATAATCTCGTGGTTCGGTTTGTTAGTTTGAGACG 

CAGTCTCACTCTGTCGCCCAGGCTAGAGTACGGTGGCCCAATCTCAGCTCACTGCAACCT 

CCAACCTCCCAGGTTCAAGCAATTCTCTTGCCTCAGCCTCCCGGGGTAACTGGGGATTAC 
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TGTTGGCCAGGTTGGTCTTGAACTCCTGACCTCAAGTGATCCACCCACCTCGGCCTCCCA 

MGTGCTGGGATTACAGGTGTGAGCCACCGTATCTGGCTCTCATATTTTTTATATACAGA 

TTGMTATCCCTAATTTGAAMCCTGAMTCTGAMTCTTCCAAAATCCAAMCTTTTTG 

AGCAGTTACATGATATTTGAATGAACTGCTCATAAGAA 

Sequence 1880 

GAGGAAACCAAAGTGCTCTGTATCCTCCAGTCTCCGCGCCTNCACCCAGCTCAGGAACCC 

GCGAACCCTCTCTTGACCACTATGAGCCTCCCGTCCAGCCGCGCGGCCCGTGNNCCGGGT 

CCTTCGGGCTCCTTGTTGCGCGCTGNTCGCGCTGCTGCTCCTGCTTGACGCCGCCGGGGG 

CCCCTCGCCAGCTGCTGGGTCCTGTCTCTGCTGTGCTGACAGAGCTGCGNTGCACNTTGT 

TMCGCGTTTACGCTGAGAGTAAAACCCCAAAACCGATTGGTAAACTGCAGGTTGTTTCC 

CCGCAGGCCTCGCAGTGCTCCAAGGTGGGAAAGATGGTANGCTCTCCCTTGAAAGAACCG 

GGTAAGCANAGTTTATGTCTGNNACCCCGGGAANGCCCCCTTTTTTCTTAAAGGAAAAGG 

TGCATCCCAANNAAAAATTTTGGGACCAGNTGGGGAAAACCAAGGAAAAAAA 

Sequence 1881 

GCGTCCGCCCTGGCTCCTCCAGCAAGACCTCGTCTTTGCTTGTCCTGCTCAGATGCTGGT 

CATCCTGGGCATGTCCCCAGTGTGGACTCTGGACTGGGAAGGGGGCAGGCCCCTTTGGAC 

CTGCAGTTGGCCTCAGCAGAAGGCCTTGCCTTGTGTATGTGACTCCATATCCCGGGAGCA 

GTTGACCTTTGCCAAACACTTTACAGTTCTGGAGGAGGAGGTAACATAGATGCCTGGGCC 

TGATGGTGGGGCCATACCCATGTGTCGCCTCTCACTCTGGCAGCCTCAGAGGCCCCTTGC 

TGCTGGCTCCCATCTCCCTCCCATTTGCAGACCAGGAAGGAAGAGCAAGCTGTACAAAGG 

GAAGCAGAGCCTGGGGTGGGGTGTGAGCAGGGTGACCCCTCATCTGAAAGGCCCAAACCA 

GGGGGGAAGCACCAGCCTTAGTGCAGCCCCCTCTGACCCCACCTTAGAATGGAAAGCCTT 

CACCTGCAGCCCAGGCCTTCCTCCCCGG 

Sequence 1882 

AGTCGCCCCGCGTCCGGTGATTCCAGGGTGCAGAAGGGATTCATATTCCCAGAACGCTTT 

AAGTGTACACCTGCAGGATAMGAGATACCGGTTACATTATTAAATGATTCTAGGGATTC 

ACTGGGGGAT ATTTTTGTTGCTTTTACTTTCATGGTTAGAGCTACAAAGMCMGTGATT 

II I I I I I I I I I CTCCCTTCCCCATTCAGAAACATTATACATTGGGCCA I I I I I CTTTCTC 

CCAAAGAAGATTCATGGATAGTCAGACTGAACTGTGTGCAACAGGAAAAGTCAAAAGGGA 

AAAGGCAGCTGATGAGGTTCATGGTTACATGTTCTACATCATGCAGAGTAGCTTGAAATC 

TAGTCTGGAGAAAACTGGATCAAGATTCTAGCCCACTGGAGTTGCAAGGAATGAGAGGCA 

AAAATTCTAAAGATTTGGGTTATATTTTCAACTTGGGGGACAGAGAGAAATGGAGiAGCAG 

GAATTACAGTTCCAACAAACATCATGATAGTC 

Sequence 1883 

CCACGCGTCCGACTAGTTCTAGATCGCGAGCGGCGCCCI II I I I INI II I I NNCACGCTT 

AATTCACTTTATTTTTCTTGNATAAAAACCCTATGTTGTAGNCACAGNTGGGGCCTGAGT 

CCGNTGCACGGAGACTCTGGTGTGGGTCTTGACGAGGTGGTCAAGAGNAACTCCTNGATA 

GGGAGACTTGGGTGAATACANTNTCCTTCCANAGGTCCGGGGNGTTCATGGTATGCTGTA 

NGGTCCTTAAAAAAATGGG 

Sequence 1884 

GTCCGAAAAAATGATAATGTGCATAAAATTCAACCCAGCTTTCAAAGTCCAGTCAAAATA 

TCAGAAATCATGAGGTCCAATGGATTTTGTTTAGCAAATACCGAAACAATAGtTATTGAC 

CACAGTATACCAAATGGAAGAGACCAGCACCTGGGCGTGGACCCAACAGAGCATTTATTT 

GAGAATGGCAGTGAGTTTCCCTCAGAGCTGGAGGACGGGGACGACCCAGCAGCCTACGTC 

ACCAACCTGTCATATTACCACCTGGTCCCCTTCGAGACAGACATTTGGGACTGAACCTCT 

CTATCAGGCCTCCCCCGCCTCAAGCTGTTCACTGCC 

Sequence 1885 

CGCGTCCGCACAACAAGACACTCCAATTGTGATTTGAGTTGAGGATCTCTGCCTGCCTTC 

CTGCCGTCCTTCCTTCTTCCCCGATCCATGCTACTTTTAGGGGCTGCGGAGAGCAGCAGC 

AGAGCTGAGTAATGATACAGGGCACCACGGAGAGAAAGTAGAACCATTTCACTCCTGGGA 
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Agatggggtatttcccacttccagcaacgaaataacaaatgaaaagttgcatacttattg 

atgtattgtatgagccagtagcattttatgtacaaaacagaagtcaatgcaacagtatgt 

atgtgtgcctgtgtgtgtataaaaataaccattgaagctaacttgctaatgtacttaggc 

aagccacttcccatctctgggcctcgtctttcctccctctaaaatcaaagagctgaatta 

tgtgatccttgaggtctcttccacttataataccaactgtcttgtcagactggcaaatta 

tattggcctctccttatgtggtgggtttttttgggagggncatagttncttatacacagg 

acacctgcatnatcnaagggc1 i i i i i i i ctaaaaaaaaaaaaaatg 

Sequence 1886 

CGTCCGCTCCTGAGTAGCTGGGATGACAGGCGTGCACCTGGCAGCTTTTTCAAAGTGTTG 

ATGGTAATCTGAGGCMTCTAAGGGAGTCATTTTTTAAGTGACTTTATACAGAAAGATTG 

GTAAGAGCCAAGGGGTAGAAGTGGCATAAATGTCTAAAGCAGGGAAGTGACAGGGACTTT 

CATTGTTCTTGGCTGAGGAGAAGCGGGAGTGGCTGATGGAAGCACCTAAATGATGCCTTT 

GTCTGTGGGAAGGCAAATGATGCCCCAGAGCTCTAACCAAAGGTTTTGCAGCCGCCGAAA 

AACAGGAAAGTTGGGAAGCGGGGGTAGGACTACACTGAATCATTAACAGTGCTGTAAACT 

ACCCATGTGGCCATTMCAATGGACCTTTGGGGGAGTTTTCCTAAACGATCACTCTGGA 

Sequence 1887 

CGTCAAACACCCGCAGGCTGTCGATTTCATCATCAAAGAAATCAAGACGATAAGGGCAGC 

TCACTCCCCATCGGGAAGAGATCCAGCAACGCGCCGCGCGTGGCGTATTCGCCGTGCTCC 

ATCACCTGGTCAACATGGCGATAACCGGCGCTGTCCAGTTGGGTTCGTAATGCATCTCGT 

GACAGGCGCTGACCTTTTTTCATCACCAGCGCATGACCGTGGAGAAAACTGTGTGGGCAA 

ACGCGCTGCATAAGCGTATTCACCGGAACAATCAGTACGCCACGCTGCATCGTCGGTAGC 

TGGTAAAGGGTGGAAAGGCGCGAGGAGATAATGTCCTGATGAGGCGAAAAACTGTCGTAG 

GGMGAGTTTCCCAGTCCGCCAGATTCATCACCATTTGATCGGTAAACTGGCTGATTTCA 

TCATGCAAACGCAGAGCATTTTGCATATCTGGTGCAATGAGTACCACCGGACCGGCGTGA 

CGTTCGGCA 

Sequence 1888 

C GCGTC CGTTTATTTTTATGCCCCTTTTTGTC^ 

MTTTTAAGTGACTTTCTCCAGGTCATGMGATTCAATGGGTAGAATTGAATCAGAATTG 

AMTGTTCCAGATTCATATTCTTGTGTGTGTTTGATAAAATTCATGGCTTCCAAAGTAAC 

TGAACACTTCCTTTGGGCCCTTGGAGGGAAAATCCATA 1 1 1 M ACTAATTACAC I 1 1 NT 

TTTTAGACATCTGGCAGTTCTTTGMCTTAMCATATTCTCATGGCCATAGTTCCAAATT 

MGCCCG ACGCA GTTGCTAAAAATCTTGCTGCACTGTTGAATACTAATAATGCAACATTT 

ATTGGATGTTTTTGCATTTTGATGACCTTCATGATTCATTTATMGTCTTTGTAAGTGCT 

TMGTGACCCCCTCACTAGTGAAMTMTAAATGTTCTATATCATTTATTATTATTTGTG 

TATTCTCTACATG ATATA I I I 1 I I I 

Sequence 1889 

CCGCGTCCGGGAGGATGGACGTACTGGTGTCTGAGTGCTCCGCGCGGCTGCTGCAGCAGG 

MGMGAGATTAMTCTCTGACTGCTGAMTTGACCCGGTTGAAAAACTGTGGCTGTTTA 

GGAGCTTCTCCAMTTTGGAGCAGTTACAAGAAGAAAATTTAAAATAAAGTATCGACTGA 

ATATTCTTCGAAAGAGTCTTCAGGCAGAAAGGAACAAACCAACTAAAAATATGATTAACA 

TTATTAGCCGCCTACMGAGGTCTTTGGTCATGCMTTAAGGCTGCATATCCAGATTTGG 

AAAATCCTCCTCTGCTAGTGACACCAAGTCAGCAGGCCAAGTTTGGGGACTATCAGTGTA 

ATAGTGCTATGGGTATTTCTCAGATGCTCAAAACCAAGGAACAGAAAGTTAATCCAAGAG 

AAATTGCTGAAAACATTACCAAACACCTCCCAGACAATGAATGTATTGAAAAAGTTGAAA 

TTGCTGGTCCTGGTTTTATTAATGTCCACTTAAGAAAGGA 

Sequence 1890 

CGCCCCGCGTCCGCTAATTATAAGCTTTACMGTATTTATTTTATAAGGCTTAGACAGAA 

TTATTGGAGTTTTAAATTAAGTGTATTGGAAAAGAAAGGATGGTATGTGTATGAAATGTT 

MGATCCTACGCMCACTGCTATTTTTTTCCTTTMTATTTGTGCTGCATAACAAAAGCC 

ACTAGACTGTTACTGTCTTGTCTGTCCATGTGTTAACAGCATTTCTTAATGATGTATATA 

TGGAGTGGTCTTCMTCATAGTGAAGMTTTAMGAGAAAGTCMTTGTATTGGCATTTT 
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TMTMGMCAAMTTAGTTCGTCTMGGGGACTGGCTGGCCACATATTTGTTCCTTGCC 
CATATGCTTTCTACTTCTTGTTCTTATTATGAAATTATGMTTTGAAGCCTCTGAAATGG 
TGATCAGTTTTCAACATCTTTCAAAAACAAAATTACTA 
Sequence 1891 

GCGTCCGCGGCTGCTGTGTGTGAGCAGTGGACACGTGAGGGGGGGGTGGGTGAGAGAGAC 

AGGCAGCTCGGATTCAACTACCTTAGATAATATTTCTGAAAACCTACCAGCCAGAGGGTA 

GGGCACAMGATGGATGTMTGCACTTTGGGAGGCCAAGGCGGGAGGATTGCTTTGAGCC 

C AGGAG TTCAAGACCAGCCTGGGCAACATACCAAGACCCCCGTCTCTTTAAAAATATATA 

TATTTTAMTATACTTAAATATA^ 

AGACCAATTTATGGGAGAATTGCACACAGATGTGAAATGAATGTAATCTAATAGAAGCCT 

MTCA GCCCCCATGTTCTCCACTGAAAAATCCTCTTTTTTGGGGGGTTTTCT^ 

mn GAATTTGCACTGGACGGNGGACCGTCAGCCATGTNCAAGGATCCCCAGGGGGGGG 

GNNGTCAAAATGGCTATTGGAAAATTGGGGTGGAMTGNATGCCTTTTTCACTTTTTGGA 

TAAATAAMCATGTAAAAAAATGNTTTCAAAAAAAATTAATTAAAATTAAATTAAATTCC 

NNAAAAAAAAAAAAAAA 

Sequence 1892 

AGAGGATTCCCAGGGTTTCCAGGGGGCCAAAGGAGACAAAGGTTCAAAGGGTGAGGTGGG 

TTTCCCAGGATTAGCCGGGAGCCCAGGAATTCCTGGATCCAAAGGAGAGCAAGGATTCAT 

GGGTCCTCCGGGGCCCCAGGGACAGCCCGGGGTTACCGGGATCCCCAGGCCATGCCACGG 

AGGGGCCCAAAGGAGACCGCGGACCTCANGGCCAGCCTGGCCTGCCAGGACTTCCGGGAC 

CCATGGGGCCTCCAGGGCTTCCTGGGATTGATGGAGTTAAAAGGTGACAAAGGAAATCCA 

Sequence 1893 

TCCGCCCCGCGTCCGCTTTTTCCNAACAAGGAGCATCCAAAGACACAGTGACTTGAGCTA 

TAGATAGTMAAATCATACGAGAGTTGAACTGAGTCAGGTTTAGGMGCAAGTTTGGTTG 

CATCAATTMGCAGGCTCTTTTCMTTGACTGATGCTGGGGCCTTCAGTTTTATTCTCAG 

TATAGATTGCCAGTATTGTTAAGAGTATCCAAAGGCCTTTCTAGATGGAGACAGAATAAC 

TGACTTGAACATACAGTGTGCCTGTAAGTGTCCAGGCTCAGAGCTGGTGAAAACCCTTCT 

GTTGGGCGTGTGCAGGGTTAAACTCCTGAAGTAACTTGTGAGGACTTCAGTGCTTGCTGG 

TGTCCTGGGCAGCACCATGAATGCCTTTACCAAGACATGCCAAGTTGGATCCCCCGAATG 

AAGCAAGAGTGGCTTGTGGGTGTGACCCTTGCTCCCTGCTACACAGAAGCATCGCAAGGG 

CTGCCTGTGTNGGTTTCCAGATGAAGGGTCTTGGGTCCCGGAAGCTTTGTGGTTGAGAGC 

TCAAGTGGGACC 

Sequence 1894 

GTCACCACGCGTCCGCGGACGCGTGGGCGCACGCCGGCGGCGCGGAGGCCGGCTCTGCGC 

TTCGGGCCGCCCCCTCCCCCCACCCCGCTCACACCCGGCACTTACTTCGGCTGTCTCCGC 

TGCCCTCCAGCGGAGACGCAGCTCCTCAGGCGCCCGGCGGTATTTGTTGGGTCGGCGGCG 

TCAGGGATTCGCAGTGGCCTGTGGTCGGCGTCGTCCGGCCACTGGTGCGCCCCCGCGGCA 

GGCAGAGCTCACGCTCCTGTCCCCCGGCTGGTCCGGGGTCTGGGCGCCGCGTCGACGGCG 

GCTCCGCAGGACGCGCAGACCGGGCCGCAGCCCATGCC 

Sequence 1895 

NCCCCGCGTCCGATTTAAATGCCCTAAATTTTAMTTCATACCTTTCCATGATTCAAAAT 

TCAAAAGATCCCATGGGAGATGGTTGGAAAATCTCCACTTCATCCTCCAAGCCATTCAAG 

T7TCCTTTCCAGAAGCAACTGCTACTGCCTTTCATTCATATGTTCTTCTAAAGATAGTCT 

ACATTTGGAMTGTATGTTAAMGCACGTATTTTTAMAT^ 

CATTGTATGTCTGCTGTGTACTTTGCTATTTTTATTTATTTTAGTGTTTCTTATATAG 

GATGGMTGMTTTGMGTTCCCAGGGCTGAGGATCCATGCCTTCTTTGTTTCTAAGTTA 

TCTTTCCCATAGCTTTTCATTATCTTTC 

Sequence 1896 

CGACCCACGCGTCCGCCCNGCGTCCGAGGGGCAACAGCAGAGCCTACAGCAGGGGGCACA 
CTCCACCGGNTCCAGCCGCCTGCACGACCTCTACTGGCAGGCCATGAAAACCCTGGGAGT 
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CCAGCGCCCCAAGTTGGAGAAGAAGGATGCCAAGGAGATCCCCAGTGCCACCCAGAGCCC 

CATCAGTAAGAAGCGGAAGAAAAAGGGATTCTTGCCAGAGACGAAGAAGCGCAAGAAACG 

CAAGTCAGAGGATGGCACGCCAGCGGAGGATGGCACACCTGCAGCCACCGGCGGGAGCCA 

GCCCCCCAGCATGGGCAGGAAGAAGAGGAACAGGACAAAGGCTAAGGTCCCAGCCCAGGC 

AAACGGGACGCCAACCACCAAGAGTCCAGCCCCTGGCGCCCCCAC 

Sequence 1897 

ATTATA TACTT CTGAATGGCACCTTACTTTTTGGAMCAMTCTTCTGTTATTTACAAAA 

TMTMTTTTTAAAAAACATAAAAAAAAAATCCAAAGCTGCTCTCGATAATAGTCAACAT 

TTGCATATATATGGMTTTCTTACTTTTmCTCCCAMCTCTATTTMTAM 

TMTGTTTGTGTATTTCATGTATAATTGTGATCTCMTTATAAMGTTTAATTCAGCATG 

TCTTTGAGCCAATATAATTACTGCACACCCACTAAATTGGGATCAGCCATTATAAATAAT 

GTAGTTTTAGAATAATAAAACATGACACATATATATATATATAAATATATAGTATATATT 

GG CACATC GGTGAAAGTT TMTA TGTGCAGGMGGTTTTTTTCTTTCTTCM 

TTATTTTTTGCCATATGTMTTTTGGTGTTGCAGGCTGGTCGAGAGGATAAAAAATGGAT 

TTTAAATCTGGGTACCGGATGGATCTTTCNGGNGGTTAAGAAACACAGGGGNTGNGGACC 

TTCCTTTTT 

Sequence 1898 

CCGCGTCCGMTATAGTATTTTTTAATTTTTGTGGGGATGGATTCTCAAATACTTGTGAT 

TTTAAAAGATTCTAAAGCTAAAACACAACTTGATTTTAAAAAGAATGATTCTCCTTACAC 

AATTATAMTATTTGCAGTMATATTTTCCTTATAATACTGTTTTGACCCCATTTAAA^ 

GTATTAGATTATATTCCTTTGATCCAATGAAAACTGAACCTTATAAATGGTTAGCTGAAA 

GTAGACCTTATTCTTGTCCTTCTTTAGAAGAGTAAAGATTTGTCCTAGGGAAGATGGCTG 

ACTTCGGTTCCCAACATGCCGTATGCATTTAGACTGTAGCTCCTCAGCCCTGTGGACACA 

AAATTTGGACAGCTTATTAGGNTACCGTTAGCAATGCTGGACCGGTTTCTTCAACACTAA 

AGANTTTCACCGTTGNAACAGATTTCCTGGTTCGTCTNATGGGGNCTGGTAAAAATGGT 

Sequence 1899 

GCCCTGGAGGCTACTTGTAAATCCTTAGAAGAAAAGCTGGATCTGGTCACGAACAAGCAG 

CACAGCCCCATCCAGGTTCCCATGGTGGCCGGCTCCCCTCTCGGGGCAACCCAGACGTGC 

AACAAAGTGCGATGCGCTGTGCCTGGGCGTCGGCAGAACACCATTGTGGTGAAGGTGCCG 

GGCCAAGAAGACAGNCACCACGAGGACGGGGAGAGCGGCTCGGAGGCCAGCGACTCTGTG 

TCCAGCTGTGGGCAGGCGGGCAGTCAGAGCATNGGGAGCAACGTCACGCTCATCACCCTG 

AACTCGGAAGAGGACTACCCCAATGGCACCTGGCTGGGCGACGAGAACAACCCC 

Sequence 1900 

NCCACGCGTCCGCCCCGCGTCCGGGCCCGGCGCGCCTGCTCTGGGCTCTCCGCGTGCCGC 

ATCGCTTTCTTTTCTTCCTCTGGAGCAGCTATGGCGGCGGCGAAGACCCTGAACCCCAAG 

GCCAAGGTGGCCGGAGCGCAGGCGGCGCTGGCGTTCAACATTAGCGGGGCGCGGGGTCTG 

CAGGACGTGCTGAGGACCAACCTGGGGCCCAAAGGGACCATAAAGATGGCCTTCATCCCA 

GMTMTCACTGMGGATTTGAAGCTGTGMGGAAAAGCCCTTCATTTTTTGGAAGAAGT 

CAAAGTAAGCAGAGAGATGGACAAGGAAAC 

Sequence 1901 

CCACGCGTCCGCCCCGCGTCCGAAACATGAGGTTCTCTCTACTGGTCCTCTTAACTGTGG 

TGTTGAGGCTTATATTTGTGTMTTTTTGGTGGGTGAMGGMTTTTGCTAAGTAAATCT 

CTTCTGTGTTTGMCTGAAGTCTGTATTGTAACTATGTTTAAAGTAATTGTTCCAGAGAC 

AMTATTTCTAGACACTTTTTCTTTACAAACAAAAGCATTCGGAGGGAGGGGGATGGTGA 

CTGAGATGAGAGGGGAGAGCTGAACAGA 

Sequence 1902 

GTCGCCCCGCGTCCGCCCTCCCTGGCMATMTATAATMCCCGTGAATTTTCAGGAATT 

TAAAMTTANGCTnTTTCCACTTAAAGGAGAAAAATATTTGGGACTAGCAGCAGAGGCA 

GTAAGAGATGTGAACCTTGGTGAGCTCTGATACAGTGAGAAGAGATTATACTCATGAAAG 

AGAATGTTAGTGTTACAGAGAAGCAGCCGATAGCAAATCGACTGTAGAGACTTGGCGGCG 

GTGGCATTGCCCCAGGTCGTCAGCAGTGTGGTATTATCTATGAGAACTTGAGCGACAGAG 
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tAT TTCTTG ATGMTTTATAGATCATTTGAGATGTT 

TTGTTTTTTCAAATAAGTAGAGACTATTTGTAAAAAACGAGGAAAGGGAAATGAAATGGG 
GCGTGTTTGATAGCMTAMTTTGGTTTCTTTTTAAAGAATTCTAAAAAGGGTCTGAGAC 
CCTGNTAGCATTAA 1 1 I I 1 1 GAGTGCCCTTCCTTTTTNCCCTTCCCCTCCCTTTTTN7TT 
TTCTCT 
Sequence 1903 

GCGTCCGCCCCGCGTCCGGGAAACCCCCTTCGATGACCTCCAGAGCCTCCCAAACGACGT 

GATCTCTTCCCTGAAGAACAGGCTGAAAAAGGTCTCCACAACCACTGGGGATGGTGTGGC 

CAGAGCGTTCCTCAAGGCCCAGGCTGCTTTCTTCGGTAGCTACCGAAACGCTCTGAAAAT 

CGAGCCGGAGGAGCCGATCACTTTCTGTGAGGAAGCCTTCGTGTCCCACTACCGCTCCGG 

AGCCATGAGGCAGTTCCTGCAGAACGCCACACAGCTGCAGCTCTTCAAGCAGTTTATTGA 

TGGTCGATTAGATCTTCTCAATTCCGGCGMGGTTTCAGTGATGTTTTTGAAGAGGAAAT 

CAACATGGGCGAGTACGCTGGCAGTGACAAACTGTACCATCAGTGGCTCTCCACTGTCCG 

GAAAGGGAAGTGGAGCAATTCTGAATACTGTAAAGACCAAAGCAA 

Sequence 1904 

CGTACGGGGTGCGGTTGGCGGCGGCGGCTGGGCCGGGGGCTGCCGGCTGCGCTCGGGCCG 

TGCGCGGCGGCCGTGCGGNCACGCCATGGACTTCAACATGAAGAAGCTGGCGTCGGACGC 

GGGCATCTTCTTCACCCNGGCGGNGCAGTTNACGGAGGAGAAATTTGGCCAGNCTGAGAA 

GACTGAGCTTGATGCCCACTTTGAAAACCTTCTGGCCCGGGCAGACAGNACCAAGAACTG 

GACAGAGAAGATCTTGAGGCAGACAGAGGTTCTGCTGNAGCCCAACCCCAGTGCCACGAG 

TGGAGGAGTTCCTGTATGAGAAGCTGGACAGGAAGGNCCCCTCAAGGGTCACCAACGG 

Sequence 1905 

CNCGC GTCC GGTGCATCTTGCCCATTGATTTCTAAATGTATTAACTACTTAAATTAATCC 

TGAATCTTTTCCCAGGCTTAAGTGGGATMTGTTTTATTGTAGATGCATATTTCCTGGCT 

CTACCCAGTCTTTCTTTGMGACTTTATCATCCTATTTTCTGMTCCAGTGGCTGACTTT 

AATCTTCTCTGGAGGAACTAGATAATTTCTAGACTAATGCTTACACTCATGATCCAGATT 

GTAATTTCTGAACTCCTTCTTCCAAATAGAATCAAAACAAGAAAGGGGAAAGCCTCTCAA 

AGCA ACTGTGCGTTMTMTGAMCACTCTTTTTTTCTAATCCAAGGAGGGTTTCATACT 

TTTTCTTAGTTTCTTGCCCTCTTCCCTTCTGATCAATAATTGTAATAGGGAMTTTGCM 

TTGTGCCAATACTCAGATTCAATACTGMCTACTTTCTTGCATTGGAATTCAAATTCCAA 

GGTTAACAACTAGCTGTATGTTTCCAAMCAATCTTATTGGATATGGATTTTCTTAGGGG 

GAAGGTTCCAGAAATGATTT 

Sequence 1906 

GANCAGGCTCAAGAGCAACATGGAGGTCTGCACTTAATCGCTCCTCTCCGGGGGCGGCCA 

TACCGAGGAGGCGTCTCTTCCGTGCAGGCAGGCTCTCCTGGGGACCTCAGAGATTCTCTC 

CAGCGGCAGCGGAAAACGGACAATGGGTGGATTCGGGTCCAGATTCTGGTAGGAGGGAGT 

TTGGGATCGAGATCTGGAAAAAAGCACTAGACTGGAAGAGGACGCGATGGAGTCGGAGCC 

GCTGGCGGGGACAAAAACCAGAGGCCGGGGAAGGCGCCGGTGGGAGGCAAGGCACGGATG 

GACTTTACCTGCGCACGCGTCGCAGCCATCTCCGCGCACAGTGGTGGCCACCGCGACTGG 

TGCTGAAGTGTTGGCCGCGTGCCGGGCGCTCCGCTGGGACCCGGGTTGCTGGCCCTGAGT 

CTCAGCTTTCTCATCTGTACGGTTGGGACAAGTACAGTAACCCTCTCCCGTCAAGACGGG 

CC 

Sequence 1907 

GTCGACCNCGCGTCCGAACATCGTCAACTACGGCATCCCAGCCCACCGTGACATCGACGA 

GTGCATGTTGTTCGGGTCGGAGATTTGCAAGGAGGGCAAGTGCGTGAACACGCAGCCTGG 

CTACGAGTGCTACTGCAAGCAGGGCTTCTACTACGACGGGAACCTGCTGGAATGCGTGGA 

CGTGGACGAGTGCCTGGACGAGTCCAACTGCCGGAACGGAGTGTGTGAGAACACGCGCGG 

CGGCTACCGCTGTGCCTGCACGCCCCCTGCCGAGTACAGTCCCGCGCAGCGCCAGTGCCT 

GAGCCCGGAAGAGATGGGACGTGGACGAGTGCCAGGACCCGGCAGCCTGCCGCCCTGGCC 

GCTGCGTCAACCTGCCGGGCTCCTACCGTGNGAAGTGTCGCCCGCCTTGGGTGCCCGGGC 

CCTCCGGCCGNGATTGCCAGCTCCCGAGAGCCCGGCCGAGCGTGCCCCGGAGCGGGCCGA 
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ACGTGTGCTGGAACCACGCGGAAGAGGACGGNATGTGCCCTTGCCCCCTGGCCCGGGCC 
Sequence 1908 

ACCACGCGTCCGGGCGGCCCGGCCAGGCCCCCGGCACTTCCTCGTCCTCGGCCCGGGTGC 

CCTGCCCCCGTCCAGGAGCCCTAGGAGTGCTACGGGGGGCCGGAGCCTTGCCCGGGCCGC 

TGCCCCGTCCCTGGATTCGGGGCTGGACGCAGCAAGCAGGNGCGCTGTGTCCCCAAGCTC 

CCCGTCCTCGGGGGAGCTTTTGGAAGAGTCCCAGATGGAAGCGACCAGGCTCCGGCAGAA 

GGCAGAGGAGCTAGTGAAGGACAACGAGCTGCTCCCACCACCTTNTCCCTCCTTGGGCTC 

CTTCGACCCCCTGGCTGANCTCACAGGAAAGGACTCAAATGTCACAGCATCTCCCACAGN 

CCCTGCATGCC 

Sequence 1909 

ACGCGTCCGGMGGGTGTTACATGTGTCTCTTCAATACCTTTGGTTTTGGGAAGATCTCA 

GGAACGGCCTGCCTCACCGTCTATGTACAGCCCATAGTATCCCTTCACTACAAATTCTCT 

GAAGACCACCTAAATATCACTTGCTCTGCCACTGCCCGCCCAGCCCCCATGGTCTTCTGG 

AAGGTCCCTCGGTCAGGGATTGAAAATAGTACAGTGACTCTGTCTCACCCAAATGGGACC 

ACGTCTGTTACCAGCATCCTCCATATCAAAGACCCTAAGAATCAGGTGGGGAAGGAGGTG 

ATCTGCCAGGTGCTGCACCTGGGGACTGTGACCGACTTTAAGCAAACCGTCAACAAAGGC 

TATTGGTTTTCAGTTCCGCTATTGCTAAGCATTGTTTCCCTGGTMTTCTTCTCGTCCTA 

ATCTCAATCTTACTGTACTGGGAAACGTCACCGGAATCAGGACCGAGAGCCCTAAATAAG 

TCACACAGCACCCTTGAAAGGGGATTCCTGGNCTACTTGGATTTGGCACAAGAGAAAAAG 

CAGGAGGGAAAGG 

Sequence 1910 

GCGTCCGCTAGTTCTAGATCGCGAGCGGCTGCCC I 1 1 1 I ICTTTTC f I I M I » M M I I I 

TGAGACACAGTCTCACTCTGTCACCAGGCTAGAGTGCAGTGGCACGATCTCAGCTCACTG 

CAACCTCTGCCTCCCAGGTTCAAGTAATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGAC 

TACAGGCACGTGCCACCACGCCCAGCTMTTTTTGTATTTTTAGCAGAGATGGGGTTTCA 

CCACATTGGCCAGGATGGTCTCGATCTCAACCTCGTGATCCACCCACCTCGGTCTCCCAA 

AGCGCTGGGATTACAGGCGTGAGCCACCGCGCCAAGCCAAGGTCTGCATTTTTCTTTAGA 

ACTCAGAACACCCAATAGTCCTAGGCCCCCATCCTCGCATGGCAGCAAGCTAAATAAGCA 

TNTTCCCACTGCGAGTTGGGG 

Sequence 1911 , 

GTCCGCTGAGAATGGATAATCTCACTGCAGGTATTCCATAATAGGCTTGGATTAAAACCA 

GGGACAATGGGCAGCAATAGGCCATAGGTTAAGCAGCAGCAACTAGTCACCACTGGACTG 

TCT TCTT CTCCCCTTCCCCATATCCCACATTCTCCTAAACATGATGTACGTGTAGCAACA 

GTCTTTTAAAGTCAGATGGTCAGACTAATTATTTTTACMTTTAAGTGTAAGTGATGTAC 

ATGAATGGACTCTGTGAATCGGAAAACTTACGTAACAGCAGAGAATACGTATGTTATATG 

G AATAACCTGAGTTG AAGGTACAA I I I 1 1 I I I CCAGCTCTTTTATTCCTTTAACTGCTTA 

ACAAMGAAAGAGTCTCCAMGTTTAAAAAACCTTTGAAAAATATACAGCTTGATATTAT 

TTACATAAAATATGAATCCAGGTTCCAATATCAAACAAACATTGCTATGTCAGAAACACA 

GTGGAAGGCAGGAACGTAACTCACTGCCTTTTAGAT 

Sequence 1912 

CCCGCGTCCGCTCTTTTCTTCTCTNTAAAGTGAATTATTCC T I I M I I I GTTTTATGTAA 

CGTGTATATATTCTTAGT7TTTCTTGAMTCATTGTMTGTTMCTTTGTTGT7TCAAAT 

ATCTTGGTGATTGCTTCATTATCTCTTCMCAAAAMMCCTTTMTTTTGCCATTGAAA 

CTGTAGMCTATGCCATGCTTTTATTAGAAGCAGTGCTCTGTGTTAACAACAAGAATGGT 

GTMTTAGMTTGGGATGTGGATATTTACTGTATGACAACACATTTACAGTTCTGTAATG 

CAAGGATGCAGTTTAAMATGTGAAGTAGTGATGGTTTTTGAAATAAGCTTTAAAATATA 

GGGGATCTTGAAGGCTCCCTGGG GTMCT ATT TTATA ACTTAGATAAAATGGCTAGTCAT 

ATCTGTGTGGTTTGNAMGGTTATT11TTTAATATTTTMGATTACAATTTT 

AGAAATGAGCCCAACTATTTAAATTTTAAAACAGTAAAAACAAA 

Sequence 1913 

CGACCCACGCGTCCGCTTTGAAGCAGGAAAAGACACGTCTCTAGAGCAACATGGAAAGGA 
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TCCAGTATTTGTTTTCAGGCAAAAGCMGCCTCCAAAAGCCCCTTGGTTCTTCTCTGTGT 

GTCTCGCCCCTTCTGCCTTGTACACCAGACTGCCGGCAAGCAAGCGGGGAGGTGGCGAAC 

AGCTGGGATCCTCCAGTGAAAAGTTTGCTCAGCGCGAGAGATGTCAAGAGGCTGGGATTG 

TCGCCTATCGGGAGGTGGGGAGCTCTCTCACTCACACTTCCTGGGAAATGAAGGAGAACT 

AATAGGCAGCCCGCATTTGCAGCCAGCCAACCTGGGTGGTATTCCTTGAAGTGAAGCGCT 

CATAAATCCCTTCTGGGGCCAGAGCCGGTTTCACAAGCTGCAGGATTTGGGTGGCCTGCC 

TGCTCTTTGGGAGCGGGCTTGGGAAGCTGAAGAAATTGAATCAGGACCTCAGGCCCTTTA 

CAGACTCCC 

Sequence 1914 

CGCGTCCGCCCCGCGTCCGCTCAGAAACCCCTGCCCTTTCCCTTCAGAAAACGATGGCAG 

GCATTCCTCTGAGTTTACAAGCAGAGACTCACTCCAACCCAAACTAGCTGGGAGTTCAGA 

ACCATGGTGGMTAAAGAAATGTGCATCTGGTCTCTTCTGTTGTTTTTATTTCATATCAG 

ATTAMTTTCTTTACCATGTTGGCTAAGTCTAAATATTAGAGATGAGGCTGTGCCTACTC 

CCTGGCCAGCTCTGCTGATAGCCTATGATGGGTTCCAATGGGAAATGACTCTTTACTATT 

AAAAGACAAGGAAAGCTCTGACTTCGTACTTCTCTGATGAATGGCAATGTAAATGAACAA 

GGCTCCATGTGACTGGAGCATGGAAGTGMTGCTACTTTCTTAATTTAATCTGCCCTGTC 

CTACCTGCTCCTCTGATTGTTAGCCATCACATAACTTATTGAATGCTTGCCATGTGCCAG 

GCACTGTGCTGAGTGCCATACATACATTTCATTTAATTATCCAATAATCCTACTTACTA 

Sequence 1915 

CCGCGTCCGATTCTNTAACATCTCTGTGAGGMGGMTTTTTATCCTTATTTTACAGATG 

AGGAAGCTGTTTGGAGATAATTTAAGTGACTTGCCTGGGGAATCTAGCCAGTAGTAGAGT 

ACTGATTAATCAGGTGCTGACATCTGCTCTGCTTTGTGTATGTAATTCAGCAGTGCTTCA 

AAGATCCAAGAAGCTGTAGCAGATCTCAATACACTCTCCTATAAAATTAGTGAATAATCA 

CCATGACAMATTGGTATGGCGGMCAGTCATTATACATTATTTAGACTCATTCCTTCTT 

CCAGTGCCCTTATGATTATTTCCTACCTTTACCATTGG 

Sequence 1916 

CCNCGCGTCCGCCCCGCGTCCGCCCCGCGTCCGATATTGCTTCAGAAAACTGAATGTGTA 

TGTCGGTCATATTGCCTTTATAACCATGCTAATATCTATGCTTTATACATACTCAAACTT 

GCCTTGCCTTAAAAAAATACATACTACATACTTAAATCAGGAATTCTAGCCATCTCACAG 

AATACCAACTAMACTAAGTGCATTGAGATCTGAGATTGGTAAACCCAGATTCATTTACC 

ACAGCTGTMTTAAGTTTTTAGAMCTATTCTCTTTTTGGGGAMTCCATTGMGTTAAT 

TTCTGTTATCTTATTAGMGMAATGATGTTGATATGTGTTTCAGATTTTCCATTTGAAA 

TCTTATAATTMTTT GATT TATTTACTGTAAGTAGGAGGTATNAATGACACTCTTAAATT 

GG AAGGA GGGGTGTTTTAGTGTCTGTNTTAGGTCAAMTTAGTGATTCTATTTTTATCAA 

MGTTTTATCCTGAAGTTTCAGGACCACTCTTCCTTAATNAACTTGTTAATGGGAAGCGA 

GCCTTATGAACATTTAAAAAT 

Sequence 1917 

CGCGTCCGCAGCAGTMTCCTTTAATAACTGGCACGAGCACTTTATTCTTCTGGTGAGCT 

CCCTGMTAlTrATTTTTCTGATTATAMTTTTCTATATTAGTAGCATTTT^ 

ACTTCTTCACTATAGAGCATTTACTTTTAGTCTCTAGATGTATATTTTGGAATGCTGTAC 

T TGGC ATA ACATAG ATTAAMTCATMTGCATGACTAAAAACTCCTTGGATTTATTTCCC 

ATTTTAAAATTTTTAGCGGTMGTTCAGATTTATAATCTTTCTCTAGACTTC 

GAATGTTGCCTGCTGAAGTAGCAACCTAAAAAGTATCCCCTGCTTATGCTTCTCCAGTTG 

GCCCTCCATGTCCATAGGCTTCGCATCTGTGATTCAGCCCACTGTGGGTCAAAAATATTT 

GGGGAAAA 

Sequence 1918 

GCCCGCGTCCGCATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAA 

ATGTGMGMTGTGATGMGCmCAGTTTCAAATCGAACCTTGAAAGACATAGGAGAAT 

TCATACTGGAGAGAAACCTTACAAGTGTAATGATTGTGGCAAGACCTTCAGTCAGACATC 

ATCCCTTGTATACCATCGTAGACTTCATACTGGAGAGAAACCTTACAAATGTGAAGAATG 

TGATGAAGCTTTCAGTTTCAAATCAAACCTTGAAAGACATAGGATAATTCATACTGGAGA 
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GAAACTTTACAAGTGTAATGAATGTGGCAAGACCTTTAGTCGGAAGTC 
Sequence 1919 

NGACCNCGCGTCCGCGCTCCGCTGCCNGGGGCGGGAGGGAGGAATGGTTGCTTCACGCCC 

CGGGGGAAGAGACNGGAAGCTCGGCTCTGGGTTGCGGGCCCCGGGGTCTCCGCGTGGGGC 

GCACCGTCCGACCCGCCCCTCCCGGTGTGCAGCGCCCCGCACCGCCCCGCCTTGCCTGGG 

AGAAGCCCGGCGGGACGCGCCGGGCTGGAGTGGGCGGTTATAGGCTTTGAGCTAGGCCGC 

TTCCGGGAGGCGGAGCTCACACCCCATTTCCTTT 

Sequence 1920 

GTCCGTTCCTTGATTCCTGGAAGTCCAGTGGGTTCTGCAGCTGAAAAAGCCCTGGGTCCC 

AGCAGCAGAGAGACAGGACAGAGGGGATGCTTGGGCGGGGAGGGACGGTAACCTGCAGAA 

CAGATTCCATTTTTATAGAACGAGTACACGTTTGCTAAAACAGTCCTGCTTTCCCAGACT 

GGATTCCCACCACAGGGACAGTCGGAACTCAGGACTAGCTCCAGCGACATCTTTCCTCCG 

AATTCMGCCTTCTATCACAATGTCAAAACAGCTATTTATAMGCCATTTTCATTGTACT 

TGATMCAGCACGAGTCCCAMACTTTTAGAAATAAAATAGGACATTGGCTTGATTGAAA 

AGAGGGACTTTTTAAAMTTGTTCTTTCGTCAGAMGCCTTTTGGATGACT 

Sequence 1921 

GCGTCCGAAAAAAATAGCATTATACCTCTTCCTTGTCTCAACCGCCATGAAAATTCTGAA 

CACTCCAAATTCAGTTGAATAATCCAAAACMMTTTATAAGTATAAAATAATTTTACTT 

CTTATAGTAATAGTATACTTTAAAAAGCCTCAGGGTATATTATCTTCTAAACAGCTACAA 

TTCAGTGCAGCTACATTAACCAACTATGTTCTCTAGTTGAGAACAACTAGGCCTATTTCA 

CTGCTGTGTAGCCTCAGTGCCTAACATGGGTGCCAAATAAATATTCGTAGAATTACACTG 

AATTGTAAAAACCATTCGTTTTTGTTTACAATTGCCAAAAATCTCAAAAGGCCCTGTATT 

TATGTMTTCTTTGAAATTATTATTTTA7T7TGATTTCTCAGTTATTG 

TGACTTAGTCATMGTACTCAATATTATAAAAACCTCAAATMTTGACTTGGATTTTACA 

CAACATCCTTCCCTTTTCTACAAGTTAAA I Mill IACC 

Sequence 1922 

TTGGTATTCTTGGCTAATTTCTTAGCTACTTGMGGTTAATTTGCMGACTTTTAAAACC 

TTAGAAAAGTTTTAAGGTTGCAAAGTTATCAACACTGGGGCAGAGGGTGGAGAGGCCAAT 

GCGGGTAGAAGGAGGCAGTTATGTTTATATTGAAGGTGAAAI I I I I CTTTCATTTAG AAT 

GGAAAACATCCCCAAATGTATCATTATAAACTAGTCAGCCTTGACTACACAAAATTGACC 

TTTMGTTGCTTGAGAAAAACACAATGCAAATCGTTCAGAAGGGTCAACATCCTTTGGTG 

CTAAMTCTTGTGTATGTTTTCAGAATGGCTTTTTCTGTATGTTATAGAATAATCACTAA 

AGGMAGGTAGTTGAATTTAMGTCATGAAGCAAGACTCTTTAATTCAGTTATTTTAAAC 

MGMTTAAMCCCCMACATCCTTGGCAGGCTTTGMGCACACAGAATTTTCTAGNATT 

TCTTATTA 

Sequence 1923 

CNCCGCCGGAACAACAACAGAAAGCTGTGTTTGTC I I II I I CTCTCAAATATATCTCCCG 

TATGAGATTTCAGGTCCCCATGTTTTCACCAAGCAATCTGCTATGTCAGCCAACCCANCA 

TCACTTTCTACAGGAGGTTATGATTTTTGCCATTTACTAGAGGMGATGTT7TATG 

CMGTTGGGGTTTGMTTCAGGTGCAGTCATCAGTTCTTTAGGGGCTGCMTGTTTTAAA 

AAAAMTMGTCATCAGATTTTAAGMAMAGTGATGATTTCTTATTGATATTTTTGTAA 

CAGMTATAGGCTCTTMCTGAAAATCCAGMCCAGAAACATAMTCTTGAGTTTCTTTT 

CATGTACATAAAAAGCAATAGCCGTTTTAGTATAGGATAGCCCTGAGCCCAAAAAGTAAT 

AGAMTTTTCTGTAGATATTTTAATACAGAGAGTGTATAGACTGACTCTAAGTTAATAAA 

TGTGCAAAAATATCTTAAAACCATNCCCTNCCCTTTATTTCAAC 

Sequence 1924 

CCNCAAAAAGGAACCAGAGGCCACTTGTATATATATGGTCTCTTCAGCATTTATTGGTGG 

CAGAAGAGGAAGATTTCTGAAGAGTGCAGCTGCCTGAACCGAGCCCTGCCGAACAGCTGA 

GAATTGCACTGCAACCATGAGTGAGAACAATAAGAATTCCTTGGAGAGCAGCCTACGGCA 

ACTAAMTGCCATTTCACCTGGAACTTGATGGAGGGAGAAMCTCCTTGGATGATTTTGA 

AAGACAAAGTATTTTACCGGACTGAGTTTCAGAATCGTGAATTCAAAGCCACAATGTGCA 
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ACCTACTGGCCTATCTAAAGCACCTCAAAGGGCAAAACGAGGCAACCCTGGAATGCTTCG 
TAAAGCTGAAGAGTTAATCCAGCAAGAGCATGCTGNCX^AGGCAGAAATCAAGAANTCTGG 
TCACCTGGGGAAACTAT 
Sequence 1925 

CCACGCGTCCGTCCGCCTCCAGCAGCATCCGCTTCAGCAAGGCCTGCCTGAAGAACGTCT 

TCTCGGTCCTACTCATCTTCATCTACCTGCTGCTCATGGCTGTGGCCGTCTTCCTGGTCT 

ACCGGACCATCACAGACTTTCGTGAGAAACTCAAGCACCCTGTCATGTCTGTGTCTTACA 

AGGAAGTGGATCGCTATGATGCCCCAGGTATTGCCTTGTACCCCGNTCAGGCCCAAGTTG 

CTCAAGCTGTAAGCACCATTTACGAGGTCATTCCTCCTCTGACAAGCCCTGGCCAGCCGG 

GTGACATGAATTGCACCACCCAGAGGATCAACTACACGGACCCCTTCTCCAATCAGACTG 

TGAAATCTGCCCTGATTGTCCAGGGGCCCCGGGAAGTGAAAAAGCGGGAGCTGGTC7TNC 

TCCAGTTCCGCCTGAACAAGAATAGTGAGGACTTTNAGCCGCCATTGAT 

Sequence 1926 

GCGTCCGGTAAGTATTTTGG AATTCAACCCTCGAATTA I I I I I I CTCATTTCAGCATAGT 

GATAGGGGATGCAATGAGGCTTCATTATTTTTTATGACCTGCCCCTCATTTGCTCTGATG 

TTCCCTAAATTCTGTAATCATATCATMCTTTTGTTATGAATAGAGAGGAATGGGCTCAC 

TGAAACCTGACACTAGAAATTGGTGGGTGATGCTCATAACTGCAAACACTTAGCTTATTG 

AAGTGCCTCTATTTACATGTTCTTTAGTTATMTATGTATTTTTCTAACAGAAATACACG 

TCTGTAATTGGTATATATTATACTTTGTATGTGTCACAACAAAAGCTAAACAGAGGCTAA 

AGTCTTTAGCAGAGAAGAATGAATTN 

Sequence 1927 

AACTGTTTGGGAAAATACGTTGAGGGAGAGAAGACCTTGGGCCAAGATGCTAAATGGGAA 

TGCAAAAGCTTGAGCTGCTCTGCAAGAGAAAATAAGCANGACAGAGGGATTTGCTCTGGA 

CAGANATGGAAGAGCCNGGGAACAGAGAAGTGTGGGGAAGAGATAGGAACCAGCANGATG 

GCAGGGGCAAAGGGCTCAAGGGTGAGGAANGCCNGTGGGACCCCACAGANTATGGGGAGA 

TAMGGACATTGCTTTGCTTTTGGTGGCACCGTAAGCTCCTTGACTGTCTNCAGCACCCA 

GMTCTCATTAAMGCTTATTTATTGTACCTCCAACCGGCTTGTGTGCAATGGGGGTCTT 

TTTGTGGAAAATCAANGAGCANACAGGTTTTCATGTGTACTGTCACCACGTGGGATGNGA 

ACCAGATNGCATGGAANCAAGACGCTAAATGNAAGAGGGCCATAANGGNTGGGATTTCCC 

AGGCNCCTTAAGAACAGCTTGTC I 1 1 1 I I 1 1 1 1 1 I CCTTTCCAAAAAA 

Sequence 1928 

CCGCCGGTMGTTAAAGACTTTAAGGACATTCAMGTTTAAMTAGTGTTCAAATTGCAA 
MTTTGGCAATCTTCATATAMTTGGTTTCTTTC 

TGTCAATTATAGGAAAACATAGTTGGAMTGTMTCATCCAMGATCATTTTTAAAATGA 

AATTTAATTAGCACATATTGMCATTTGACTTAATTGTTAMCCCCAGT^ 

TTTTTMTCAGATTTTTGCACACTGATTAGTTTTTGTGTTGTG^ 

ATTCAAGGI I I I I I I I I I I I I I CTTCCCCATGGGGGAGATTGTCTTCCAATGTTTAACTA 

CGTTTAAATAAATAAAAATTGAATTTTATTGIsrrCATTTATATAAAATCTGATCCTTGATG 

TMTTTCCAATACAGTTCCMTTTTATGGCTTTATAATTACMTGATATTTTCTTCTATA 

ATAAAAACCAMGTAMCATTTAMTGGGGAMCTGATATTTTTCATTTATATGAAGTAT 

NAAGCCCTCTACTGGGGTCNTTATTGGNGAATCATNCTGCCTTCAAANTGGTTTCAAAAN 

TGGGTAG AAANAAAACC I I I I I I I I I G 

Sequence 1929 

CCGCACACCCTAAAGAAAATAAGTATCCCAGTCGACATCAGTGACAGTGATATGATGCTG 

AACATCATCAACAGCTCTATTACTACCAAAGCCATCAAGTCGGNGGTCATCTTTGGCTTG 

CAACATTGCCCTGGATGCTNGTCAAGATGGTACAAGTTTGAGGAGAATGGTCGGAAAGAG 

ATTGACATAAAAAAAAATATGCAAAGAGTGGAAAAAGATACCTGGGAGGCAATCATTGGA 

ANGACTTCCTGTGTCTTGCGTGGAGTCATGATTAACAAGGATGTGACCCATCCACGTATG 

CGGCGCTATATCAAGAACCCTCGCATTGTGCTGCTAGGATTCTTCTCTGGAATACCAANG 

AAAGGAGAAAGCCAGGACTGACATTTGAGATTACACGAGAGGAGGGACCTTCACCCCGAA 

TTCTCNCAGAATGGAGNGAAGGAAGTACATCCCAGCAGCTNTGTGAGGGACCATTATCCC 
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AACCTGGAAAGCCCGATGTTGGTCANTCACTTGGAAAA 
Sequence 1930 

MIN IMI I ACAACTCAAAAAAAG AAATCTTTTAATAAAAATTACTCATAAAAATCCTA 

ATAMTTTTAAAGAGCAGNGATATTCCTTATTACATTTATAAMGAACATTTGGNCCTTT 

TACAAAAAGATCCCTTTTAATTNAMTAC^r^^TCTTATTTACAGATTAAACATAAAATAT 

C ATNT ACAGTTGCAAAGCATATTGCACATTACAGAGCAAAGCATTNGNGTATTTCCGNAA 

GTTTTCCCAGAGTTCCCAACTCTATAC I I M I M I GTAAAAAGATTTACCCTTGCTTATG 

CAAAMTAAAATMGAAATGCNANCTGNCGGTTTTGCTATTTAAAAACTANAANCCAAAA 

TAMCCTNTTAAAMTATTATTCCTCTGCCTTGCANAAAAGGAAAGTGAAGAGGGGThfTT 

ANAAATCAGNGGGGGTTNCCACCAGNGTCNCTTGATAATTTT 

Sequence 1931 

CGTCCGGGGAAACTTTATATGGTTGGGGATAAGAATTGAATGCAAATTAATGAACTAGAT 

TTGCATTTATGGAGTTACCTCATCATGGAGTTACCTTGGTCTGTCCCACGTCAATTAATC 

TTGNTTCCTTATTTTCATAGACCATCCTCTAGAACAGTGTTTTCACACTGTGTATCATGA 

TCCTATAATGCAGGGGTGTCCAAGCTTTTGTCTTCCCTTGGCTACACTGGTNGAAGAATT 

GTCTTGGGCCACACATAAAATATACTAACACTAATGATAGCTGATAAGCTAAACAAACAA 

AAAAATCACAAAAAATCTCATAATG MINI I GAGATTGTGCTG I Mi l l I AGTTGTGTT 

TTCAATGGAGAAATAC 

Sequence 1932 

CGTCCGGCGCGTTCGTGCGTCCTAGTTCCAGTACATGCGTGGAGGGTTTACGGCAGCGTG 

TTCTGATTCTTTGCGGGACGGCGAGCGCA7TTGTGCTTTGCCCGCCGCGGCCTANGAGGC 

CTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGCCGTGC 

GGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTACGCCAT 

CTTTMGGTTCTAMTGAGAAGAMCTTCAAGAGGTTGATAGTTTATGGAAAGAATTTGA 

MCTCCAGAGAMGCAMCAAMTAGTAAAGCTAAAACATTTTGAGAAATTTCAGGATAC 

AGCAGAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAA 

AAAAGTTCTGAAGAAAATAGTAAAAGAAGCCCATGAACCGCTGGCAGTAGCTGATGCTAA 

ACTAGGAGGGGTCATAAAGGAAA 

Sequence 1933 

AGGGAGCCGCCCCGCGTCCGCCCCGCGTCCGCGGACGCGTGGGCTAAAACCCATCAGGCA 

AGATCACCACGCATTGANATATTTTCATATCAAGATAMGTCGCACATTTTCCACAATAC 

ATTGCTAAAATAMGAGGAGAAAGGCTTAGGAAGTTTTTTTGCAGAGAGTGCTGGTAAAG 

AATTGAGCAAGTTTGCTATTGTATTGNAATGTTTCTCTCAGGTTTGNTCTTCCTATCATG 

GNNGGTATTCCATGAATAATTGAGATCAGCCCTATGTAAGGTAAGATCATAATATGGGGG 

ACAAATGG 

Sequence 1934 

GC GGAC GCGTGGGCTCCATCTGAGCTCTTGGGTGACCAGGGTGCATTGTCAATGAGGGTA 

ATATTTTGAAAGACATCTTTATTATGAGCAGTAGGTCTCAACAGTGGGCTTAAAATGTGC 

AGTAMTCATGCTGTAMCAGATGTGTTGTCATCCAGGTTTTGTGCCATGTCTAGAGCAC 

AGGCTGAGTAGATTTAGCATMTTCTGAAGGACCCCAGGATTTTCAGAATGATAAATGTG 

CATTCGCTTCCACTTACAGTCACCAGCTGCATTAACCCCTAACAAGAATCAGCCTGTCCT 

TTGTAGCTTTGGAGGCAGGCATGMCTTCTCCTAGATGGCATCTTCCAAGAGGGCTATTT 

TTGTCTACATTGAAATTCTGCTTAGTGTAGCCACCTGCTTCAATGATCCTA 

Sequence 1935 

TNGACCACGCGTCCGGGCGCCNTCTCCCAGCAATCAAGTCTTGCTCCCTGGCCTGCCCTC 
CGGCACCCTGGAGCCCCGCTTTCCACACAGCCTTGCTCCTCCCGGCCTGGCTTGCTGCCT 
GCAGGCCTCTCAGGGTCCTTCACCGTCTCTCCAGTCTCGGATCCGGACCCGTGGATCTCA 
GCCTCAGACCCTCCTCTGGCCCCGGCCTTGCCCTCGGGCACGGCCCCTTTCCTCTTCAGC 
CCTGGGGTCCTGCTCCCCGAGCCAGAATATTGTCCTCCTTGGAGGTCCCCAAAGAAGGAG 
TCTCCCAAGATCTCCCAACGTTGGAGGGGAAGTCCAAGCCCAGGGGGAACTTGACATACC 
ACCAGGTACATGCCCCCAGAGCCGAGAAGGATNCGGGGGGCAAGACCCCCANGCCGTAGG 
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(3NGTCCTCCCTTGGGTCCTCCTGGGCCATCTCTGTGGGGAAGGGACAANACTCGCAACAA 

NCCACATTCTAANGATGANGGCCCTTTTNCCCT 

Sequence 1936 

CCN CGCGT CCGGAAAATATCCNAGGTTGTACGCAGCAGTGGAAGTTGCTCTCAAGGGAGT 

GGTATTTTACACTATGCTCATGGCGACAGTCAGCAAACTCACCTGTTGAAGCAAGGAAGA 

AGCTCCATGGGCACTGGTCTCAGTGGTGGGAAACGTCCTAGTCAGGAAGAGGACACACAG 

AGTATTGGTCCTAMGNCCAGAGACAGAGCACTMTTAGGTAMTATTTTAGAGCTGTAT 

TTCTTGCTTTAGMGAGTATATAATTAACATAAATTAAGATAATTTCAAAAATGGAGCAA 

ATCTCTATTTTCAMCCAGAAAATCTTGAGGCATTAATTTTTAAG 

CAGTTAATTTTTGGTCAAGAGACATGCATCTGTACTGGAGAAATTGTTGCACCAAGTTTT 

ATATTCATCTGAACCAATGC 

Sequence 1937 

CCCCGCGTCCGCCCCTTTNCTCCCTGAGGACTCCACAGAAGATGGTATATTATGGGAAAC 

CTTCTTGTAAAAACAACTATGAGAATTATATTGACATAGTGAAATATGTGTTCAGCGCTT 

ACAAGAGAGAGTCCCCTCTCATCGTCAACACTATGGGATGGGTTTCAGACCAGGGGCTCC 

TGCTTCTCATTGATCTGATCCGATTGCTGTCTCCCAGCCACGTGGTTCAGTTCCGCTCTG 

ACCACAGTAAATATATGCCAGACCTTACCCCGCAGTATGTAGATGACATGGATGGCTTGT 

ACACAAAMGCAAGACCAAGATGAGAMTCGACGTTTCAGACTCGCAGCATTTGCAGATG 

CTTTGGMTTTGCTGATGAAGAAAAAGAGAGTCCAGTTGAGTTCACTGGACATAAACTGA 

TAGGTGTTTATACMGACTTTGCATTCAGAATAACTCCAAGAAATAGGTAACTAACCCTC 

ATTTGGCTGAAGAATTATTTTTCCTTCCGTCGAAAAAGACCTGCCATTCTCA 

Sequence 1938 

GTCGACCCACGCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCTCTCGC 
ACTCTGTTCTTCCGCCGCTCCGCCGTCGCGTTTCTCTGCCGGTCGCAATGGNAGAAGAGA 
TCGCCGCGCTGGTCATTGACAATGGCTCC 
Sequence 1939 

ACCACGCGTCCGGGCGCCAGGCTAGGGCGGCCTGGCCACTGAGCCGGGGTGCAGTGGCAG 

CGGGAGAGTACCTGGCGATGGCGATATGAGCGGTGCGGGGGTGGCGGCTGGGACGCGGCC 

CCCCAGCTCGCCGACCCCGGGCTCTCGGCGCCGGCGCCAGCGCCCCTCTGTGGGCGTCCA 

GTCCTTGAGGCCGCAGAGCCCGCAGCTCAGGCAGAGCGACCCGCAGAAACGGAACCTGGA 

CCTGGAGAAAAGCCTGCAGTTCCTGCAGCAGCAGCACTCGGAGATGCTGGCCAAGCTCCA 

TGAGGAGATCGAGCATCTGAAGCGGGAAAACAAGGGTGAGCCGGCGCGGGGCCCTAGGCC 

GGCCCTGCCTCCCCAGGCACACTCAACACTGCCGCTCCCGCAGCACAGAAACACAGCCAT 

TCAACTCCAGCACACGCCTGGGCTCAGGGGGAACACAGGACGATCTCCATTACAAGCTCA 

TAATGAATCAGACATCACAGAAGAA 

Sequence 1940 

CGCGTCCGTGAAGGCCAGACCGAGAGGTGCCAGAAGAGAACAAACCTCCATCCAGACACG 

CGGGCGGAAAGGCTCCAGGGGTCCAGGGCCAGATGGCGCCGCCTCTGCCCGACTCAGAAA 

GAGAGAAACAAGAGCCGGAGCAGGGAGAGGTTGGGAAGAGGCCTNGACAGGCCCCANGCC 

TTTGGAGGAGGCGGGTGATCTTCCTGAAGATCCCCAGAAAGTTCCAGAAGCAGATGGTCA 

GCCAGCTGTCCAGCCTGCAAAGGAGGACCTGGGGCCAGGAGACAGGGGCCTGCATCCTCG 

GCCCCAGGCAGTGCTGTCTGAGCAGCANAACGGCTTGGCGGTGGGTGGAGGGGAAAAAGG 

CCAAGGGGGGACCGCCGCCAGGCAACTCCGCCGGNGACACAGGGCAGCCCGCANAGGACA 

GCNACCACGGTGGGGAAGCCTTCCCTCCAGCGGAGAAGCCGGCTTCAGG 

Sequence 1941 

CCGCGTCCGCAGAAACATATGTGTAGTGTGCTGCAGCATAAGATGGAAGAACTTAAAGAA 

GGCCTGCGGCAAAGAGATGAGCTTATTGAGAAACATGGCTTAGTTATAATCCCCGATGGC 

ACTCCCAATGGTGATGTCAGTCATGAACCAGTGGCTGGAGCCATCACTGNTGTGTCTCAA 

GGAAGCTGCTCAGGTCTTGGAGTCAGCAGGAGAAGGGCCATTAGATGTAAGGCTACGAAA 

ACTTGCTGGAGAGAAGGAAGAACTACTGTCACAGATTAGAAAACTGAAGCTTCAGTTAGA 

GGAGGAACGACAGAAATGCTCCAGGAATGATGGCACAGTGGGTGACCTGGCAGGACTGCA 
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GAATGGCTCAGACTTGCAGTTCATCGAAATGCAGAGAGATGCCAATAGACAAATTAGCCG 

AATACCAATTT 

Sequence 1942 

CCGCGTCCGCTCGCCTGCCCCGGTGCACCCAGTCCGCTCACCCAGCCCAGTCCGTCCGGT 

CCTCACCGCCTGCCGGCCGGCCCACCCCCCACCCGCAGCCATGGACGCCATCAAGAAGAA 

GATGCAGATGCTGAAGCTGGACAAGGAGAACGCCATCGACCGCGCCNAGNNAGGGCCGAA 

TCCGACANGAAGCAANGCNTGAGGACCGCTTGCAAGCANGCTGGAGGAGGAGCAGCAGGC 

CCTCCAGAAGAAGCTGAAGGGGACAGAGGATGAGGTGGAAAAGTATTCTGAATCCGTGAA 

GGAGGCCCAGGAGAAACTGGAGCAGGCCGAGAAGAAGGCCACTGATGCTGAGGCAGATGT 

GGCCTCCCTTNACCGCCGCATTAGCTGGTTTGAGGAGGAGCTGGACCGGGCCCAAGGANC 

GCCTGGCTACAAGNCCTGCANAAACTTGTANGANGCCNNNNAANGCCGGCCTNATNATTN 

GCCNNGAAGAGGGAATTNAAANGTTNTTNNAAAAACC 

Sequence 1943 

GTCCGCTTAGTTTCTGCATTATNAGTNAGCATAAATAATAAATCCAGAAAACGTGCTGTA 

TTTGTTGTTGTTTCCTCCATGGGCTTTCCCGCCATCTMTTTGATATAGACTTCATCTCC 

CGGCTCCAAATGAAGAACCACACTGTTACTGGCATAGTCGTAATTCTGATCAGCATCTTG 

GGCAATTG CACTAGCACGC ACCTACGTGAAACAGACAAGAATTAGGATGCGTAAATGAGA 

ATTCTCAAGTTTTCTTTTTTGCCATTCATGTAGCATCACAAGCTGACTTGCTGCCATAGT 

ACAGAATTTAGCATAGCAAAG 

Sequence 1944 

GCGTCCGGCTGCGGGCGGTCGGGGCTCCGGGCCGGGGCGGCGGCGCCATCTTGTGCCCGG 

GGCCGGTGGGGAGGCCGGGGAGGGGGCCCCGGGGGGCGCAGGGGACTACGGGAACGGCCT 

GGAGTCTGAGGAACTGGAGCCTGAGGAGCCCCCGGCAGCCAAGAGGAGGAGGAGGAGCCG 

GGACTGGTCGAGGGTGACCCGGGGGACGGCGCCATTGAGGACCCGGAGCTGGAAGCTATC 

AAAGCTCGAGTCAGGGAGATGGAGGAAGAAGCTGAGAAGCTAAAGGAGCTACAGAACGAG 

GTAGAGAAGCAGATGAATATGAGTCCACCTCCAGGCAATGCTGGCCCGGTGATCATGTCC 

ATTGAGGAGAAGATGGAGGCTGATGCCCGTTCCATCTATGTTGGCAATGTGGACTATGGT 

GCAACAGCAGAAGAGCTGGA 

Sequence 1945 

CCCACGCGTCCGGCAAACCGGGAAAGGAGAGGATCCCGGAGCCGGTGAGAATTCTCTGTT 

TTTTCTCTACCATCCTTTCCAGGCCTTTTCCTCACCTAATGAGTCGTAGAGACGAGGGCC 

CAAAAAGTCTGTAAAGGTGGCTGGTGAAAGATTAAGTGNTCCAAGGGCCCTACATTCCNG 

GANGNGGTTCGGGATAAAAGAGAACTAGTCNTGGGAACAATGTAAGTGGGAACNTNAAGG 

NANNGGNAAAGCGGCCNATAAAGGNGNNCGGAGGNCCCAATGGNANTAAAGCGGACCCTG 

TGTAGGTATAGAGTTGAGTCAAGTGGAGTCACTGCCTCTTGTCCCTCTTGGTCAGCGTGA 

TGGCCAGAGGCCTGGGGGCCCCCCACTGGGTGGGCCGTGGGGACTGCTGANCCTGGGCCN 

ACTTGG 

Sequence 1946 

ACGCGTCCGGCCGGGAGTGGTGGTGGGCACCTGTAATCCCAGTTACTCGGGAGGCTGAGG 

CAAGAGAATCTCTTGAGCTCAGGAGGCAGAGGTTGCAGTGAGCTGAGATTGCGCCACTGC 

ACTCCAGCCTGGGTGACAGAGGGAGACTCCGTCCCAAAAAAAAGAAAAAGAGAAACAGCT 

GTCACCTCCCGCAGGACCCAAATCCTCTCTCTGAGCACCGTCATCCACCACATGGCTNGG 

CCTNGNTTCCAAGANCNAGTCNANCCTTTNNNGNCTTANTTNNAGGTNGANNCCGCNNNT 

TTCNNCCCAAAGGAGACAGCCCCTGCTCCTAGATGCCCTTGGCCTCCGCAGTGCAGCCCC 

CAGGTGTCCTGACTGAAGCANAGGCCNTAGCCCCAT 

Sequence 1947 

NCGCGTCCGMGTGGATGAMATTGGTACCATGGGGAAGTCMTGGMTCCATGGCTTTT 

TCCCCACCAACTTTGTGCAGATTATTAAACCGTTACCTCAGCCCCCACCTCAGTGCAAAG 

CACTTTATGACTTTGMGTGAMGACAAGGAAGCAGACAMGATTGCCTTCCATTTGCAA 

AGGATGATGTTCTGACTGTGATCCGAAGAGTGGATGAAAACTGGGCTGAAGGAATGCTGG 

CAGACAAAATAGGMTATTTCCMTTTCATATGTTGAGTTTAACTCGGCTGCTAAGCAGC 
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TGATAGAATGGGATAAGCCTCCTGTGCCAGGAGTTGATGCTGGAGAATGTTCCTCGGCAG 
CAGCCCAGAGCAGCACTGCCCCAAAGCACTCCX3ACACCAAGAAGAACACCA 
Sequence 1948 

CGCGTCCGAGTATTTGAAGTGATGCTGGCTCAGACCGCTCCCACTATGCAAAATGTAACC 
CATACATGGATTCTCCACMTCMTAGGTTTTGCTGAACTGCTGAGCCTTGGCAGTGGGC 
GCCmGAGGCTTAGMGTGCCNNMGNNTCTNNCANN^^ 

TGNCGCACTNTTANCAGTCANCAAANAAAATCCNGGGGNGNTNNTNANCCCCNAAGGCNC 

NNGNTNCCACNGTTCAGTGAAACTTGCTTCAGAGGCAGAAAGAAGCAGGTTTCCAAGCAA 

CAATCAGTGCTGCACACATGCATGCATATGCGCTAGAACTTCTATTTGATCAGTTGCATG 

AAGGAGCTAAAGCTCTTGATGTAGGATCTGGGAAGTGGAATC 

Sequence 1949 

CCACGCGTCCGGAGAGAATGGGCCGCGCGCGCCCGCGGCCGGGGAAAGCCTGTCGGGAAC 

CCGGGAGAGCCTGGCCCAGGGCCCCGACGCCGCAACCACCGACGAACTCAGCTCTCTCGG 

GTCTGACTCGGAGGCCAACGGCTTCGCCGAGCGCCGCATCGACAAGTTCGGCTTCATCGT 

GGGCTCGCAGGGCGCCGAGGGCGCGCTGGAGGAAGTACCCCTGGAGGTGCTGAGGCAGAG 

GGAGTCCAAGTGGCTGGACATGCTCAACAACTGGGACAAATGGATGGCCAAGAAGCACAA 

AAAGATTCGTCTGCGGTGCCAAAAGGGCATCCCGCCTTCTCTGCGGGGCCGTGCTTGGCA 

GTACCTGTCAGGAGGCAAGGTGAAGTTACAGCAGAACCCTGGAAAGTTTGACGAGCTGGA 

CATGTCCCCTGGGGACCCCAAGTGNTTGGACGTGATGGAACCGTGACCTGCNCCGGCAGT 

TNCCATTCATGAAGAGTTTTTGTGTCCGGGGGGG 

Sequence 1950 

NGCGTCCGGCTTTACAACGGGCAAATACTGGAAACCATCGGAGGCAAACAGCTCANAGTC 

TTCGTATATCGTACAGCTGTCTGCATTGAAAATTCATGCATGGNGAAAGGGAGTAAGCAA 

GGGAGAAACGGTGCGATTCACATATACCGCGAGATCATCAAGCCAGCNNANAAATCCCTC 

CATNGAAAAGTTAAAACMGGATAAGCGCTTTAGCACCTTTCTCAGCCTNCTTGAAGCTT 

GCANAACTTGAAA AGAA GCTTCCTGACACAACNCTGGAGACTGGGACATTATTTGTGCCA 

ACCAAATNGATGCTTTTAAAGGGGAATGACCTAGTTGMNNAAAAAAGGAMTTTCTTGA 

TACNGGGACCAMMMTGGCTCNTTTCAAAMACAATTCCATTTCCTTTATTTNACCCC 

TGGACCACCCCMGGGMGTTTTTTTCMTTTGGGGAAAAMGNGGAATTTTTGG 

Sequence 1951 

CAATNCTTGAACCAGGAAGGCGGAGGTTGTCGGGGGCTGAGATTGCACCACTGTGCTCCA 

GCCTGGGCGACAGAGGGAGTCCCTTTCTCAAAAAAATAATAATAAAAATAAAGATGGCAG 

TAGGMGGTTTCAGCTTGAGATGCTGTCTTTTCTTCTGTTTTTATGCATAAATACAACGA 

AGACGGGAGAGGAGATGGAAAGCAAAGATGATTAAGTGAAATAATTGTGGGAAACAATAG 

AGGGATAGACTTTGCTTATAGGGGATGTGGACAGAGCAGAAAAATGGGAGGAATGGGGAG 

GATTCAGTTAGAGAAGGAAGAAACCGGTCCAAGGGGCTGGGGCTTTAGGCCCTGGGGCCT 

CCAGTGCCCGTATAAGGCTGTGGCAGAAGCCCTGCCCATTTCCGTTCCTTTCACTCCCTA 

TNTTCACCCTTACACCTCCCAAAAAC 

Sequence 1952 

CCGCGTCCGGGTTTTTTGGCCTTATTTTC 

GATTTCATTCATTGMGMTCMTACATATmGTTTCAAMTGTTTGAAACAAAAGACA 

TAGATG GTAG ACTTTTATTAAAACATATATGGATGTGGAAAGCACATATATTAATGCAGT 

CATCCCTTTTCAGGTGGGAAGAGAGCAMCCAGTTGATTTTTTMTTCATCCTTAGTACA 

CAGAGMTATCTTTTNCTCAAGGAATATACCCTGGTTGGAGCT7TAAAAMGAATGGTTT 

TGGGAA CCATT TCATTTTTCCCAANAAGGTTGCTATTCTTGGGGTAANTGGGGNATACCN 

GANTGNTTTTTCCNGGGGGGTTTTGGTGGTGGGGGGTAAAATTGGGGNI I I I I I I 1 1 I 

Sequence 1953 

GCGCCGAGGGGTGTAATGCATTNGCAGCAAGAGCTATGAGAAATAACTTTAGACATTATT 
TCATTGMCCTTCCCAACTGAMTTATTTTATGATGTTATAACATGGATAGTAACTCAAG 
TAGCAATAAGTTACACAGTTGTGCCATTTGTGCTTCTTTCTATAAAACCATCACTCACGT 
TTTACAGCTCCTGGTATTATTGCCTGCACATTCTTGGTATCTTAGTATTATTGTTGTTGC 
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CAGTGAAAAAAACTCAAAGAAGAAAGAATACACATGAAAACA7TCAGCTCTCACAATCCA 

MAAGTTTGATGMGGAGAAMTTCTTTGGGACAGAACAGTTTTTTTACAAQAMCMTG 

TTTGCAATCAGAATCAAGAAATAGCCTCGAGACATTCATCACTAAAGCAGTGATCGGGAA 

GGCTCTGAGGGCTG I U\U\U\ \ GATGTTAACAGAAACCAATCTTAGCACCTTTTCAA 

GGGGTTTGAGTTT 

Sequence 1954 

TCGCCCCGCGTCCGGTTAAAACTGCCTC7TTAGATGTGGATGCCTTAATGCTGTAACACA 

TTTGAAAACATTGGCAATACTTMGTTGCTGCCATGATTACAGATGGAATTATTGGCTAC 

CAAAGAGACGCAATTGATGATGAGAAGCATGATTCTTGCTTCCATATAACCAAAGTTAAT 

CTTMTTGCMTTTGACTCCGTTTCCTTGGTAGGGATAGACTTTCTTCAGATTCCAAGTG 

CTCTCTTAMTGGCAAATTAAGTTAAAGAATACTACTGCTCCATTCCCCTCACTTATTCT 

C CAGT TAATTGCTTGTCAGTTCCATTTCMGAAAGCAGTGGATGTTCCAGGTTTGATTCA 

GTTTTCCTGTGCACACTATTGCCAAA I J I J J J J J I AGCAAAGATTCTGCACTGGAACGTA 

GACAGTTGGAMCAGTACTACCTACTAGAGGGTATGGGGTTTCTCTTTCTCCCCGCTTTC 

ACCTCTTTCTTTTCCAATTC 

Sequence 1955 

CCNCGCGTCCGGCTCTGCCCAGTCACCCGGTCTCCTCCGGCTTCCCTCCGGCCAACAGCG 

CGCTCAGGCTCGCCTCAGGCCCCTCCAACGGAACAGGAGTCGAGGGGCAGTGAGGCCGGG 

ATGCGTGCGAGCGCGGGGCGCGGCTGGCGCTGGGCCGTGGGGGCGGGGCGGCGTGCGTGC 

CAGCGGCCGTGCGATTCTGTGAGGCCTGCTCTGCGCCGGCGGGGAAGCGCGGGCGACGCT 

ACGCTGGACGTCCTTACCTGCCGGAGGAGAGAAAGTGCCGTCAGCTGTAGGGG 

Sequence 1956 

GCGTCCGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGAC 

GCGTGGGCGGACGCGTGGGCGGACGCGTGGGCGGACGCGTGGGAGAATGTGCTATACCCC 

AGAAATTCCATCGATCTGTCATGGGCCCCAAAGGTTCCAGAATCCAGCAGATTACTCGGG 

ATTTCAGTGTTCAAATTAAATTCCCAGACAGAGAGGAGAACGCAGTTCACAGTACAGAGC 

CAGTTGTCCAGGAGAATGGGGACGAAGCTGGGGAGGGGAGAGAGGCTAAAGATTGTGACC 

CCGGCTCTNCAAGGAGGTGTGACATCATCATCATCTCTGGCCGGAAAGAAAAGTGTGAGG 



Sequence 1957 

CCACGCGTCCGCTGGAGGCACTGGACGAGATGCTGACGCACGACATCGCCAAGCTCATGC 

CCCTGCTGCGGCAGGAGGAGCTGGAGAGCACCGAGGTGGGCGTGCAGGGGGGCGCTTTTG 

AGGGCACCCACATGGGCCCGTTTGTGGAGCGGGGACCTGACGAGGCCATGGAGGACGGCG 

AGGAGGGCTCGGACGACGAGGCCGAGTGG^GTGGTGACCAAGGACAAGTCCAAATACGACG 

AGATCTTCTACAACCTGGCGCCTGCCGACGGCAAGCTGAGCGGCTCCAAGGCCAAGACCT 

GGATGGTGGGGACCAAGCTCCCCAACTCAGTGCTGGGGCGCATCTGGAAGCTCAGCGATG 

TGGACCGCGACGGCATGCTGGATGATGAAGAGTTCGCGCTGGCCAGCCACCTCATCGAGG 

CCAAGCTGGAAGGCCACGGGCTGCCCGCCAACCTGCCCCGTNGCCTGGTGCCACCCTCCA 

AGCGACGCCACAAGGGCTCCGNCGAGTGAGCCGGGCCCCCCTT 

Sequence 1958 

ACGCGTCCGCCTGGCTAACATGGCCGAAAGGTCGTATTCTCCGGGGGAGGACGGAGGCCC 

GGAGAGGAGGGGGTGGAGTGCCTGGTTTTCCAGTCAGGCGGCCGGAGGGCAGCCCTCAAG 

AACGGCCCTGACCGCCCGCGGGGTGAGGGGCCCTTTCTGGGCAGGACCCGCCCCTTGGTC 

CCGCAGAGCCTTGGTACTTGGACCTGAACCTTGCTCCGAGAGGGAGTCCTCGCGGACGTC 

AGCCMGATTCCAGAATGACTACTATCTTGACTTACCCCTTTAAAAATCTTCCCACTGCA 

TCAAAATGGGCCCTCAGATTTTCCATAAGACCTCTGAGCTGTTCCTCCCAGCTACGAGCT 

GCCCCAGCTGTCCAGACCAAAACGAAGAAGACGTTAGCCAAACCCAATATAAGGAATGTT 

GTGGTGGTGGATGGTGTTCGCACTCCA I I I I I GCTGTCTGGCACTTCATATAAAGACCTG 

ATGCCACATGATT 

Sequence 1959 

CCACGCGTCCGGGGGACATGGTGGTGAGCAGGACAGAATTCCTGTCCTCATGAGTCTTAC 
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Attctagaaggaaggagcagataaaatctaaataaggttatgagattgtgacgaagcgtc 
cgattacacctcaacaagaaggcaactgacaaacagccttatagcaagctcccaggtgtc 
tctcttctgaaaccactgaaaggggtagatcctaacttaatcaacaacctggaaacattc 
tttgmttggattatcccamcctttatttttcttcggtagcaccaatgaaaaga 

Sequence 1960. 

TCCGGTGCAGCCGGGCTAGCTTCTCCTGCATCTCCCGAACGTGCTCCAGCTGCTCAAAGG 

AGCNTTCCTTCCCTGGAACGAANCCCCTGNCTGAACCCTNNNAAAGAGNCCTGGAACCCT 

GGAACCCTGGAACCATGGNCCCAGCCNTNCCCCANGNACTGGGCCCTACAGCCCTATCCA 

CACACCCGCTGAAACCAGCCCCCAGGACTCACCGAAGGCCNGGGGGCGGCCAGAGTGGAA 

ATCATTGAGCAGGTTCAGGAGTCCCCCCTCCATCTNATAGACATCAGTCACCTCGGTCAG 

GAAGGAGTGCTGCAGGGGCGTGCCTGCAGAGCCGCTTCCACCTCCAGCCAGGACT 

Sequence 1961 

NGNCGCGGTTCTCGTCTCCTCCCGGCCGGCGGAGCGAGTGGAGGCTGCAGCCCAGCTCGT 

CTCGGCGCCCGCGTCGCCGTCNCGAAGCCCCCCGCCCCGCTTCCGCCGCGTCGGAATGAG 

CTCCCGGAAAGTGCTGGCCATTCAGGCCCNAAAGCGGAGGCCGAAAAGAGAGAAACATCC 

GAAAAAAAATCAAGCAGAAGATTGAGCTGCTGATGTCAGTTAACTCTGAGAAGTCGTCCT 

CTTCAGAAAGGCCGGAGCCTCAACAGAAAGCTCCTTTAGTTCCTCCTCCTCCACCGCCAC 

CACCACCACCACCGCCACCTTTGCCAGACCCCACACCCCCGGAGCC 

Sequence 1962 

CGCCCACGCGTCCNNCCCGCGTCCGGGAGAAGATGGAAGCAGTGCCCGACGTAGAGCGCA 

AGGAGGACAAGCCCGAGGGGCAGTCACCTGTGAAGGCTGAGTGGCCCAGCGAAACCCCGG 

TGCTGTGCCAGCAGTGTGGCGGCAAGCCTGGCGTCACCTTCACCAGTGCCAAGGGCGAGG 

TCTTCTCCGTACTGGAGTTTGCACCCTCAAATCATTCTTTTAAGAAAATTGAGTTCCAGC 

CTCCAGAAGCCAAGAAGTTCTTCAGCACAGTGCGGAAGGAGATGGCGCTGCTGGCTACCT 

CACTGCCTGAGGGCATCATGGTCAAGACTTTTGAAGATAGAATGGACCTCTTCTCAGCTC 

T 

Sequence 1963 

TCCGCGGGNGCGTGGGCGGCCACTCATTTGCATTATCTTAAATCACAAATAATTACTTAA 

TTNGCTGGAGTGTGTGCTTTGCAACTTTTATACCAGAGTAAAATTTGTATTTAAAACAAA 

AAATAAGAATGCCCATCACTAGGAGAAACACTCCTCACAGAAAACACACACACACACACA 

CACACACMTTTAAAMCTGAGTAAATTTAMTGTATGAAAGGCNCCACAAATTGATTTC 

MCAMTMATTTCATTTCTCTAGCTACTTATGTCTGGCCTTATTTTGAGCGTTACAATT 

TTATTGCCTTCATTTGCCTATTTAGACTGATGTAGTTTGATATGATGAAGT 

Sequence 1964 

CTNTAGGGAGNCGACCCACGCGTCCGCCCCGCGTCCGGTTAGTTCTACCTGGTGCCCATG 

TTCTGATTGTGTGTGGGATTGCATGGTGTCCTGATTGCATCTAGGTGGAGCGGATGGAAT 

GTGCTGGGCCACTGTTGGGTGGAGAGCAGCACATTCTTACAGAGGAGATGGAGCGTTATG 

AGCATAGTATGTGGATAGGTATCTTCACCTGCCCGCCCCTGAGTCAGCCTCCTTGACTTG 

ATAGCTTGAAGMTCCTTTTCCACTGAAATAGAGGATAATTAATTGACACATCTGAAATC 

CCCAATCAATCAATCAAGAGAAAGGTAGAACTAAAAACTCCTTAACTTACTGTTGCTTAC 

ACCCCTGAAAGTCTGTTTTT 

Sequence 1965 

CGTCCGGCGCCCTCCAGCCCCGTCCGGGAGTCCCCGGCCCGCTGCGGTGCCGGGAGTACC 

TCCAACCCCCTGCNCCCCNNANGGAGGCCNAGGGGCTTAGCCACCAGGGCTCGGAAGTGG 

GGGCCGAATCCGGTGCNNGNCNNNNNNNAGGNNAGCANAGCCGGAGTTGGGGAGACTGGT 

TGCTGAAAAGCCAGGAGTCAAAATGACTGAGCGCTTTGACTGCCACCATTGCAACGAATC 

TCTCTTTGGCAAGAAGTACATCCTGCGGGAGGAGAGCCCCTACTGCGTGGTGTGCTTTGA 

GACCCTGTTCGCCAACACCTGCGAGGAGTGTGGGAAGCC 

Sequence 1966 

CCCACGCGTCCGGTGCACTCAACAGAACGGCCCTTCAAATGTGAGAAATGTGAGGCAGCT 
TTCGCCACGAAGGATCGGCTGCGGGCGCACACAGNACGACACGAGGAGAAAGTGCCATGT 
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CACGTGTGTGGCAAGATGCTGAGCTCGGCTTATATNNNNGACCACATGAAGGTGCACAGC 
CAGGGTCCTCACCATGTCTGTGAGCTCTGCAACAAAGGCTTCACCACGGCAGCATACCTG 
CGCATCCACGCGGTGAAGGACCACGGGCTCCAGGCCCCGCGGGCTGACCGCATCCTGTGC 
AAGCTGTGCAGCGTGCACTGCAAGACCCCTGCCCAGCTGGCCGGCCACATGCAGA 
Sequence 1967 

CGNCCCACGCGTCCGGCGGTGCCGCGGGGATGGCGGGAGCCGGAGCTGGAGCCGGAGCTC 

GCGGCGGAGCGGGGGCGGGGGTCNAGGCTCGAGCTCGCGATCCACCGCCCGCGCACCGCG 

CACATCCTCGCCACCCTCGGCCTGCGGCTCAGCCCTNNNNCCNNNNNANNNATGGCNNGN 

TCAGGGGGCCTGGGGTCTGGGGACAACGCCCCGACCACTGAGGCTCTTTTCGTGGCACTG 

GGCGCGGGCGTGACGGCGCTCAGCCATCCCCTGCTCTACGTGAAGCTGCTCATCCAGGTG 

GGTCATGAGCCGATGCCCCCCACCCTTGGGACCAATGTGCTGGGGAGGAAGGTCCTCTAT 

CTG 

Sequence 1968 

GCGTCCGGGCGTGTAAACCAGCCGGAGCGGCGCGGCAGCGGCAAGGACCGCCGTGGCGCC 

TAGAGTAGCCGACCCGGGGGGAGCGCGGGGCGACGCTGGCTGCAGGGACCCGGTGACAGC 

GTGAGAGGTTCGCAGAGTACTAGGTTTTGACAAGCTTGCATCATGCGTGAGTATAAGCTA 

GTCGTT CTTGGCTCAGGAGGCGTTGGAMGTCTGCTTTGACTGTACAATTTGTTCAAGGA 

ATTTTTGTAGAAAAATACGATCCTACCGATAGAAGATTGTTATAGAAAGCAAGTTGAAGT 

AGATGCACAACAGTGTATGCTTGAAATCTTGGATACTGCAGGAACGGAGCAATTTACAGC 

AATGAGGGGATT TATA CATGAAAAATGGACAGGGATTTGCATTAGNTTATTTCATTNCAN 

GCCCANNCCCAATTTTACGTTTAAAAAACCTGNNNNAAAMMTTTTT7TGGNTAAMAN 

CCNTNTTGTNTCCCCAAMNTTTTTTTGGGGNANAAAAANNGGCCCTCCNAAAAAAAAAA 

Sequence 1969 

CCCACGCGTCCGCACGCCAGTGCCTCCCTTTACCTACTAATGAGGCAAAACTTTGAGATT 

GGGAATAACTTTGCCAGGGTTAAAATGCAGGTAACAATGTCACTATCCTCCTTGGTGGGC 

ACATCTNANMTTTTMTGMGAATTCTTMGACNGTCTTTCTAAANNACTATTTNGTAC 

ATTATGCTTGAAGAANATNTGNGAATTGAGGGAAACA 

Sequence 1970 

GCGTCCGGTTTTCCMCTGCAGCTTTTTAATGGTTAACCTTCATCTA AI MUM I CTCC 

CACTGGTTrATAGATCCTCTGACTTGTGTGTGTTTATAGCTTTTGTTTCGCGGGGTTGTG 

GTGAGGAAGGGGTGATGGCATGCGGAGTTCTTTATCTTCAGTGAGAANNGGCCCTGCCCG 

CCTGAGAGCCAGCTTNCGCGTTGGAGGCACCGNGTTCAGAGAGCTGCTGAGCGCCACCCT 

CTACCCNGCTGACAGACAACACAGACCTGTGCCGMGGCTAANTTGNGGCTTTTTACGAC 

CCTACCCCACCCCCTGTTTTCAGGGGTT 

Sequence 1971 

CGTCCGGTGAGATTCTCCGTAATGGGCGGGGACAGAGTGCCCTGCAGGAGATTCTGGGCA 

AGGTTATCCAGGATGTGCTAGAAGACAAAGTGCTCAGCGTCCACACAGACCCTGTCCACC 

TCTATAAGAACTGGATCAACCAGACTGAGGCCCAGACAGGGCAAGCGCAGCCATCTCCCA 

TATGATGTCACCCCGGAGCAGGCCTTGAGCCACCCCGAGGTCCAGAGACGACTGGACATC 

GCCCTACGCAACCTCCTCGCCATGACTGATAAGTTCCTTTTAGCCATCACCTCATCTGTG 

GACCAAATTCCGTATGGGATGCGATATGTGGCCAAAGTCCTGAAGGCAACTCTGGCAGAG 

AAATTCCCTGACGCCACAGACAGCGAGGTCTATAAGGGTGGTCGGGAACCTTCCTGTACT 

ACCGCTTCCTGAACCACTTGTGGTGGCTCCTGACGCCTTCACATTGTGGCCATGGCAANC 

TGGTGGAGCCCTGGCTGCCCCCCAGCGCCATGCCCTGGG 

Sequence 1972 

GAGGGGAGGGAAATTTATTTCTCTGCTTTTCTATTATACAAGTTGTTTACAGAAACTGCA 
MTTAAAAAATTACACTGGCATTTGCAGTCCTTAAAATAAATTAAAAGTTCTCMCTTTT 
I I I I I 1 I I GCTAAACATTTTTTTAAGTATG AGTCCTTTGTTTAAAAAGAAAN AG ATTANA 
ACAGAAAATATTTTCTATAMTAATACATGTATTTTGGTTTTAGTGCTCCCGCCC 
Sequence 1973 
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GCCGACCCACGCGTCCGACATCCCCAGGCACATCCGGAAGGAGGAAGGTTCCTTTCAGTC 

CTGCTCCTTCTGCANGNCCAAGAAATTCACTACCATGATGGTCACACTCAACTGCCCTGA 

ACTACAGCCACCTACCAAGAAGAAGAGAGTCACACGTGTGAAGCAGTGTCGTTGCATATC 

CATCGATTTGGATTAAGCCAAATCCAGGTGCACCCAGCATGTCCTAGGAATGCAGCCNCA 

GGAAGTCXCAGACCTAAAACAACCAGATTCTTACTTGGCTTAAACCTAGAGGCCAGAAGA 

ACCCCCAGCTGCCTCCTGGCAGGAGCCTGCTTGTGCGTAGTTCGTGTGCATGAGTGTGGA 

TGGGTGCCTGTGGG 

Sequence 1974 

CGCGTCCGCCGAGAGAAGAGGTACGGTCGAAGCCCGGAGCCAGGCCGAGCGGGAGCTGAC 

CAGGCTTGACTCGGGTACAGAACGAGGCACCAGTCCCCTTGCGAACCGAAGGGCCTCGCA 

GTGGATGGAGGAGGCCCAGCCCTGAGGTCAACGCCAACCAGGCTAGCCTGGCACGGGGCC 

TACAGGGTGGGTAGGCGGGCGTGCCGCAGCCGTCCAGGGCCTTCCCTCAGGTCCCGGGCC 

GAGGGGCCTACGCTGCGGCCCGGCAACAAGGCCCGACTCGGCCCCTCGGGACCAGAGCCC 

CACCCGATCGGAAGCGGATC 

Sequence 1975 

GGCCTCAGGGGGNAGNNATCCTGCAAAGACNNACATGAGCCCANAGGGGAAANAGAGNCA 

CCTGNGAGTACNNGCCTTTGGGNNTGACCTTGGCTCTCAGCACAAGATATTTACAGCCTN 

TGAGCTTGATATTCTAGAATTGNTACAGAGATAGCTCTGGAAAAGAAATAGACTAGAAGG 

ATAMGGGAAGGAATCATAGCTTATGMGGTT7TTACTCTGCATCAGACCGNTTTCTAGN 

NCTATGACTTAACGTCCTATAGGCTGTAAGGTTCTCTGCGTGAACACTTCTTTNCTGGCC 

TCCTTTCTGCCCCATTCCTNTTTAMCTCAGTTGCTGAGTTTATTATNCCCTGTGCATCC 

CTGGGCATTGNTCATTCACATATGGAAACMTTCANGGAGGACCCTTTGCCTANTTTNCT 

TTATCCTCTTGMTTNTGGATGGGAAAMTNTTAANTNCTTTTGGCCGCCTGCANTNGGG 

GNAANTATTGGGCCTT 

Sequence 1976 

CGCCCCGCGTCCGCGTTGMCAGCTTTAAGAACAATGTGAATGAAATCTTAGCAACTTGG 

TTAGTAATCTGAAMGTCTATTAATGTATACTTGAAATTCTGTTTGTATAAAAATGCATT 

TTCCCCTTTATTTTAACACTGTGTAAMGMCATTATGCATGTGAGTGGTTTGAGAATTA 

AATG G TTT AAT ACTC AAAA AAAAAA AAAAA AA AAA AAAAA AAAAAAAAAAAAAAAAAA AA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGGGGNGNCCNTTANNNTNNTTNAAAA 

AAAAACCCCCCNNNCCNCCCCCTGNACCCGNAANNAAAAAANNANNCCNTNGGGGGGGGA 

ANCCTNNTT NTTGNCC CTTAANAGGGGNCCCAAAAAANCNATTNCCNCCCNANTTTCNNN 

AAAAAANNC I Ml! rTCCCCNNNTTNNAAGNGGGGGGNNGGCCCANCCCCANAAGGGGTT 

TTTNAAAGGGGGGNAACCCCCNGGTCCCCGNCCCCAAAAAAAA 

Sequence 1977 

CGC CNCG CGTCCGGGCTGCCGTAACAATCGCCACAAACCTGGTGACTTAAATAACAAATA 

TTTATTTTCTCGTAGCTCTGGAAGCCGGAAGTCCAAAACCAATAACTTGGCAGGGCTGTG 

CTCCCTCTGGGAGATCTCGGGGAGCCTCCTTGCTACATGTTCCGCTTCTGGGGACTTCTG 

GTGGTGTTTCCCTGGAGGGCCAAGGCCTCCGGGCGGCCCGTGCATTCGGTGAACACCTCC 

AMGAGCTCTGAGTCACMAGCCTTGGGAMCTTTATTTTATTCTTTCTAGGACATTATC 

AGTAGTCCCGAGGAGACATCAATTACAAAACAAAAAAAAAAAAA AAAA AAA 

Sequence 1978 

CGTCCCGACAGTCAGCCGCATCTTCTTTTGCGTCGCCAGCCGAGCCACATCGCTGAGGAC 

ACCATGGGGGAAGGGTGAAGGGTCGGGAGTCAACGGGATTTGGGCGGTATTTGGGCGCCC 

TGGGTCACCAAGGGGCTGGCTTTTAAACTTCTGGGTAAAAGGTGGGATATTGGTTTGCCA 

ATCAMTGACCCCCTTCATTTGGACCCTCAAACTACCATGGGTTTACAATGGTTCCNAAT 

ATGAATTCCACCCAATGGGCAAAATTCCCATGGGCACCCGTCAAAGGGCTTGAGAAACGG 

GGAAAGCTTGGTCAATCAAATGGGAAAATTCCCNATCACCATCTTTCCAGGAGCCGAGNA 

TCCCTCCAAAATCAAAGTGGGGGCGATGCTGGGCGCCTTGAGGTACCGTCGTTGGGAAGT 

CCACTTGGCCGTTCTTTCACCCAACCATTGGGAAGAAAGGGCTGGGGGCCTCATTTTGCA 

GGGGGGGGGAGCCAA 
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Sequence 1979 

GAGTGTAGTTACTTCGGTCTTGCCTCACTGGGAGGCCACGTTGGTGACGATGCACACGAA 

GCCCCGGTACTTGTCCAGGTTAACCATGTGCCCGTCCGATGTCCTTGGCGGAAAACTCGT 

GCATGGAGCGCGCACAGCGCCAGTCGTCCNGGGACGCGCACGGGTCCTGGGACTTGGGAC 

ATTCGNGGATCTCCGGTCGCAAGCTGCGGGGACAAGGCTTCTTCCGCCGGCGCCGCGCCC 

TCCTGCGCCTGCACGCTGGGGCCTTCCGCCGCCGAGGGGCTCGCGGCCGCTAGACTA 

Sequence 1980 

CGCGTCCGATCGGAAGTGGCGCTCGTGCATTCAACTTGTTCCCGCTCATGGAACCCCTCT 

TTAAAAAGACGCAGGGCACCtGTGAGCGCAGGAGCGAGCCTAAGGCCACCCAGCGGCAGC 

GCCCGTGTCCTGGGCACTCAGCGTGCTGGGCAGAGCAGGTGCGATGGCCCCAGTCCTAGC 

AGCCCTCGCCCATGTCCTGTGCCCTTACATGGCTCCCGGACTGTGCAGGGAGCCGATACG 

TTTGCTGATAGCAATACTGGAACCACCCGGGTGCGATGGCAGTGAGGAGACTGCCCAGTG 

CCTTTGGGGCTGTGCTTGCAATAAAGAAAGAATTTCCTGGAAAGGCAGTCTGCAAAAGAG 

GGMCCCGG TGACT CAGAAAGACAGGATGTTTTGGTAATTTACCCCNAAATGTGCCATCC 

ACCATAGTGCTTTTTCCTCTTGCCCTTCGGCTTGTTTGMTCTCACAATTATGGTATTTA 

ATTCTCAAAGAAATATGTATCTGTTAGCCCGNTTGGTGACACTTATACAGATGATTAA 

Sequence 1981 

CNCGCGTCCGGTCGAAGCGAGGACGTGGTGGGTCCTCTGGTGCGAAATTCCGGATTTCCT 

TGGGTCTTCCGGTAGGAGCTGTAATCAATTGTGCTGACAACACAGGAGCCAAAAACCTGT 

ATATCATCTCCGTGAAGGGGATCAAGGGGACGGCTGAACAGACTTCCCGCTGCTGGTGTG 

GGGTGACATGGTGATGGCCACAGTCAAGAAAGGCAAACCAGAGCTCAGAAAAAAGGTACA 

TCCA GCAGTGGTCATTCGACAACGAAAGTCATACCGTAGAAAAGATGGCGTGTTTCTTTA 

TTTTGAAGATAATGCANGAAGTCATAGGTGAACAATAAAGGCCGAGATGAAAAGGTTCTG 

GCCATTACAGGACCCAGTAGCAAAGGGAGTGTGCAGACTTGTGGGCCCCCGGATTGCATC 

CMTGCTGGCAAGCATTGCATGGATTCTTCCAGTATATTTGTAAAAAAAAAAAAAAAAAN 

NAAA 

Sequence 1982 

GCGTCCGTGGTAACGATTGGCCCTAAGAAGCCCCTGCCTGACCCGTGAGCATTGTGGAAC 

CCAAAGATGAGATACTGCCCACCACCCCCATCTTCAGAACAGAAGGGTGGGGAAGCCAGA 

GCCCGNCTGCQATGCCCCAGCCAGTCCCCACAGCATAACAGGGTCTCCTTGGCAAGCTGT 

ATTCTGGAGTCTGGATGTTGCTCTCTAAAGACCTTTAATAAMTTTTGTACACTGGACTT 

TAAAGTATTGNTCACAAGGGTTATGCAATTCCNNGNCANG 

Sequence 1983 

CCCCGCGTCCGTGACAATCGAGTAGTACTCCCGATTGAAGCCCCCATTCGTATAATAATT 

ACATCACAAGGACGTCTTGGCACTTCATGAAGCCTGGTCCCCACATTAGGGNTTTAAAAA 

MCCAGGNATGGCMTTTCCCCGGGACCGGTCTAAAACCAAAACCCACTTTTCAACCGGC 

TTACCACGGACCCGGGGGGGTATACTTACGGGTCAAATGGCTTCTGGAAATCTGGTGGGA 

GGCAAACCACAAGTTTTCATGCCCATCGTCCTAAGMTTMTTCCCCTAAAAATCTTTGA 

AATAGGGCCCGTATTTACCCTATAGCACCCCCTCTACCCCCTCTAGGAGCCAAAANAAAA 

AAAANAANAAANNAAAGTGCGGCCCGCTAGACTTAGGTCTAGGAGGAAAAAAACCTTTCC 

ACACCTTTCCCCTGGAACCTGGAMCATAAAMTGAATGCAAATTGTTTGGTTGTTAACT 

TTGG NTTAT TGGCAGGCTTATAAATGGGTTTACCAAAATAAAAGGCCAATAGGCATCACC 

AAAAATTTTCAACAAAATAAAAGGCA I I I 1 1 I I II I I AACTGGCATTTCTTAGTTTGGNG 

GGGTTTTGGTCCCAA 

Sequence 1984 

CCGMAGAATATCTGTGTGCTTAGGGAGGAMCTTTTTGATCTGCAGAAAAGCCAGAAGA 

CATCTAGGACATCCATAAAAATTCATCAGAGAGCATTTTACTACTGAGCTGCAAAGGGAA 

AAACTTAAAATGGGATATGAAAAGTGAAGAAAGTGATCATAGGGAGAAAAACCATTTCAG 

ATGACAAGAGCACCTCAAAGGCAGCAGCCTCAAGGAGCAGCCATGGCCCCAGACTTGTCG 

CACGGATGCAGAAMCTTAATGGAGGAGGCTGAGGTCAGMTGGGAAGAGTTTTTAAAAA 

ATAAAAAGGGGAGCTAATATGTGAGGGACCAAAAAAANNNNCANAAAAAANAAAGTGCCG 
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GCCCGCTAGACTAGTCTAGAG 
Sequence 1985 

AAGCTTCCTGGTTCAMTGTGCCATTTCCCGGGTTGATGCTGCCACACTTTGTAGAGAGT 

TTAGCAACACAAGTGTGCTTAAGTCAGGCGGTAGGGAAATCCCTCACTAAAAGCAGGAAG 

AAAGGTTCCAATTCAAAAGGTGCCCAATGGATAGGAAGTCAAACAAGGAAAGGGTTAAAT 

GGTTNGGAAAACCACAAATCAGGGTGGTGGGATTTGGGTGCTTACTTTTGNAACAAAAAA 

GGGTCCCCCCTGGTGGGTCT TTTTG GTTCAAACAATTGGTACAAATTGGTAAGAAACTTC 

TTGTCCAMCCACTTAAATTTATTTTTGGTCTTTGGAGGTTTTTACTTACAMG 

NGACTATTGGGAATCCCCGGCATGCCCTGGAATTCACTAAAAGCCAAAGGGGGTCTTGTA 

AGGCCAACGCTGCTTCTTCTGGAGGACTTTCCATTTCCTTTTCTGGATTGGGCAACACCG 

TGCAGG CTCATGGACAAATCTGGTAGGGATAACAAATTCAGTGGTGGGANTTTCCACTTC 

i i 1 1 1 1 CMGTCCTTTCATGTTAAAAGGAATTTTAAGAACACCCACATACAA 

Sequence 1986 

GNGTCGACCACGCGTCCGGAGCAGCAGCCATGGCCCTACGCTACCCTATGGCCGTGGGCC 

TCAACAAGGGCCACAAAGTGACCAAGAACGTGAGCAAGCCCAGGCACAGCCGACGCCGCG 

GGCGTCTGACCAAACACACCAAGTTCGTGCGGGACATGATTCGGGAGGTGTGTGGCTTTG 

CCCCGTACGAGCGGCGCGCCATGGAGTTACTGAAGGTCTCCAAGGACAAACGGGCCCTCA 

AATTTATCAAGAAAAGGGTGGGGACGCACATCCGCGCCAAGAGGAAGCGGGAGGAGCTGA 

GCAACGTACTGGCCGCCATGAGGAAAGCTGCTGCCAAGAAAGACTGAGCCCCTCCCCTGC 

CCTCTCCCTGAAATAAAGAACAGCTTGNCCGGATAAAAAATNAAAAAAAAAAANAAN 

Sequence 1987 

GCCACGCGTCCGCAGGGAACGTGATTAGTGAAAGGAAGATAAACGTGGATGTTACTCCAA 

MCTTCGTTTMTGAATGCTTAAAGMTTCAAATTTTATCTGCCTCTCTTGTMTTTGGA 

TCTCTTCTTAATGTACATAGTGCTMCATGAAGACCTTTTTCTGCACTATATGCAAACAG 

GGTAACTMCTAAAACAAAGCCACTTTCAATCTTCAATCCTTGAAGGTATATCTAGGTTT 

ATGACAGTAATTGTGTTTACATTTTATGGTGCCTAGTATTGACAAAATGTTATTTCCCTA 

CATTAMCATGACTCCATAGACCTTTTCATTTGTGGGGTTTTTATTTCCTATGATGTATA 

CTGCCACTAACCTTNCAAAAATTACTTAGTATTGCAAAGTCAGGGAATCATCAGGGAACG 

TTTAGCTTGGCCAAAATACTTGGTCTGGTTTTAAAAACCCTGTCNAGGTCTACCAAACCT 

GTTCAAGGTCTACCCAATTTTAAGGGGCAAATTTGGGGGNAAAAAGGAAAAAAT 

Sequence 1988 

GGTGTCGACCNCGCGTCCGCGAGTCCCGCGTTCTCTCCTTGAATCCACTCGCCAGCCCGC 

CGCCCTCTGCCGCCGCACCCTGCACACCCGCCCCTCTCCTGTGCCAGGCAAGGTGACCCC 

ATGGCAAGGCGCAAGCCAGAAGGGTCCAGCTTCAACATGACCCACCTGTCCATGGCTATG 

GCCTTTTCCTTTCCCCCAGTTGCCAGTGGGCAACTCCACCCTCAGCTGGGCAACACCCAG 

CACCAGACAGAGTTAGGAAAGGAACTTGCTACTACCAGCACCATGCCCTACCAATATCCA 

GCACTGACCCCGGAGCAGAAGAAGGGAAGCTGTCTGACATCGCTCACCGCATCGTGGCAC 

CTGGCAAGGGGCATCCTGGCTGCAGATGAGTCCACTGGGAGCATTGCCAAGCGGCTGCAG 

TCCATTGGCACCGAGAACACCCGAGGAGAACCGGCGCTTCTACCGCCAGCTGCTGCTGAC 

AGCTGACNACCCGGGTGAACCC 

Sequence 1989 

CGTCCGACAACATTTGGCATNAGGGTTGTATCTGTTGGTGGAGGACACAACGCCAAAAGG 

AMTGGGATTTCCTGGTTAGGCCTGCGGCTTGGCAGATGATTGTTATGGGAAAGACACTG 

AGTCTGTTTAGGCAATTTCTTCCTCTTTACTMTAMGTGTTCCTATTTTGAAAGCAATG 

CTGAGTTGTGGACATGTGTATAAACCGTAATGCTGTAAGTTAGGCCTCTCTTGTCTAGAA 

TCTC AGCCT CTTCATACTCTCTCCCCCT7TTTTGCCATATTATTTTCCTTATCACTAACT 

ATATATTTTACCTGTCTCACTTACTATCTGTCTCTCCTCTCAGAGTGTTACTCCAGGAGG 

GCAGAGATAACTGTTTAGTCCAGGCTGTGCCTTTCGGTGCTCAAAATAATGCCTGGGTGC 

AAAAT AAATATATCTTGAATGAAATAATGAAGTAATCTTTAAAATGGTGCTCCGAGCATA 

ATTTTCTATAGTAACACATAGTTTGCAATTAGTTGTCTCTTTCAGGGATAATGGAAATTG 

GTCAACATAAGAAAA 



# 
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Sequence 1990 

NCCCCGCGTCCGMTTTGTTTTATGGATTGMTNATGTCTCCTCTAAAAAGATTTA7TGA 

TGTTCTMCCCTCCATGTCTCGTATTGTAACCTTATTTGGAATAGGGTTGTTGCAGAAGT 

AATTAGTTACATTAAGATGAAGTCATGCTGGAGTAGGGTGAGCCCCTAATCTAAGAGGAC 

TGGTGTCCTTATAGGAAGAGGAAAATGCCACATGAAGAGAGAGAGAGATGCCCAAGGAGG 

TGTCATGCTGCAATGAGGGCAGAGATTGGAACTGTGCAGCTGCATGCCAAGGAACACCAA 

AGATTTCAGCAAACTACTGGAAGCTAGGAAAAAGCAAAGAATTCCCCTAAATATTTCAGT 

GGGAGCATAGCCCTGCCAACACCTTATTTCAGATTTCTATCCTCCACAACTATGAGGCAA 

TMGCTCTGCTGTTTTMGCTCCTCAGTGTGMCTATTTTTGTCATGGCAGCCTTAGGAA 

ACTMCACMTTTACAAACAGATTAAGTTCACTAATCAAAAGACAGCCCTTGCAATTGGG 

Gl I I I I I I I I 

Sequence 1991 

CCGCGTCCGGCGGGTGAGCCGCTGGCGCGCCGGGCCGGGCGGGGGATTGGCTGAGGGCGA 

CGCGAGAGAGGGGAGACCCGGACTGAGGAGAGGACGGGGTGGAGGGTCCCGGCCGGAGGC 

TAGCCTGAGGAGACCGGGGGCGGAGGGGAGACCCGGGCCGCGGAGGAAAGGGATGGAGGA 

GAGGAAGCCGCCGGGCGCCAGCGGGACCCCCGGGCTGAGGGGAGAGGCGCCCCAGGCCGG 

GTGAAAAGTGGCCGAGGAGACCTGGGCTGGGCTGGCAAGTCCCGGACCGGGGAGGAGGGG 

AGCAGCCCTCCGATGTGAGGGATCGCAGAGGAATGAGCTTCGTTCTGGATTAAAAAAAAA 

AAAAAAAAAAA 

Sequence 1992 

GTCACCACGCGTCCGCAGCGCACGCCCCNTGCCCTGAGAACAGGAAAGGGCCCGGAAGGG 

CTGACTCACCGGGCCGACNCTCACACGAAAATGGGATGCACTTTATTTGCTGGTGCAAAG 

GCAGGTGAGGGTGCCTNCTGNGTGACCGNTGGCCCTNCTGCCTGGNGGCGCTGAAGGGAA 

GGAGCCAGTGAGCCTGACCCCGGGAGGGGCCGTCCCCGTGTGCCGCGTTNGGCGGGGCCC 

CACGCGGCTCCCCCANGCCCGGGTCTGGGGCCCCAGGCTTTCCCCTGNCTNGNGGNCGNN 

CNTNCCNNGCTTTGGGNTNCTTGNNTNGGNNTTTTTAATGCCAGNNNTCANNACATAANT 

GCNTTNTGAAAGAGGTTCCAGCTATCACTTGTAACCATATATATATACATATATATTCTA 

TCTACAMGTGTTTATTTNCAAAGATNTTTCAACGGTGAATTCAGTCCCCGGCCGCCCTT 

NTGACCATCTGTNCCCNGTCCTTGTCCCCGCCCCGGGG 

Sequence 1993 ' 

TCNCCNCGCGTCCTTGATATTTTGAGAAAAATCATGTGAGTCA l I I f I I CTGTTTCTCTT 

TTCTCTTAACGATTATCACTGTAATTCTGAATCTGAAAGGTAAAACAATTAGTCAAAATA 

TTATTGCCATCATTCTACCTGTGTTATGAMCTACTTATTCATAGTTAATTCTCATTAAC 

ACTTACATTTCCATAMGAAMCTCAAGTATTAATAAAAGAGACTTTACTGGCTTAAGAG 

GGCTGTGAMGATTTTTGATAGTGAATCATGACCCTAAGGGAGAGATTTGTGTGATAAAA 

GTATTGTATATMTAGATCAGCGATTTTTGTAAGGCAAACAGAATTTGTAAGTTGGCAGA 

TCTTCCTAAGTTGCAAAATGTAATGATGAGCTTGGTGGGAGAAGAATGAGTCGTTCTTGG 

MTACCTATGTGCAGCCACTACCCATCTCMTGTCACCTTGTTTGCATTCTTGGATAGCT 

TGTATATGTAGTAGTTTGATGAATMTTTAAAGAAAAACACCTAAAATTTGAAAAATGAT 

TGTAGGGATCAAAAAAGGCAGATGAAATTAC 

Sequence 1994 

CACGCGTCCGCTGACCTGACCCNTTCCTGATCCNAGGCCNGGTGGTTGTCTTATTGCACC 

ATACTCCTTGCTTCCTGATGCTGGGCAATGAGGCAGATAGCACTGGGTGTGAGAATGATC 

AAGGATCTGGACCCCAAAGAATAGACTGGATGGAAAGACAAACTGCACAGGCAGATGTTT 

GCCTCATAATAGTCGTAAGTGGAGTCCTGGAATTTGGACAAGTGCTGTTGGGATATAGTC 

MCTTATTCTTTGAGTAATGTGACTAAAGGAAAAAACTTTGATTAAAAAAAAAAAAAAAA 

AAAAAAAGTGCGGCGGCCGGGCGCGCCGGGGTGGGCGGCGAGCGGAGCCGGCCGGAGCGG 

GCCGGGCA 

Sequence 1995 

TCCGACNAAGGAACAAAAGCGAAACACACAAACCAGCCTCAACTTACACTTGGTTACTCA 
AAAGAACAAGAGTCAATGGTACTTGTCCTAGCGTTTTGGAAGAGGAAAACAGGAACCCAC 
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CAMCCAACCMTCMCCAAACAMGAAAAAATTCCACAATGAAAGMTGTATTTTGTCT 

TTTTGCATTTTGGTGTATAAGCCATCMTATTCAGCAAAATGATTCCTTTCTTTAAAAAA 

AAAAAATGTGGAGGAMGTAGAMTTTACCMGGTTGTTGGCCCAGGGCGTTAAATTCAC 

AGATTTTTTTMCGAGAAAAACACACAGAAGMGCTACCTCAGGTGTTTTTACCTCAGCA 

CCTTGC TCTTGTGTTTCCCTTAGAGATTTTGTAAAGCTGATAGTTGGAGCA I I I I I M AT 

MINI AATAAAAATGAGTTGGAAAAAAAATAAGATATCAACTGCCAGCCTGGAGGAAGG 

TGACAGTCCAAGTGTGCAACAGCTGTTCTGAATTGTCTTCCGCTAGCCAAGAACCTATAT 

GGCCTTCTTTTGGACAAACCTTGAAMTGTTTATTTAAA 

Sequence 1996 

ACGCGTCCGMGAGGTATATCCCATACCTTTTCCCCAAATTCCCTTGTCATTAATTTTCC 

AATTATGCTTCTTCCAAGTGCCTGACCATCCAGCCAAACCATTGGCTACAGCCCATGAAT 

CAGTATACAATGGCGCTACTGGTCATTTCTCTTTCCATACAAAGTGCACAACCTGGTATA 

CTCTTCAAAGTTCTGCCCACTGGGAAGATTTGCCTTCACCACTGTCCTTCAGGGATAGGG 

GCTATAGAGCTGCAGCTGTCCACTTTCAGGTGGTGCCTGCATATCGTGCATAACCATCTG 

TGAACCAAGCCCTAGTCTTGTCTTCCTCTGTCAACTGATCATAGGGAACTCCXJCATGAGG 

CCATCAGTGCAGGCTAAGGGAAAGAAGGCAAGGTGGCAGGAGTGGAGACCATGGACATTT 

GAGCTACTTTCTTAGGTAACTTACTTGTGTCTTCAGGACCTGCTGAAGCCAGATCACATA 

TATACCACTTTAATT 

Sequence 1997 

CGCCNCGCGTCCGCTACACTTAAGGATGAAGAGAGGAAAAATCTACAGTGTAGGCACAGA 

AATGCCTAGAAATGAGAGAAAAAAAAAACAGGAGCGCATACTGCCTCAGAAACACCAAGA 

GTAGTTTGGGAAGAAAGGAGTCAACAATAACAACTATCAAAGGAATGAATGGCCAACAGT 

GTAGCCTCAAGTAGGGTAGTGTGAGTAGTTGATAGACCAAAAATCCTCAGCAAAATACTA 

GCGAACAGATTCAACAGCATACTGAATGGATTATACACCATGGCCAAGTGGGATTTATCC 

CAGGAATACAAAGGTAGTTCAGCATTAGAAAATCAATCAATGCCAACACACCTCATTAAC 

AGAAAAAGTAAAGAACTGCATACATGATCCCACAATTGATGCTGAAAAGGCATGACAAAA 

TCCAACATGCTTTCATGATTTAAAAAAAAAAAAAAAAAAAAAN 

Sequence 1998 

CCACGCGTCCGCACACTTGACCCCAGAGATCACGCCACTGTCAGCTGCCCTGGCTCAAAC 

AATTGCCCAGGGAATGGCACCTCCACCTGTCTCCATGGCTCCTGTGGCTGTATCTGTGGC 

TCCTGTGGCCCCTGTGGCTGTATCGATGGCCCAACCCTTGGCAGGAATCACAATGAGCCA 

CACCACCACTCCCATGGTGACTTACCCTATCGCTTCCCAGAGCATGCGCATCACGGCCAT 

GCCACACTGATGGGGCTAATGGACACTCCCCTGGTATAGCCTCGCAGGGCTGGGGTCAGG 

GGGCCCTTGCCCACTCACCTAGCCTTCCCCATCCCTGTCTGAAGGGCTCCCTTGAGAACT 

AGGACAAGAGACTACAAGGAGTATGTCCTGAGGAGGGGTTGGGATGGTGTGGTTTTCTCT 

CACCTCCCTTTTATGAGGGTCCTCTTGTCCATCTTCAAGCCTCACAGTGGGGGGCTT 

Sequence 1999 

NNGGCAGGAGAGGTTCAAATGCATTGCATCAACCTACTATAGAGGAGCTCAAGGTAATGG 

GGCTGGGTGAAGTGGGGTAGGTGGGTCTCAGAGTGCACATGGCTTCTCATATGGAGCTGG 

AAGGATTGGGGAAATGAGCAGTAGTGTCTTCCCTGTCAACCTGGGGCTGTTTNTGCCACT 

CTTCCAGCCATCATCATTGTCTTCAACCTGAATGATGTGGCATCTCTGGAACATACCAAG 

TAAGTGAGCATCCTGCAATATAATGGGAGGCTCCG 

Sequence 2000 

ACGCCTCCAGCCTGGGCAATAGMTGAATGAGACTCCATCTCAAAAATAAATAAATAAAT 

AAAA TACT GAGMAMGAATCTTTATTGTTTCCTGTAAMTAMTTTTCCTTTTAGCAAA 

GCTCTTTTCCCTTTGACTCTCGCCGCCTAGATTTCGGTACCAGGACCACACATTTTTAAA 

GATGCTCCTCACCGCCGTACAGCTCCTGTACAGCCCAGAAAGCTCCGTGCGCACGAAGCT 

CATCCAGCTCCCGGTGGTCTACGTGATGCTCATGCAGCACTCGCTGTTTCTGCCGACTCT 

ACTGACGTCTGACGGAGAGGAGAGCCCGGACAGCCAAGTAAAAGGTGACCCCTCACCCCA 

GCCCTTCCATTCCGTATCCAGATTTTATCTCCTGATTTCCTTA 

Sequence 2001 
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CCCGCGTCCGCCGAGATGGCGCCCCTGCACTCCAGCCTGGGTGACAGAGTGAGACCTTGC 

GTTTAAAAAAAAAAAAAAAGGAAAAGGAAAAAAAAGTTCTACTGTGGGTAAAATGCTATC 

ATGCTATCAAACAGCATCACATGCTACAGAGAAGTCTTGTGAAAGGATGTCAGTGTGGCA 

MCTTCATTGTCTTTTAAGAAATTGCCACAGCTATCGTAATCTTCAACCACCACCACCCT 

GATG AGTCAGT AGCCATCGTTAACGTTGGCGTAAGACCCTCCACCAGCAAAAAGATTGTT 

AGCATTTTTTTAGCAGTGATGTATTTATCMTTATACATAATGCTATTGTACACTTCTTA 

ATAGACTATGGTATAGTGTAACCATAATTTATGCACTGGGAAACCAACAAATTCATGAGA 

TTCGCTTTATTGTANTRGCTCTGGAATTGAGCCTGCCCGTGTTTCTTGAGGTATACCTGT 

GTATTAGGTATGTTCCTTGTATATAMACCCCATTTTTTAGGGTAGGAAACTAANGCTTA 

GAGTTTGAGCCAA I I 1 1 ICCCATA 

Sequence 2002 

CGTCCGCACTGTTCAMCCCTGTCATGCTTTAAAACTGATGCGAGATGATTTTG 1 M 1 1 I 

GCATAATCAATACTTAAGGGTGCAATCAACTGTTAGTAATTGTGCAGTANAGTAAAGCCC 

TGTGGTGTATCAACTACTAGTTAAGAGTCTCAGTTGATTTCTGTAATGTTTGACCTAATA 

ATAGCCCGTTTCGTCTCTGACCCAACAGAGGAAGCACAGATCAAATCACCTTGGAGTGGT 

CACCAGGGGGACAGGGAGCCCCCCACCAATGTATCAATGGGTGATTTATGATGCCTTCTG 

CCCTTTG GCGAG TGMTGGGTTTCCCATAGGGGAAGTTNGGCCTCCCTCCGTGAGCTTTG 

GAAMTGTTTTTCTMTAGACACAGGGGAGGCCAGTTTTTGTTTTNANAGCMTTATT^ 

TTC CCAAA TTCNTCTGTTTTGGGNGGTNGGAACCTGNGGGGCCCCCGGGGTTTCTGGTTT 

TCCTTTTTN CNGCN GGAAAATTCTCCTGGCTAAAANANTCCC I I 1 1 I i I I I I NGGTTTNC 

CANMG CCCTTTTTATAAATGCANNATANTGAATNGCTTGGGGAACNNAAAATAAANTTT 

TTTTT TCCCANTCAAAAAAANAAAAAAAAAAANAAAAAAAAAANGGCGGGGCCCCTTANA 

TTTTTTTAAAAAAAAAAAACCCCC 

Sequence 2003 

TTNCCATCCCTCAGGTGCTGAGAACCAAAGTGCCTAGAGAAGGGCAGGAGGAGGATGACG 

ACGATGAGGAAGACGATGCTGACGAGGAGGCTCCCAAGCCCGACCATTTTGTTCAGGACC 

CTGCAGTGCTGAGAGAGAAGGCAGAAGCCAGGCGCATGGCCTTTCTCGCCAAGAAAGGGT 

ACCGGCATGACAGCTCAACAGCAGTGGCCGGCAGCCCCCGAGGCCATGGGCAGAGCCGCG 

AGACAACCCAGGAACGCAGGAAGAAGGAAGCCAACAAGGCGACAAGAGCCAACCACAACC 

GGAGAACCATGGCCGACCGCAAGAGGAGCAAAGGCATGATCCCATCCTGAGACCTGGTGC 

AGGGCCAGTGGGGAGGCAAGC 

Sequence 2004 

TNCACATTTTCCTCCTAAGCACATGGGTCATTCTCAAGGAGAGACCATATGTTAGGTCAC 

AAACATTCAAAAAATTGAAATATTATCAAGCAACTACTCTGACCACAATGATGTAAAACT 

AGAMTCMMCCMGAGGMTTTTAGAAACTATAGAAGCACATGGAAATTAAACAATCT 

GTTTCTTAATGACCAGTGAGCCMTGAAAAAATCMGMGAMTTTGCAAATTGTTTGAA 

AGAAGTTATAATGGAAACACACTATACTAAAACCTATAAAACAGCAAAAGCAGGACTAAG 

AGGAAMTGTATAGCTATTAMGTGCCTACATTTTTAAAAAATGAAAAMCTTTAGATAA 

ATCACTTATCTATGAATTTGAAAAAAAAAAAAAA 

Sequence 2005 

CGGGMGCTTTNTTTTAATTMGCTGAMCCGAAGCTT^ 

ATGTGCCCATATTGGACAGTATAGCTCCAGGGTTTATTAAACCACCATTCCTGGTGATAG 

GATAGAGCCCAGCACAAACTAATCTGGGCAACAAATCAAAGGGCACAAGTCGCATGTAGG 

TTCCAAACTCAGTACATTTTAGAGAAAAGGAGTTTGATTATGTAGTAGAAGGAAGAACTG 

CCTGGTGGGACTCATGATCTTCCTTTAAGAGCAAGGCTCAAAGACCTGGGGAGTTTTGAT 

TTGATGCTATGATGTCTCCTGGGGCTCAGAATATTATGAAATGAGGAGTGAATCTCTGAG 

TGAAAAGAACTCAAGCTGCTTGTTGCATTGCGGAATGTCTCTAAGAGGTAGAGAGGCGTC 

TGTTAAGTGGCTTTGTATGAAGGTTTCAGAAGGTAAGATGAGCC 

Sequence 2006 

NCATTTAACTGTAATAGATATAGAACGTATCCTTCAATAACCMCMCAAATGCTTGTTT 
CCAACAGGCTCTGATGGTTGTATATANGGAAACCTTGATACCAATACGTCCAGCTGACCC 
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AGAAAAAAATCAGAGCTTAGTTAAATCACTGCTGCTCAAGGCTGTTGTATCTGGNNACGC 
TCGAAATGGAGTTGCACTCACTGCCCTGGATCAGGATCACGTCGCAGTCCTAGGAAGTCC 
ACTAGCAGCTTCT AAGG GTAACTGACATCCAATTCATTAAGMGGCCTTTGTATGTTTCC . 
TTCACGGTGTACCTTTTAGTCTTCC^^ 
Sequence 2007 

GTCCGGCCCTGCACTAGGGACTGGGGAGGACATGGAGAATAATATATGTACCAGCACAGA 

CAGACAGGCAGCTTCTGGAGCCAGACTGTCTGTGTGATCTTGGGTAAGTAATCTAGCCTA 

ACCATAGAGATGATAACAATTGTACCTGCCACAGGCTCATGGGGCTATTGTGATGGTGAA 

ATGTAAAGATGTAAAGTATAGAGAAAGGATGTAAAACTGCCCGACATATAGTAAGGGCCA 

TTTAAMGATGCCTGTTATTGTCCCTCTTCTTGAGAGCATATTAAGGGCTCAAATCTAGC 

CAGAGTCTGGCGGGTACAGAGGTAAACAAATAATGGCAGGTTGGTTAAGTACTGGACACC 

AGTGATGTCCAAGGTCTCTAGGATGCCCTGGAGAAATAATGAACTCTTCTAAGATTGTTT 

GGMGGGGATTTATAMGGACATCTAGGCTGGGTTTTGAGTGAGGTATGGGGAGTTCGCT 

GGTCAAGAATGTANGMTTACAAAAGCANAAAGTGCCTTATTTNAAAAAAAAAAAAAA 

Sequence 2008 

NCCCCGCGTCCGGAAMTATTTTAGAMGCACCTTGMGATTAGTATTTTTATGTAACTT 

CTGTTGGAGAGATGTCTTCAGGAGACTGAAGTAGAAGAGCGACTGTCAAAATGGAAAGTC 

CCAGAGACATCCAATTTATGTAAATCAACATCACCTGAATTCAGAATCTCATCCAGATTT 

CAACAAAGACTTCTGAATGCCAACAAAGAAGAGGACTGAATTTACAGACTCTCACTCTAA 

CAATATATGCTGTTCMTTTGAAAACAGMTAAAATTATTTTGGCAGAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2009 

AGAGTGTNCAGCTGGCTTTCCTCACTTGGGAAAAGGGGTACTGCCAGTCTAAGCAGCCTC 

CTCTGTACTCAGCCAGGACACCCAGCGCGTGGGACCTGTTTGTGTCTGTTTTGCTTCCTT 

GGGMCGGCACAGTCACTCACCCTGCCATTTGCGGAAATGACCTGGTGCACTTTGACTGT 

TAAGCAATGCGT TATTG CTGTAGTCAAGGTTAGTGCAAGCAAGGAAACATTCCCAGTAAG 

GTATTTGTTTCCATTTTCTGTCTGTGCTTCTGTCAGAMCTTGCTAGGACTTTAGTGGCC 

AATAAAAAAGAAATTCTAGCTTGATCGACCCACGCGTCCGGTAATGATTGATGAGGACAT 

GACTGCCCGAATTAGCATGGCTGATGTCAAGTTCTCTTTCCAATGTCCTGGTCGCATGTA 

TGCACCTGCCTGGGTAGCCCCCGAAGCTCTGCAGAAGAAGCC 

Sequence 2010 

GCGTCCGACAAACATAATGATCCGTGACAGTGACCTTAATATATAAATGTGTGTGTTGTG 

ATTGTTCTACTGACCAGCCATTTCCTGTCTCCCTCTTCTTATGCCTCCCTGTTAAATTAG 

ACACAACAACATTGAAATCAGACCAATTAATAACCCTACAGTGGTCTCTGAGAGAGGCTA 

ACAGCTAATCCTCTTGTGCTAAACAGCCAAATTGTGAATGCAAAGGAAAAGTTCTCAAAG 

GAAATTAAAAGTGCTACTCCAGTGAACACATGAATGATAAGAAAGTGAAACAGTCCGGGC 

GTGGTGGGTCACGCCTGTAGTCCCAGCACTTTGGGAGGCTGAGGTGGGTGGATCACCTGA 

GGTCAGGAGCTCAAGACCAGCCTGGCCAACATAGTGAAACCCTGTCTCTACTAAAAATAC 

AAAAATTAGCTGAGTGTGGTGGTGCACACTTGTAATCCCAGACACTCGGGAAGCTAAGGC 

CGAGAGAATCACTTGAACCCCAGAGGTGGACATTGCAAGTGAGCTGAGATTGTGCCACTT 

GCACTTCAACCTGGGTAACAGAGAGAGACTCTGTCTCAAAAAAAAAAAAAAAA 

Sequence 201 1 

GTCTATTAACATACGGAGCTGAATCTGCCGCAGCTNGAAATGCTCAAGAACCAGCTGGAC 

CAGGAAGTGGAGTTCTTGTCCACGTCCATTGCTCAGCTCAAAGTGGTACAGACCAAGTAT 

GTGGAAGCCAAGGACTGTCTGAACGTGCTGAACAAGAGCAACGAGGGTATGGGGTAGGCG 

GGTGAGGGTMCCTAMGTGGCGMCCTGCTTCTCTCGTCCCACCTCCTAACCCAGTTTT 

TCTTACCTGAAACGAGAAAATCCATTACATATCGTATACCGCTTCATGAACCCTTTGCAT 

GTTG CCTG CCTAGAATTGAAAAGTACAGGACATTCCTCTGCTCCTATTGCCCCTGTTTCC 

GTTCTTTTCACACTGTCTGTGGGTGCTGTGCCCTGTTGGAACTCTCTTTAACGTCTTACC 

GTTGGAGCCGCTTACCTTCCCAGGTGTTGTCTTCATTGGCTTTCACAAGGGAAAA 

Sequence 2012 
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NGGCAGCTGCTTTGTCTGGGAGGGGGCTTTGTGTCGAGTCTCCCTGAATGAGCAGGGCTG 

GCGACAGTTGTCAAAACACATGGTGCTTGGTCAGAGCCCCCGTAGAAGCCCCTTGTCCTC 

CGCATGGCCTCCGCCTGCACCCGGGGCGTGGAATGTGCTCTTGTGTGTCCCTGGCTGTCT 

GCTTGCTTCTACACT GGCCC CTGCAGATGGAGGGGGTGGGGTACAGGGGTTCCTATAAGA 

AGCAGACACTTGGGGTTTTTCCCAGGCCCTGTTGCAGGAGGGTGCGTGGGCTGGTTTCCC 

TGAAGGCGCCTGGGCCGTGTTGGTGTTAACTGATCTGAGATCTTCTGTGGCCCTGATGTC 

TATGAGCATGCCCCAACTTGCANGGGGCTGAGTAGCCCGGGCACCACCAGGAGGCTTGCG 

TGCCCTGTGCTTGGGTGTACCCATGCCCTGTCAGCATCGTTGGTCTGTTAGGGGTCAGGG 

ACTTCGGCTTCTTGTTTAATACCCTNC 

Sequence 2013 

AAGTCGGACGCGTCTCAGTTCCTTAAAGCTGTTGGTCCAAGGCTACCCTCAGTTCCTTGC 

CACATGGGCTTCTCCMCAAGGCCATTTACATTAACAAAGCCAATGAGAAAGAGAGTTAC 

CGGGCATGGTCGCTCACACCTCTTAATCCCAGCACTTTGGGAGCCCGAGGCGGGTGGATC 

ACTTGAGGTTAGGAGTTCGAGACCAGCCTGGCCAATATGCTGAAACCCCGTATCTACTAA 

AAATACAAAAAAATTAGCCTGGCATGGTGGTGGGCACCTGTAATCCCAGCTACTCAGGAA 

GCTGAAGCAGGAGAATCACTTGAACTTGGGAGTCGTAGGTTGCAGTGAGCCAAGATCGAG 

CCATTCCACTCCAGCCTGTGCAACAAGAGCGAGATTCCATCTCANACAAAAAAAAGATNA 

ANNAAAAAAAAAAAANT 

Sequence 2014 

TCCTATTCCTCTCTTGAGTTTATGCACATCTCTATAMTCATTAGTTTTCTATTTTATT^ 

CATAAMTTCTTTTA GAAAA TGCAAATAGTGAACTTTGTGMTGGATTTTTCCATACTCA 

TCTACMTTCCTCCATTTTAAATGACTACTTTTATTTTrTAATTTAA 

GTATCATGAGTAGGTCTTACATCAGTGATGGGTTCTTTTTGTAGTGAGACATACAAATCT 

GATG T TAATG TTTGCTCTTAGAAGTCATACTCCATGGTCTTCAAAGACCAAAAAATGAGG 

TTTTGCTTTTGTMTCAG6AAAAAAAAMTTMTGAACCTTAAAAAAAAAANANNAAANA 

ACANGTGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnGCA 1 1 1 1 1 I II NACTGCANTC 

taagttggggggttggnccaaactcatcaannnatcntaatcatggctgggatccccngg 
gtcccgngcctcggamttamttnccttttttccggttttccttggggtt™ 
tgnggngaaan"! i i i i i i i tgggggggggggggg 

Sequence 201 5 

CGCGTCCTGAGAGGTCAGGCCGGTCCTGGGGGCAGCAAGCCCGGCCACACTCCCCACCGC 

GACCGGGGCTCTGGGCTCGCTTCCTGCTTCAGTTTCCCCAAGCTCCNGATGAGACTCCGC 

TACTACCACCACGTCGATAACGCAAACCTAGAGGGACTCAGGGTAAACTGAGGCACTCAA 

ACTGCCGAGGAGCTCCGCCTCCCGAGAGACATTTAATCCGGGGGGATTTGCAGGAAACTT 

CTAAATTAAGGGTAGCGGCTGCTGCAGCTGAGGGGGGGCACACCGGTCCCTGCGCCCGGG 

CAGCTGCCGTGAGCTCACGCCCCGAAATAGCCCCAGGGGCCCCAGCCGCAGCTGCCACTG 

GGTCCGGCTG TCACT CAGAGGAAGCACGGAGCCCCCAGCCCAAGGGTCCCCTCCCCTTCG 

CATCGCGGGGTTTTTCCAGCCGACCGTCGGCCACTTTTTCCTCCGACNGCTGGCAGGGAA 

GAGGGGGATTGGGGGCCGGGACCCCAAGGGAGGCGGTCCCCAATGGGTGGGCCAAGGG 
Sequence 2016 

ACGCGTCCGCACACCCCCAGGTGCCCGCGCTGGCCCCCCAGGCGTGGTGGCCTGCACGGA 

GGGGACCACTTACGTCTGCTCCGTCTGCCCAGCAAAGTTTGACCAAATCGAGCAGTTCAA 

CGACCACATGAGGATGCATGTGTCTGACGGATMGTAGTATCTTTCTCTCTTTCTTATGA 

ACAA AACAA AACAACAACAAAAAACAAACAAACAAAAAAGCTATGGCACTAGAATTTAAG 

AAATGTTTTGGTTTCATTTrrACTTTCTGTTTTTGTTTTTG 

ACA TGM GMCTGTTTTTTGCCTGCTGGTACATTACATTTCCGGAGGCTTGGGTGAATAA 
TAGTTTTCCCAGTCTCCCTCGGATGGTGGCCTTAAGGCCTGGTAGTGCTTCAAGAGGTCC 
ACTGGTTGGATCTCTAGCTACTGGCCTCTAMTACAACCCTTCTTTTACAAAAAAATCTT 
TTAAAAAAAAAAAAAAAAAA 
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Sequence 2017 

CGCTCCGCAGCATCMGAGTTATTTTTCTGACTGTCAAGTTTCAACATTCAGGTCTGTCC 

CCAACAGGCACCACACCGGGGTGGACTCCCTGTGTAACTTCTCGCCACTGGCTCGGAGAG 

TAGACAGAGTTGCCATCTATGAGGAATTTCTGCGGATGACCX^GGAATGGTACCCAGCTGC 

AGAACTTCACCCTGGACAGGAGCAGTGTCCTTGTGGATGGGTATTCTCCCAACAGAAATG 

AGCCCTTAACTGGGAATTCTGACCTTCCCTTCTGGGCTGTCATCCTCATCGGCTTGGCAG 

GACTCCTGGGACTCATCACATGCCTGATCTGCGGTGTCCTGGTGACCACCCCGCCCGGCG 

GAAGAAGGAAGGAGAATACAACCGTCCAGCAACAGTGCCCAGGCTACTACCAAGTCACAC 

CTAGACCTGGAGGATCTGCAATGACTGGAACTTGCCCGTGCCTGGGGTGCCNTTTCCCCA 

ACCAGGGTNCNAMNMCTTTGGCTGGGGCAAGAAATNAAANCATATTTGGTCCGGAAAA 

AAAAAAAAAAAAAAAAGGGCCGGCCCTTNAACTAAGTCTAAAANAAAAAAACTTTCAAAA 

CCTTCCCCTGGAACCTGAAACATAAAATGNATNGCAATGGTGGGNGGTNAACCTGGGTTA 

ATTTGGCANCCTTTTTANTTAAAANTTGGGGGGGGGGTT 

Sequence 2018 

CGGATCAAGACCATCCTGGCTACAGTGAAACCCCGTTTCTACTAGAACTACAAAAAATTA 

GCCGGGCGTGGTGGCAGGCACCTATNGTCCCAGCTACTCGGGAAGCTGAGGCAGGAGAAT 

GGCGTGAACCTGGGAGGTGGAGGTGCAGTGAGCCAAGATGGCACCAGTGCACTCCAGCCT 

GGGCGACAGAGGTAGACTCTGTCTCAGAAAAAAAAAAAAAAAAAAAATCAGTCACTGGAT 

TTGGGCCCACCCTACCTNCATATGACCTCATGTTAACTTGATGACATCTGCAAAGACCCC 

ATTCCCAAMAGGTCACCTTCACCAGTMGTNGGGGGTTAGGACTTGAATATAGCTTTTT 

GGTT GATGTAATTCAACCCACAGCACTGCCTTTTNCATTCCATGTTATGTTTTTTGAGAT 

TTTTGAGATTTGCCAMTATATGMGCTATAAATTATCAGNGAAMTAMTMmCAAA 

ATNTMGCTGTTGAAMCTCTAMTTAT7TTTAAGCC 

GACAAGGNCCCGNNNGGGCTTCAATGCCTNTAATCCCCANCACTTTTGANANGGCTGATT 
GGGNGNGNGGATTCCACCTTGAGGGTTTAGNGGNTTTCAAAGACCAAGCCTGGNCNCTAN 
CGGNGGTTGAAAACCCCTGGTNNTTCTNACCCTTNAAAAAAAA 
Sequence 2019 

GTTTTT TTATT TCTACTGTCAMTGATGTGCAAAACCTrTTACTGGTTGCATGGAMTCA 

GCCMGTTTTATMTCCTTAAATCTTMTGTTCCTCAAAGCTTGGATTAAATACATATGG 

ATGTTACTCTCTTGCACCAAATTATCTTGATACATTCAAATTTGTCTGGTTAAAAAATAG 

GT GGTAGA TATTGAGGCCAAGAATATTGCAAAATACATGAAGCTTCATGCACTTAAAGAA 

GTATTTTTAGAATMGAATTTGCATACTTACCTAGTGAMCTTTrc 

ACTCTAAGTCATGTATGTTTCTCTTTGATTATTTGCATGTTATGTTTAATAAGCTACTAG 

CAAAATA AAACG AGTTGACCCACGCCGTCCGGACACAAGAAAGGAATATAATTCATACAC 

TATTGCATTTTTAATAMTCTTTTGMATTTGCAGAATTAAGATTGTATTGTGTATT^ 

GGTTAAATGATMTTG MTGT AMTATTTMGATGCAGCACCATATTTTATAACCCAGCT 

TTAGGCATTTCTTCATATTTTMGGGAMCCCCCCACCTCCTTCTTTTMNGGCGCTTCT 

TGCTCTCTGAMTGCCCTGCTAMTGCCTTCTCTTMTTATTTGGMTAANGGTAGGTTT 

TTGGGGGAAAATTTTAAAAAAAAAANGGGGGGGNAAAAAAAA 

Sequence 2020 

AATTTTTNATMTCTGMTTATCACAAMAGATMGGATTTTTAAAGTTATTTGGAGGGA 

GTGTACACATTGTTTATTTAATAGTGAGGGCTATTATACAAGCNGGTCNATGTAAAATAA 

TCCCTTTATATGTATGAGCATAGTTAATTTGGTAAACAACAGACAATTACATACTGTGAT 

CATAAGGACTTTAGTATCAGTTACCATATAGCAGGTACTCTTTAGTCAGGATATACCTAT 

ATAGGTGCTAAA7TAAAAATCAACCTTATATCTCAAATTTACTTCCTAGTAGAGTGTAAC 

GCTGCCATAATTGCAAGCCTAATTATGGGGTTGTCCATACTGCAGTCCCATCAGTACTCA 

TACACCAAAGGGTCGGGCCTGTGAAGACTGGAAAAAGAATAAAATATCTTTTGATTGAGA 

TACTACAAGCAAAAATGACTTTCTGGCTACCATTACTGCAAAGAACAAAACTCAACTGAA 

AACAATATGTAGCAATTAAAAAGTCTCATTATCTTCAAAACACAC 

Sequence 2021 

CCCGCGTCCCGGAACTCTTACCCATAACCCTAATGATGCAAGTCATATGGGGGAACACTT 
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TGTAMTGGTCAGGATAAAMCCAMTCTGGGTGCCAGATCCCAGCACTACTTTTTATTA 

CTGGAGAAATGGGGGGGATAGAAMTTCTACTTTGAATTATTTAGI HUM I AAAGAGT 

GGGTTGTGTTTGTGCTTCTCCCACCTTTCAAGCATTTATAGAACATGCTGCCCCACATAC 

AMGTCMGACCACTTACTTTTATGTGACACTAGTAGTTTGGGGTTAATGGTTTGNGTAA 

AGAACAAGCTGCATATGAGTAAAGGTTACCCCAACCCNCAGTGAGGANGAAAGATGTTCA 

CATACTGGGAACTGTCCTGNCAMTAAATNTGGCCCCTATTGGGCTCTGTTTTAATTNGG 

MGNGGGCAAMGTMCCTCTTGCTTTGGGGCMCTATTTGGNNTCAAAATTANAAACCT 

TTTAG ACCCA MNTTNANNNNNNNNAAAANNGNNNNNNNANGGGGCCGGNCCCNTTNGAC 

CTTAGTTTTTANANAAAAAAAAACCTNCCACACCTTCCCCTTGGAACCCTGAAACAATAA 

AMGGAMNGCCANTTGGGGGGGNGGGTAMCCTTGGTTTNATTGGCACNCTTTATAAAN 

GGNTTNCCAAAANAAAAGGCAATTTGCCTTTCCCCAAANTTTTCCCAAANAAAAAMGGG 

Sequence 2022 

ACCACGCGTCCGGTCTGCAGAGGCCCGGGCCTGGGCACAAAGGGAGAGAGGCCTCCATTG 

TCCCGCAGGGGCCAAAATGCAGACCGTGCATCCCCGGTGACCTCGGGGACCGTNCTCTGA 

TCAGCAGGATTTTCTTGGACTCTGGGGTCCTTGTCCTGCTCAGGCATCCCTGCCCTGCTC 

TCCTTGAGGGCCCTCAACACTATCTTCCCTGGACACAAGTCTGGGGACAGCCGGGTGTTG 

AGGACCCCAAAGGGGTGACTACCTGCTCCTGGGCCCCACAGAGTCCTTGTGCTCAGTGTA 

GTGGCTGAGCTGGGGGATGCCCTGGAATTCGGAGCACACAGCACTGGCTTACTGTGGTAC 

CTGTGCAGTGAAATTGGAGACAGAATCACCAGGATGGAACACAGGTCTTGCAGGATCACG 

GAAMCCTTTTAGAGTTGTCTTGACACCACTTGATGTTGAGTGTCCGGGTGTTTGTAGGA 

TGGCCTGCACTCAGTCCAGGGGCAGG 

Sequence 2023 

CGCGTCCGCTTGACCCTGTATTTTGGGAGTCGAACGGAGAATGGAAACTGAAAGTGGAAA 

TCAGGAAAAGGTAATGGAAGAAGAAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAA 

ACGGTCACGAGTTAAACAGGTGCTTGCAGATATTGCTAAGCAAGTGGACTTCTGGTTTGG 

GGATGCAAATCTTCACAAGGATAGATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGG 

ATATGTTGATATATCACTACTTGTGTCTTTTAACAAAATGAAAAAATTGACTACTGATGG 

GMGTTAATTGCCAGAGCATTGAGMGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCAC 

CAGAATCCGGAGGAAAAAAC 

Sequence 2024 

CGCGTCCGCAG ACTTTC CCTCTGCAATAAATCCTGTAACAAAATTGCACTCGCACCCCTT 

ACGTTTATACAAATTTTTAAAATAAATAATAACTAGAAANGCAAGAGGAAACCAAACCCA 

AAATTATTAGAAGAAAAGAAATGATAAAGATTGGGGCAGAAATTAATGAATTGATACTAA 

AAATAGTACAAAAG ATGG ATGAAAG AAAAAGTTG I" I 1 1\ I I I 1 1 I I I AAAAGATAAGCAA 

AATCAACAAACCTTTAGCCGGACTGTAAAAGAGAGAATATCCATATGAATAAAATCAGAG 

ATGAMMGGAGTCATTACATCTGATMCACAGAAATTCAAMTATTTTAGAGACTATTA 

TGAGCAACTATATGCAGTAAATTGGAAAACTTAGAAGAAATGGATGAATTCCGAGACACA 

TAAAAACATACCAAGATTGAACCATGAAGAAATCCAAAACTTGAA 

Sequence 2025 

AATTATCCTGGTGTGGTGGCGTGTGCCTGTAATCCCAGCTACGCTGGAGGCTGAGGCATG 

AGACTCGCTTGAATCCAGGAGGCAGAGGTTGCNTTAAGCTGAGACCACACCACTGCACTG 

CAGCCTGGGTGACAGAGCAAGACTCCGTCTCAAAAAAAAAATAAGCTATTGATGGGCTAT 

ATATTGTTAAGCTATAGGGTGTGGTTACAGTGTCCAGTGTAGCATTGNTCGATTAATTTA 

TAGCCTCTTGTGGCAACAGCAAGCAGNTTCCAGAGATGAATACACAGCTTGAGTAGGCAG 

GGGGGAGTAGGACATGACGGCTGCCTCATCTTAACACCTTGGGCCTGATAATTTAAAAGG 

ACTCACAGTCCTCAGATAMCATGACTTTCTTTTCTCATGAGGAAACANANAAGGTGGCT 

AAAGGGTATNTCTTTCTCTCATGATCCCAAACCTATCAGGTTT 

Sequence 2026 

ACCNCGCGTCCGGGTGCTGTACCAGACCAGAGGCCAGCTCCATGTCCTCCGCGTCGGCAA 
TGATACCCACTGCCAACCAACAAAAATTGGCTGCAACCATCCCCTACCAGGACCCGGCCC 
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CTACAGGGTGAAGTTCCTGGTGATGAATGACGAAGGACCCGTGGTGAAACCAAGTGGTCC 

AGCGACACTCGCCTGCAGCAAGCCCAGGCACTTCGGGCTGTCCCCGGCCCCCAGAGCCCG 

GGCACCGTGGTCATCATCGCCATGCTGTCTATCCTCCTGGCCGTCCTCCTCACGGTCCTC 

CTGGCTGTGCTCATATACACCTGCTTCAACAGCTGCAGGAGCACTTCCCTATCAGGCCCA 

GAGGAGGCAGGGAGTGTGAGAAGATACACCACGCACCTCGCGTTCAGCACTCCTGCCGAG 

GGGGCTTCTGAGGGGTTCCAGAGGGGCCCACGTGTCCCTCCACCTCCTCCCTGGCCCAGG 

CTGCAGAGCCTGAGCTGGGACACGCCCTGAAGCTTCTGGACCCTGAGAGAGATTGGTTCT 

Sequence 2027 

GCTGCTAGAAAAGGTTAGACTMCTGGGAGTTCATMTTTGTCTTGGAGATTTAAGAAAG 

TTTTTGAAGAGGTTTTCTGTGTTGAGATGAAGAGGMGATTGAGACAAAGCACAATGGAG 

AGTAGGGGAGAGAGCACTTCAGAATGTTTGAAGGTGCTAAGAGGAAGTGCTTGACACACG 

GGAGGAACTGAGAAAGAAGGCTAGGGCCATCGGCTGGAGGGACAGGACAGGTGGAGATGT 

MGCATGGGCCTCCAGACCATGGTMGGCCACTGGATCTTTACTTTAATGCATTTGTGCA 

TGTGATTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2028 

CCCCGCGTCCGAAAAAGACATATGTACAAGTCTATTTCCTATAGCACTATTTGTAATACC 

TAGCACTATTTGTANTATCTAAAGACTGTAAACAACGCAGGTGCCCACAAAGGGAAAATG 

GTTTGACAMCTTACGMCATCCTTTTAAAGGAGTACTAGACAGGTCAAAAAGGAATGAA 

GAATGTATATATTACTATGGAGTGAATCTTCAGGGTATATGACTAAGTGAAAAAATGCAA 

GGTGAAGTANTATGTAACATATGGTACAGTTTACTTAAGAGAGAAATACAGATGTATATA 

CACATTACCTAAAGCAAAATGAGGACCCTACTGGGCTGCCATCCCAGCTGGACTGCTGCT 

GTGGAGCTCAGCATCAAGTACT 

Sequence 2029 

CGCCCCGCGTCCGGAGAGAGCTGTCTTTGCAGTTACTAGGTTTCATCAAACTTGTTTTTT 

TCAGTATGGTAGGTTTAAAMTGGGGATACATTTTTGTTTTTATTT 

TTTCTTAGGTTAGTTGGCTACTTAMTTTCTTTTTCTGAAAACTT^ 

TTAAMTTCTTATTGAC TTGC CTGAACTATTTGTAAATTACAGMATTAGCCCTTTGTCG 

TATGTGTTGCAGGTGCTTTTCCAGTTTGCCAGTGGTCATTCATTTTGGTTTATGGTACTT 

TTGATAGACCAGAATCTTTGACTTTTATTTAGTGAGATTTATCCATCTTTT^ 

CATCTTGGTATTATAGCATTAATATTCTCTTTTCT 

Sequence 2030 

CCACGCGTCCGGTTGGGACGGCACCAGGCGAGGTGTTGAGTTGGCTCGGCTCAAGGTTCT 

TCGGGGTGTGAGCTGGCATGAGGACCTGTTGGAAGTGGGATCCAGGCCTGGNGCAGNCTC 

CCAGCTGCCTCGATTTGTGCGTGTGAACACTCTCAAGACCTGCTCCGTTTATGTAGTTAT 

TTCAAGAGACAAGGTTTCTCCTATCAGGGTCGGGCTTCCAGGCTGGATGGAGTGCCCTGG 

CGCGATCTCGGCTCACCGCAACCTCTGCCTCCTGGGTTCAAGCGATTCTCCTGCTTCAGC 

CTTCTGAGCAGCTGGGATTATGAAGGGGT 

Sequence 2031 

NCCCCG CGTC CGGCTCMGGAGGTGATTAAAAGMTGTGTMTTTAGTATTTTTAAGATG 

TTTAGAGTTTTAAAMCTTACGTTGTTGCTGTCATTGTATTTAAATAGTNATAAAGAATA 

TAAACTG ATATAAGTAA I I I I I I I I AAATTCTTCAGATATAATGAAG ATCTGGAAC7TG A 

AGATGCCATTCATACAGCCATCTTAACCCTAAAGGAAAGCTTTGAAGGGCAAATGACAGA 

GGATMCATAGAAGTTGGAATCTGCAATGAAGCTGGATTTAGGAGGCTTACTCCAACTGA 

AGTTAAGGATTACTTGGCTGCCATAGCATAACAATGAAGTGACTGAAAAATCCAGAATTT 

CAGATMTCTATCTACTTAAACATGTTTAAAGTATGTTrTGTTTTGCAGACTTTTTGCAT 

ACTTATTTCTACATGGTTTAAATCGACTGTTTTTAAAATGACACTTATAAATCCTAATAA 

ACTGTTAAACCCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2032 

CGACCNCGCGTCCGGCGTTCTACCCTTCCGGCCCGTGTTCTATCCGCCGCCTCCACCTTC 
CATNCGGCGCCGGCTTTCGGCGCGACGGTCGCCGCGTTCCATCGTCGCGCGGCCCTTCGG 
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GCGGAAGGCGAGGAGGACGGCTTCGGGGAAGCAGAATACGCTGCCATCAACTCCATGCTG 

GACCAGATCAACTCCTGTCTGGACCACCTGGAGGAGAAGAATGACCACCTCCACGCCCGC 

CTNCAGGAGCTGCTGGAGTCCAACCGGCAGACACGCCTGGAGTTCCAGCAGCAGCTCGGG 

GAGGCCCCCAGTGATGCCAGCCCCTAGGCTCCAAGAGCCCCCAACCGGGACCCAACCCTG 

CCTCCCTGGGCTAGGCTCTGGCCTGGGCACTCACCCCCTGGCTTAGACACCTTCTCAAGG 

GCTGGCCTTCAGGGACCCCTGGTGGGTCTGCTGCCTGGGCCACCCTTCTGCCTGGGCCTN 

CCCTTG 

Sequence 2033 

CGACCACGCGTCCGCTACCTCAAGGNCCTGGGCACCGAGCGGGCCTACAAATCCGCACTG 

GACTACACCAAACGMGTCTGGGGATTTTCATTGACCTCCANAAGAAAGAGAAGGAGGCG 

CATGCCTGGCTGCAAGCAGGGAAGATCTATTACATNTTGCGGCAGAGCGAGCTGGTGGAC 

CTCTACATTCAGGTGGCACAGAACGTGGCCCTGTACACAGGCNACCCCAACCTGGGTGCT 

GGAGCTGTTTGAGGCGGCNTGNAGACATCTTCTTCGACGGGGCCTGNGAGCGGGAGAAAG 

CTGTGTCCTTCTACCGGGACCG 

Sequence 2034 

GGGGGGGAGGGGNGNAAAAAAAAGNGANGGACAAANAAANAGAAAAAANAAGANANAAGA 

AAGNNANAAACANNNNAAAANACNNNGGNAANCGGAGAAAAGAAAAGAAANNNGNANANA 

GAANANNANGAGGGGNCAAGGNAGAAGAAAANNNGANAGAAGGGGNNAAAGGGGGNCGGG 

AGAAAAGAAGNAAAAAAAAAANAAAANNNGAAAAAANGGGAAAAATNNGGGGAAANNGGA 

AANNAAAGAAAAAAAAAANANCNGNGGNAAAAAAAAAAAGGANNNAAAAAANNGGGNAAA 

ANANGGNNAANGNANANANAAAAAGNGGGGGGGAGGGGGGAGGGNAAGGGNNGAAAAANN 

NGAAAGAAAAAAGAAAGAAANAAANGAANAANAGANGGAGANGNAAGGNAAAAANAANAA 

AAAGNNNNNAANAANANGAAAGGGNAAGNAAAANGGGNNNNAAAAGAAAAAAAAAAAGNG 

GNAAAAAAAAAAANNNNGGAAAAAAAAAANNNGGNNANNNANNCCAAAAAGAAAAAAAAA 

AAAAGGGNNNNAAGGGGAAAAAANAANAAANANAAAAAANGGGGAGGNGGCNGGNAAAAA 

AAAAAN G N N AAAAAAAAAAAAA AANN N AAAN AAA AAN AN AN AN NG N AAN AN N N AN AAAAN 

AAAAAAAAAAAGAAANAAGNGGGNNGNAAAANACNNAANANNGANANGNANAAAANNGGG 

AANAAAANAAANNAAAAGNNANAANGAAAAAAA 

Sequence 2035 

CCCCCCGCGTCCGCGTTTTTATGTGTGTATGTACAAMCAAAATACCTTTTTGAAATTAC 

ATAAGTGATACATGCTTATTGTGAAAGAGTTGGATAATACAATATACTGTAAAGAAGATG 

MAACCACTCATAAATTCCMCCTGTAMTACTTTCGTGTTCATATTTTGGCATCTAATG 

TATCCTTTATGTATATTTAMTATATATTTTATTCAAGTATAGGATCATGTACCTCCCTG 

TTTTATAATTTTC I I I I I I AATTTTTCAGTGTATTGTGGACATCTTTTCTCATCAACAAA 

TACATCCCACAATGTTCATTTTTGTGCCTCATTATACCATGGAATAGTGACATCCTAGTA 

TG7TTAAGACATTCCTTATTAATGAACAATTAGGCTACTTCCAAI I I 1 1 AATTATAAAGG 

ACATTTTGAAGGACTTCCTTGTACATATTATTCTATTGCTTTATCATCI I I I I GGGATAA 

TTTCATGTAGTATATTTCCAGTTATTTAAATGAGAAAGTACATGGTTTATAACAAATGGT 

ATTTMCGTCCAAGTCACTGCTATCTAAAGGGGTAATTTTTMGGTATAAMTAATTTGG 

CTTATAMAAATCGTGGGAAAAATATNCTAGAMTATTTAANGATTAACTTCTAAATTGT 

AAATTGGCATATTTAATGATAGAATTCAAAAAAAAAAAAAAAAAAAAA 

Sequence 2036 

CGCGTCCGGMGAMTTGTGCACCCTCCCAAMCATACAMGTTTAAAAGTTTGGATCTT 

TTTCTCAGCAGGTATCAGTTGTAAATAATGAATTAGGGGCCAAAATGCAAAACGAAAAAT 

GMGCAGCTACATGTAGTTAGTMTTTCTAGTTTGMCTGTMTTGAATATTGTGGCTTC 

ATATGTATTATTTTATATTGTAC I Mil I CATTATTGATGGTTTGGACTTTAATAAGAGA 

AATTCCATAGTTTTTMTATCCCAGMGTGAGACAATTTGAACAGTGTATTCTAGAAAAC 

AATACACTAACTGAACAGAAGTGMTGCTtATATATATTATGATAGCCTTAAACCTTTTT 

CCTCTAATGCCTTAACTGTCAAATMTTATAACCTTTTAAAGCATAGGACTATAGTCAGC 

ATGCTAGACTGAGAGGTAAACACTGATGCAATTAGAACAGGTACTGATGCTGTCAGTGTT 
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TAACACTATGTTTAGCTGTGTTTATGCTATAAAAGTGCAATATTAGACACTAGCTAGTAC 
TGCTGCCTCATGTMCTCCMAGAAAACAGGATTTCATTAAGTGCATTGAATGTGGCTAT 
TTCTCT AAAGT TACTCATATTGNCCTTTGCTTGAATGCAATGCCCGTGCAGATTATGTGG 
CTGCTATTTTTATTTTCTGGGCATT^ 

TTCTTCACTGACTGGCAATGGNCCTTTACTGCAATAGGAAGAAAA 
Sequence 2037 

CCCCGCGTCCGGCTGGGCTTAAGGGATCTTTCCCAGGTAGCTGGGACTGCAGGCATATGC 

CACTGTGCCCAGCTGCTCCCTTAGTCTTGACTATGTAI II M 11 1 11 I I I GGTTGTTTTA 

TTAGGATAGAGTCCTAAGAATGGCATTACTGGGTTAATAGGTATGAACATTAGATCTTTA 

ATACATACAGTCGMTTACTTTTCATAMGCCATATACCTTTTTATATTCCCACCGATAC 

TATCCCTATCACTMGTATTAACTTATTTTCATCTTTGCCAATATGATCATCCAAAGTGA 

GGCAGAGGTTGCAGTGAGCCAAGGTCACACCACTGTGCTCTAGCCTGGGTGCCAGAGTGA 

GACTGTGTGTCAAAAGAAAAAAAAAGGGGGGGGTGCCGGGTGCGGTGGCTCACGCCTGTT 

ATTCCAGCGCTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGTTTGGGAGTTTGAGACCA 

GCCTGACCAACATGGAGAAACCCTGTCTCTACTAAAAATACAAAAT 

Sequence 2038 

GTCGACCCCGCGTCCCGGACGCGTGGGTCGGCCATGAACAAGTTTTCAAGTATCAGTTGA 

TTTATGATATAGGCTTATCCATTTGGTTATAAAATCATATGTTTATTACATAATCATTGA 

CAAATAGTTTCTGTATAATAACTGGCAGAGTAGCTCTAAAACATATGCAAGGAAATAAAT 

AAAGAA AAAAC TTACMTAAAGAGAGTAACTCATATTTTMCAGTTTTGTGAAAAAAATA 

G AAAAT ATTTTATGTAGCTTATAGTACATATATTTTTTTACMCAGMGAATCGCATTCT 

GATTT TCCA TATGGATCATTTCCCTATGTTGCTAGACCAGTACACTGGCAACCTGGTCAT 

ACAGCTTTTCTTGTCAAGTTGAGGAAGGTCAAACCACAACTTAACTGAGTACTCCAGATG 

ACAGTAACTGACTTG AAGATG GAAAAATATCAAAATAGAACTTTATATTGAAAATCACTG 

CTTCCATAGATTGGCATTTTTAGCTATTACTATGACTTATATMCTTATACATATMTTT 

TGAAAATAACAACTAAAAG ATGTA TAACATAGCCAAAACTGCTTAAACCATCCATTTTGA 

CCACTTGTCTTGCAGNTAGTTTTGACATTTTGTAGGTTMTGGATTCCAAATTGGTTTAA 

GTGGGCCATCTCATTCTTCACTTTCTGGNMNCCACTCCATAGATTTGGCTTTTCTTCAG 

GAAAATTAAGN7TCCTTTNCCTTTATTTGGATTGGANGGNCATTGGCCTACTGGAAAANA 

AATATGCCTTTTTAGGGTTAAAAAAAAAAAAA 

Sequence 2039 

GTCGACCNCGCATCCGACCACGCGTCCGCCAGACCTCACGTCAACCGGCTGCACCCCACT 

TTCCAGCCTGCGCCCCAGATCTGCAGCCTTCGCCCCTAGATACACCCGCCTGGTGATGAG 

GCGCTCCTCGCGTTCCTTCCGGGCTCCAGGTGTCCGTGAGCCTCCCTTCCGCCCTGGCCT ' 

CCGGTCTCTGCCTTGCTCGTGCTTCTACCACCACCCTTCCCCTCCCAACCCGGTGGATCC 

TCTCGTCTCCCCCAGTCTCCAGTGCACCGGCTTTCCCTCGTCCTCTGCGCAGTCCATCTC 

AGCTCATCTCTCCAATTCAATGCCATCATCTCTCCTCACCATCTCTCGGTGCCCTGGAAT 

GT TTGCT GTCMGATGTCCCCTGTGAAACCCACAMCGCTTGCGATTTGGCCTCCTTGTT 

TTATTTTGTGTAGTCCTACAACGTCTTGTTACTACCCCCTATTACAACACTTATAACTCA 

NAANGAAANAGNNNNNNNNN 

Sequence 2040 

CGTCCGCGGAGATCCGGCACACTGCGGACCGCTGGCGCGTGTCCCTGGATGTCAACCACT 

TCGCCCCGGACGAGCTGACGGTCAAGACCAAGGATGGCGTGGTGGAGATCACCNGCAAGC 

ACGAGGAGCG6CAGGACGAGCATGGCTACATCTCCCGGTGCTTCACGCGGAAATACACGC 

TGCCCCCCGGTGTGGACCCCACCCAAGTTTCCTCCTCCCTGTCCCCTGAGGGCACACTGA 

CCGTGGAGGCCCCCATGCCCAAGCTAGCCACGCAGTCCAACGAGATCACCATCCCAGTCA 

CCTTCGAGTCGCGGGCCCAGCTTGGGGGCCCAGAAGCTGCAAAATCCGATGAGACTGCCG 

CCAAGTAMGCCCCTAGCTTGAGTCGACCCACGCGTCCGATTTAAATATTTGTCCCATTG 

TTTGTGATTAGGATGTAAGCTTTGTGGMTGTAATTMCCCTGCTTTACGAAGTCACCAT 

ATTATAATAGGAAAAACACTGCCTAGGAGGCAAAGAGATCTGAATTCCAGTTCTGATGCT 

GCCACTGTGTAAGGAAGTAGTTTTATAAACCATGGGCAAATCATCTTGAGCTTTCTCATC 
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TGTAAAGTTAGGGG 
Sequence 2041 

TCGACCNCGCGTCCGAAAAACCAAAACCTGANTGAGATCTTGGAACCGNTGTGCGCCGGC 

CGNNCCTCTCCCANGGGACCANCCANCCCCGCGCGGTGGCCGACTGNATAGGCGGGACTG 

CGCTTCGAGGCTTAAGGACGNCAGATCGGAGGCATCGTGTGTTGTCTGTGCGGAGAAGCC 

AAAANNGTGATTACGTTTATTTGCAAGACCGTTCATG7TGTTTTCAGTTCATGGTATGAT 

TAAMCCCGATCCTTTGTTACCATGCCCTTAGGTACGAMAMTAATTGTTTNGATATTT 

GGCAGTCACCCAAAAATATC CAAAA AGCCATGAAACAGTANAGGTAAACAAGTANGAAGT 

GAAAN TMTNTTCGTCCTTTGTTTTCTTTCTGGAGGTGCTCAAAACACCCTCTCAAACCA 

I I I I ICTCAGCATAGAACCAAGTGTGGNCNGGNTANCAGCTAATATTTACNAGGGNAGAA 

ACGMCCCTNGCGATATTTAGTCACTTTGTTNCCNGGGANCACANAAAATNTTGAACAAA 

CACATGAGAACTGTCACCGATCTCTGTATTGATNACCANGGATACCCGTGAATTTTATGT 

MTATTAATCTNNGGNAGGCANGANTNTTTNNTAGGTATTTGCCTTTTCCAAGGTGCNCT 

TTCCNTA CCAAAGGAAMNGGTTATTTTAAAMCTTTTACCANAANAAAGGGGATGNCTT 

Al I I 1 1 IGGTCCT 

Sequence 2042 

NGGACTTGGTTTGAACGCGTTTTCCCCAAAGTTTATGTGTTGGAAACTTGACCCCCAATG 

CAGCAGTGTTGGAAGGTGCCTACTAGGTGGTGTCTGGGTCATGGGGGTATGACCCTCATG 

GATGGATAAATGCCATGACTGAGGCGGTGGGCTGCTTATAAAAGAATGAGTTTGGGTGAA 

ACCTCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGGCACATACCTGTAATCC 

CAGCTACTCGGGAGGCTGAGGCAGAATGGCTTGAACCAGGGAGTCGGAGGTTGCGGTGAG 

CAGAGATCGCACCACTGCACTCCAGCCTGATGACGGAGCAAGACTCCGTCTCAAAAAAAA 

AAAAAAAAAAAAAA 

Sequence 2043 

GAGAAGCCTGGGGGTCCTGGCTGAACTGGGCTGGGTGAAGGGGGCCCCCTGACCCCCTTG 

GGGTCCGGGCTGGGCTGGGTGAGGGGCGGTTTCCGACCCCCAGCCAGGTTCCCAGGCAGG 

ATGAGCTGGGGTTGGGGTGGCTAGGCCGTGGGCCTTGGGAGCTGGGCAGTCTGGGCTGGG 

CTGGGCTGGGCAGGGCGCCACATGGAAGCTGGAGGAGCAACGGGAGCGCTGGGCGTGGGG 

TGCAAATTGCCCAGTGCCTTCTGTTTCCCAGGCAGCTCTGTGGCCATGGATATGTTCCAG 

AAGGTAGAGAAGATCGGAGAGGGCACCTATGGGGTGGTGTACAAGGCCAAGAACAGGGAG 

ACAGGGCAGCTGGTGGCCCTGAAGAAGATCAGACTGGATTTGGAGATGGAGGGGGGTCCC 

AAGCACTGCCATCAGGGAGATCTCGCTGCTCAAGGAACTGAAGCACCCCAACA 

Sequence 2044 

A CACTCATCA ATTAGGTTTTATTTTTATTTCTTTCCTCTACCCCCAGAAACMGCCTG^ 

AAI I 1 1 I I I I CCTTCTCCTCTGGCGACTGTGTGATGAATCCTTTCTTGCGTGATCAGGTT 

GCGGATAGACTTGTAAGGGTGTTTGCTGCATACAGTGTAAGCATTGTGACCGCCAATAAA 

CTTCAATGGTTTCTACTGAAAAAAAAAAAAAAAAAAAAAGGACGCGTCTACTTCCCACTG 

GGTCCCTCCCACAACACATGGAAATTCAAGATGAGATCTGAGTGGGGACACAGCCAAACC 

AAATCAAAAGGATATACAAAATAACCAGAAAACAATGAACAAAATGACAGGAATAAGTTC 

TCACCTATCMTAATAACTTTGAATATGTGTTAAATTACCTACCTAAAAGATAGAGACAG 

GCTTAATGG ATAAAAAATG ACTCAACAACCGTCTACAAG AAACTCACTTCACTTGTAAAG 

ACACACACAGACTGAAAGTGAAGGGATTGAA 

Sequence 2045 

GCCNCGCGTCCGTGAGAATACACAAGGGGGCACGCTTCCAGTAGATGTGTTGGGGAAGGA 

GGAGGGCAGAGGGGACAGGGGACAGGATTCAGCTTTGTGGTGGGTCCTGAGGGTTCCTAC 

CAGGGGTAGCCAGGATCTGGGAAACAGATCAGCGACTCTAGTCTGAAGTGGCTGCCTGGT 

TCGGGGGCTGCCTTCAGCAAGATTCAGGCAGGAGAGACGGAAATAGCCACCTTCCAGGCG 

TGAGTCCTGGAGATAAAAATGGATTTTAACCTAGGACTGCCGGGAGCTGGCCCTCCGCGG 

CTGCTCAGACTAGGGCTGTGTGTGCTGGCTCTCGCCTGTTTCCGGTGTCTAACTGGCTTG 

TTTCTCTTTATGGCTTGGCTTCATTCCGACCTGGGGTGGGGCCACATNCAACCCACTGCC 

CACTGGCTGTCCGTCTGGCCTGCCCCCGCGGTTTCAACCACANTGGTGAAACAANCCTTG 
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CAAGATNTACAACTCGCAACACCGGGTCAAGCAATCAGCTGCATTCX^GGACCGGTGTGNA 

AGACCGAAGGGG 

Sequence 2046 

CCCCCCGMTATCTTATCCTTAACATTAMTTGAATTTTTTGCAAATGATCAAAAGGTCA 

TTCC GAGTAAATTCTGTTGTATAGTGCAGATGATCMGCTGAGTATTTGCCATGTTTTTA 

TTTTAGAAAAGAGATGTTGCTATAACATAAGTAAATACGATTCTCGTATGTGGCAGATAA 

ATTTACTTGTAATCTGCTCTAGAGTGAAATTATTTTTTACATATAAGCATTGTCATCATT 

CTAAGG ATTATTGAATAATGAATATAAAATGTTCTTGTGTATTTGTGTATGTGTATATAA 

TTTTTTGAAAGTTTCTTTATCCTATTGACCCTTCTCATAMCAGNAGCATATATATATTA 

TATGTAGTAGAATTTATATAGGMCATTGTCTTTTTCCCAGTAATGCTGATTCTAAACTA 

GTTATGTCAATTTCATGTAACATGACATTNAGAATAGTGGGGTGCTAAATATATTTAGAA 

ATGATTTCCAAMTTGNTGTATTTCTMCATAGAANGATATTTGTCATTTTAAAATAATG 

TAAAGAAAAAATGC 

Sequence 2047 

GCACCCCTCCCTGTTGACACAGCCTGGATCCAGAGTTCAGCAGACCTTGAGACAATGAAA 

ACAAACTTAGTMTAATCATTTTTCMTCATTGCAGTAATTATTGATTTGGACAAAAATC 

AATTGACGTCAAAACCTTA AAGT GACGTTTCTCTGCCTATGGAGTGGGTCATTCTTTTAT 

TCCTTTAGTTTCATAATAMTTTTCTTTTACTTAAAAAAACTTATAGTTTGATGAAGA 

GAGATATATACCTCATCTCAAAGAATCTTCACACACACACTTATTAATTACAAAAGGAAA 

ATCAGTAATTTTGCAGTGGAGACATATGGCCAACTCCACCTTACCCAAGTGGCTGAAAGT 

CACTGCACCAGTAATGG 

Sequence 2048 

AGAAAAGCCNAGCCAACAGCTCTTAAAATCAGAAAAAACAANGGGAGTCCTTCCTTGTCT 

CNTCTGTGNTCNCNGGCCTTGTCTCTGAGACTNTCTGTGCCCNNAANCNNTNTNNTNGCT 

NTNANCTGATTCTANTTTTGNTNCCCATGGMTCTGTCCTMGACTGGGGNTTTTTGNCA 

NATGACAGNCTTGCCNGNACNCAMTATCATMCAGCATTTNNNANCGANTTTTGCNGAT 

CMGTMNATANTTGCNTGACMTGACAGCTTTTTMCTCTTTCAAAGTCACCTAAAAGC 

TATTATTGCAGGAGGATTTANGAAGTCACATTCATTNAACACCCAAGTGCTATGGGTGAA 

NNATTCATGATAGCTTGGCCCACMGGTCATGMTTGAGGAGGGAATCTTGCTTTTCAAA 

AAANCMTGGMTGNTCCCNCCACTGAAAAAGGGNNATACGTTTTMTATTTTGGACCCT 

TCANAAAGGNTAANGAAAAAAACCCANGGTTCTTTCNAAAAAGTTAGNGAATAAGGGGGA 

ACTTAANTTTCATGGAANACAAGCCCATTNTTTNAAAAAAAAAAAAAA 

Sequence 2049 

CNTACGAACGTCTGAAACGTGGAGGAACCTTCAGTTCTGGGAACTCCCTGCCCCTTTCCC 

GGAAAATTCATGAGTAATCCACCTGTTTAGCATATAATCAAGAAGTAACCATAGGCATAG 

TATATCMGCAGCCCACACTGCTGCT7TGCCTATGGGGTAGCCACTTTTATTCCTTTACT 

TTTTATTAAACTTGCTTTCACTTAAAAAAAAAAAAAAAAAAAAA 

Sequence 2050 

C GCNTCCG AAATCCAATCCTAATGAAAGAGATTGATAAGTGTGACTACAAAAGGTTTAAA 

AC I I I I I ICATAGCAAATTATCTCAGAAACTAAATTAAAAGACAAGGGAGACCAGGTGCA 

GTGGCTCACGCTGTAATCCCAGCACTTTGGGAGGCCGAGGGAGGTGCATTGTTCTAGCCC 

AGGAGTTCGAGACCAGCCTGGGCAACATGGTGAAGCCCTGTCTCTACCCAAAATACAAAA 

ATTAGCCAGGCGTGGTGGCTTATGCCTGCAGTCCCAGCTACTTGGGAGGCTGAGGTAAGA 

GGATGGCTTGAGCCCAGGAAATCAAGGGTGCAGTGAGCTGCNATTATGATTGTGCCACTG 

CACTCTAGCCTGCATGTCCAAGTGAATCCTACATNAAAATAAAAAGTNCAAAAANANAAA - 

AAATGTGCCGGCCCGCTAGACTAGTNT 

Sequence 2051 

CCACGCCTCCGGAAATGCCTCTCTCCAGAGTCGGACCCTCACCTCCTTCCTGGAACTGCC 
TTTGGCCCCAGAACCATGAGACAATCCCCACCCTGAGAAGCTNCGATCACTGGGAGGAGA 
GAGAAAGCCTCCAGCTTTGGGATTCAGGCTTCAGAAGTTTTTAGCAGCCTTTGCTCATTG 
GAGAGGTGGGGAGGATAAGTCTATAAGGAATCCTATTTCCCCAGCTCTCCCACAGAGAGG 
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ACAAMGMGTCTTCACACCGTTGTGGAACTTTCCTGCAACTTCTGGATGCAGACAAGCC 
TCAGAGCAGACTGTTCTGGCTCCAGNGAATATCGGCTGCCAAGCTGTGAGCATCCAGGGA 
TCCNCGTCTGCCTGGCTTTCCTGAAAGTCAGAAGGCGCCTTGGTCATACTGTGTGGGGTG 
NGTNGGAT^NAGTTNTGNTCTCTTTTCTTmCCTT^ 

GCCAACACCCCTGACAGCAATTGTGCNGCACTTGGCTTAATTCACACCCTATGAATAATT 

TTTNATATTTCMCTTGGAAAAGGTGGTTAAGAAACTTTT 

Sequence 2052 

GATGGACTGTGTCATNCAGGACGGCCCTGCTGCATTGGCACCAAGGGCAGGTGTGAGATC 

ACCTCCCGGGAGTACTGTGACTTCATGAGGGGCTACTTCCATGAGGAGGCCACGCTCTGC 

TCTCAGGTAGGTCTGCAGAGTGTCCGTCGTTCCCCTCCCCCAGCTACTGTGATGCTGATA 

TGCTGCTCTGCGCAGGTGCACTGCATGGATGATGTGTGTGGGCTCCTGCCTTTTCTCAAC 

CCCGAGGTGCCTGACCAGTTCTACCGCCTGTGGCTATCCCTCTTCCTGCACGCCGGGATC 

TTGCACTGCCTGGTGTCCATCTGCTTCCAGATGACTGTCCTGCGGGACCTGGAGAAGCTG 

GCAGGCTGGCACCGCATAGCCATCATCTACCTGCTGAGTGGTGTCACCGGCAACCTGGCC 

AGTGCCATCTTCCTGCCATACCGAG 

Sequence 2053 

NCGCNTCCGGGCAGAGCCCCCGGAGCCTGGCCAGCCCTTCCGGCAGCTCCAAAGCCACAG 

GCAAGCCCCGAGGCTGGGATGGCCGGCCCAGGAGGGAGGAGGACGACGTACCTCCCGAGG 

AGAAGAGGCTGCGGCTGGGGCTGGAAGGGGGAAGCGCACAGCCCGAGGACTGCNAAGGAC 

GGGGAGGACGCGCCGCGGCCAGGCAGGGAGGAGACCGGCACCCAGACAGGTGGCGACGGC 

AGAGGAACACAGTGGCTCACGCCTGTAATCCCAGCACTTTGGCAGGTCGAGGCTGGCGGA 

TCGCCTGGGGTCAGGAGTTCGAGACCAGGCTGGCCAACATGGCGAAACTGTCTCTGCTAA 

AAATACGGAAGTTGGCTGGGAGTGATGGCACGCACCTGTAATCCCAGCTGCTTGGGAAGC 

TGAGGCAGGAGAATCGTTTGAAGCGGGGAAAGCGAGGTTTGCAGTCAGCTTGAGATCACA 

CCACTGCACTTCANCCACCTGGGGTGACATGAGCGACACTTCTGTTTTCAAAAATAAACC 

GAA 

Sequence 2054 

CTGTGTAGGACAGACTCTCTTTGACTCCCTAGGATTTACCCCAGTGCCTAGCATGTTTCA 
CAGCTTAGAGGAAAACAACATTTGTTGACTGACTTTTGATCTCCA I I I 1 I I GGTGAGATG 
CAGTGGCTTACACCTGTAATCCCAGCACTTTGGGAGGCTGAAGCGGGCGGATTACTTGAG 
GCTAGGAATTCAAGATCAGCCTGGACAACATGGCAAAAAATACCAAAAATAAAAAAAAAT 
AAATAAATAAAAAATTTAGCCAGACATGGTGGCAGGCACCTGtGGTCCCAGCTACTTGGG 
AAGCCAAATCGCTTAAACCTATGAGGTGGGAGGTTGCAGTGAGCCAAGATTGCACCACTG 
CACTCCAGCCTTGGTGACAGAGTGAGACCCTGCCTCAAAAAAAAAAAAAAAAAAAA 
Sequence 2055 

TCGACCCCGCGTCCGGGMTTTGGGGTGGMTGTGATGAGATTAAAATGTAGCTTTGGTA 

TAACTTCATGTGTATTTCAAAATATACTGAACGTCAACATGATTTGAATAAAGAAAATGT 

ATTTTCTACTTGAACCACATAACACTGTTATTTAAAACAGTTTTCTGCAGTCTAAAAAAA 

AAAAAAAAAANNAACAANNAATN 

Sequence 2056 

CGTCCGGCAGAATGGCTCCCGCAAAGAAGGGTGGCGAGAAGAAAANGGGCCGTTCTGCCA 

TCAACGAAGTGGTAACCCGAGANTNCACCATCAACATTCACAAGCGCATCCATGGAGTGG 

GCTTCAAGAAGCCGTGCACCTCGGGCACTCAAAGAGATTCGGAAATTTGCCATGAAGGAG 

ATGGGAACTCCAGATGTGCGCATTGACACCNAGGCTCAACAAAGCTGTCTGGGCCAAAGG 

AATAAAGGAATGTGCCATTCCCGAATTCCGTGTGCGGGTTGTCCAGANAACCGTAAATGA 

GGGATGGAAGATTCACCAAAATTAAGCTATATNCCTTTGGNTNCCCTATGTACCTGGNTA 

CCACTTTNNAAAAANTTTACCAGACCAGGTCCAATGGNGGGATGGAGAACCTAAATCGNT 

TGNTCGGCCGGANTCAAAATTAAGGTTNTTAAAATTTGCCAAAAAAAAAAAAAAAAA 

Sequence 2057 

CGCNTCCGGAGAGAGCCCAGGGATGCCTTATGGTCAGAACAAATTTATAGACAACAAAAG 
GGAAGTGACCGTGCAGAAATCAGAAGTGAGGTACAGAAACAGCTGGACTGATTACAGCTC 
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NGGCTGCTGTGATTGGCCAGACCTCAGCTATTGTTACAGGCACATACTCTTAAGTCAGGT 

TTTCAATCCTATCTGACTATAMGTTAGGTTACAGTTTGTCCTCATGGACTCAAATTTAG 

MGTATGGCGTCCTTCTCAGGCCATATTTAGTTCAGTTTAACAAGTGCATATGGCTTCTG 

ACAAAGGTGTGGCCCCTTTAGGACTCCAAAGACGCTGTCACTTACATGGTATTCAGGGAA 

GACACAGAGGATCTGTGAGCAGCCTGCAGCCAMGCTTTTGAGATCATATTGAGATTTTT 

TTGTANTATANGAGGAGGGGTTCTGGCTC 

Sequence 2058 

CGGAAGCATCGACCTGCGAGCTCACAGAGCTGGGAGCAGAGCACCCACGCACACCCCGAA 

TGGCTATGGAAGCTGCAGGGCGCCAGGGACACTGGGAGTCCCTGCTCTCATGGCAAAGCA 

GGGACGGGGGACTTAAAAGCCACCAACAGGAAAATCGGGGAAAAAAGGGAAGATGGTGGT 

AACAGTTGGACACTATTTCTTGGCAAAACCGTGGAAAAACACGTTCTACACCAGCAGGTG 

GCAAATTGTGGCCGCCATCTGTGTTTGCAAATAAAGTTTA 

Sequence 2059 

CCCCTACAATGAGCTGTCCCGCCTCAGTGGCCTGCGAACCCTCAACCTCCACAACAACCT 

CATCTCCTCCGAAGGCCTGCCTGACGAGGCCTTCGAGTCCCTCACCCAGCTGCAGCACCT 

CTGCGNGGCTCACAACAAGCTCTCAGTGGCCCCTCAGTTACTGCCCCCGTCCCTCCGNGT 

CGCGGATCTGGCTGNCAACCAAGT 

Sequence 2060 

ACCCACGCGTCCGCCGATTTTCCAGGTGCCGTCTGTCACCCCTTTCTTTGACTNGGAAAG 

GGAACTCCCTGACCCCTTGCACTTNCCGAGTGAGGCAATGCCTCGCCCTGCTTCAGCTCG 

CACATGGTGCGCGCACCCACTGACCTGCGCCCACTGTCTGGCACTCCGTAGTGAGATGAA 

CCCGNTACCTCAGATGGAAATGCAGAAATCACCCATCTTCTGCGTCACTCAAGCTGGGAG 

CTGTAGACCGGAGCTGTTCCTATTCGGCCATCTTGGCTCCTCCGCTCTATTTGCAGTTCT 

TAAAGGGCTATTGTACTCTCTGGGATTGTACGAACTTGGACTGNATTGGAACTGCAACAG 

AAAATCTTCCAGGCAAGTGCCA 

Sequence 2061 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACTTTCATTCAGTGTACGAGGGAAAAAAGCAT 
GTATTGGGCCACCGGAAGACAAGCTAATAAATAGGCTGGAAGTAATATTCTACCAGCAGG 
MCTCAACAGCTCCAGTTAAATGCTTTGATATAGNGGCTCCTTTGCAGAGCCAAAACAAG 
ATTTATTAAATTTCCTTCAAACTGTTTATCTTTA 

TGCTGMGCAAATGTGAATGCCAAAGACTATGTTTTGCAGTTTTGCTTTCCTCCCAATAA 

ATATTMTGTATGTAATTCTAGAGGGTMAAATGTMATAGGTTTGGACAATATTTGCAC 

CCTTGTTTGTGTTATGMAAAAATTTTTCCAAGGCGAGCTAGAGAGMAGATGTTTGGCA 

TGCCAMTTMCTTGCATGTTTGTTAAAMAACAAACACATGTTTTTGAMAGAAACCAG 

ATCTGAACGTGTATTTGTTGAAGTTTTGCAAAA 

Sequence 2062 

CCCTTTCGAGCGNCCGCCCGGGCAGGNACI I I I I I I I I I 1 1 I U I I I I I I I I I I I I I 1 1 I 

MM I I MM MM I IMM I I I MUM MMIM II M I MM III I MM IMIM I 

TTTTAAMAAAAAANNAMANNNTNNNNTTTTTTNGGCCNTTNNTTNAANNNAANCNNNT 

TTNTTTNGGGNTTNNTNNAAAAAANNNANNNAAANCCCNNCCNGNNNTTTTTAGGGNAAA 

AAAAANTTTTNCNGGGNTNAAAAAAAATTNNTTTTTGG^NCCMAANNTNNGNNAAAAA 

ANNAAANNANCCNNNNTTTTTTNNNNNGGGGCNCCCCCTTTTTTTNCCCNTGGNGGGGN 

GGGNAAAAAGGGGNTTTTTTTNGGANCCGAAAAAAMCGGGAMNTTATCCCTTTTTTG 

GGGGCCNTAAAC I Mill I I NNCCNCNNTTlsTTTTTTTAAMANCCCCCNNCCCTTTNTCC 

CTGNTGGNNCCCTTTTTGGCCCGGGGAAACCCNN I I I M I I M II I I I I 1 1 

Sequence 2063 

AAGGGAAAATGTCACGTANACTAGATCAGGGAACAAAATCCTCTCCTTGTGGAAATATCC 
NATGCAGNNNGNTGATACAACTTANTATCTTATTGCCTAANAAAAAAATTTCTTATCATT 
GTTTCANAAAAGCAAAATCATGGAAAATTTTTGTTGTCCAGGCAAATAAAAGGTCATTNT 
AATTTAGCTGCAATTTCAGTGTTCCTCACTAGGTGGCATTTAAATGTCCCCTGATGTCAT 
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TMGCACCATCCMMAGTCTGCTTCATMTCTATTTTCAAGACTTGGTGATTCTGANAG 
TTTTGGTTTTTGNGACTTTGTNTCTCAN^ 

TATMTTCAATTTAAATATGNTGNGGCGTCTCATCCAGGATNGGATAGGTTGTCTTCTAT 

TTTCCATTTTACCTATTTAC 

Sequence 2064 

CCCTTTCGAGCGNNCGCCCNNNCAGGTACAGACTTAGAAATTATCTAAAGATTTCATCTT 

TTTACCTCATATTTCTTAGGAATTTAATGGTTATATGTTGTC I I II I I I CCTATGTCTTT 

TGGCTCAAGCMCATGTATATCAGTGTTGACTTTTTCTTTCTTAGATCTAGTTTAAAAAA 

AAAACCACATAACMTTCTTTGMGAAAGGAAGGGATTAAATAATTTTTTTCCCTAACAC 

TTTCTTGAAGGTCAGGGGCTTTATCTATGAAAAAGTAGTAAATAAGTTCTTTGTAACCTG 

TGTGMGCAGCAGCCAGCCTTAAAGTAGTCCATTCTTGCTAATGGTTAGAACAGTGAATA 

CTAAGTGGMTTGTTTGGGCTGCTTTTAAGTTTCTCTTAATCAAAATTACTAAGATGATA 

GAATTCAAGAACTTGGTACATGTATTACTTGGTGGTATCGATMTCATTTAAAAGTAAAA 

GACTCTGTCATGCA I I I I ICCCCATTTCI 1 1 1 M I I I CCCTGTCTCCGGGGCCAACCCAA 

GTGGGTCTTCATTTT 

Sequence 2065 

CCCTTAGCGTGGTCGCGGCCGAGGTACNCGGNGTCCAAGATGGCGGATGAAGCCACGCGA 

CGTGTTGTGTCTGAGATCCCGGTGCTGAANACTAACGCCGGACCCCGAGATCGTGAGTTG 

TGGGTGCAGCGACTGAAGGAGGAATATCAGTCCCTTATCCGGTATGTGGAGAACAACAAG 

MTGCTGACAACGATTGGTTCCGACTGGAGTCCAACAAGGAAGGAACTCGGTGGTTTGGA 

AAATAACATCTGGGCCTGCTGGAGAAAAAGAAAGATTACAAACTTCGTTGCATATGACTA 

CCGTAAAAAACAAGAATACCTCAAAGCTCTTCGGAAGAAGGCTCTTGAAAAAAATCCAGA 

TGAATTCTACTACAAAATGACTCGNGTTAAACTCCAGGATGGAGTACTTTAAI 1 1 I I I I I 

TNTTNATAhnTNCCAGGMCATTTTCTAATTATGTTATATAAATGGGTATGTGATATGTG 

NGCTATTTGTGTGCTAATGTCCTAAGTGAAGTTCTGCAGACCATCTGGGTCAAAGTGCAT 

TTCGCATGATCCAAAANATGAAGAACCACTTGTTTGTCACGGGAGACNAGGGAAAAAAAA 

A 

Sequence 2066 

CTTAGCGAGGTCACGNNCNANGAACGCGGGGNGNTCAGGAAGATNTCTGAAGAGTGCAGC 
NGCCTGAACCGAGCCCTGCCNAACAGCTGACAATTGCACTGCAACCATGAGTGA 
Sequence 2067 

CATGCAGAANTCCTCGCTGGAGTTTCATAAGGCCAATGAGTGCCAGGAGCGCCCTGTTGA 

GTGTAAGTTCTGCAAACTGGACATGCAGCTCANCAAGCTGGAGCTCCACGAGTCCTACTG 

TGGCAGCCGGACAGAGCTCTGCCAAGGCTGTGGCCAGTTCATCATGCACCGCATGCTCGC 

CCAGCACAGAGATGTCTGTCGGAGTGAACAGGCCCNGCTCGGGAAAGGGGAAAGAATTTC 

ANGCTCCTGAAAGGGAAATCTACTGTCATTATTGCAACCAAATGATTCCAGAAAATAAGT 

ATTTCCACCATATGGGATTCCCAGACCATGANGCCAAAGTAATTCCTAATCCCCACACAC 

AGGAATGGCATGGGACCTGNGATTTTGAGTTTTCAAGGGGCCGTAAGNTTTTNATTCTTC 

ACACCTNCAAATTACCCGACCCCAAAAAAAAAANAAAAAAAAA 

Sequence 2068 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTNTCCGATTTCAAGAACTGATGAAATTAGAAA 

AAACACCTACAGAACATTGGATAGCCTGGAGCAGACCATTAAACAGCTCGAAAATACAAT 

CAGTGAAATGAGTCCCAAAGCCCTAGNTGATACCTNATGTTCTTCCAACAGAGATTCTGN 

AGCAAGTTCATCCCACATAGCCCAAGAGGCCTCTCCCCGACCCTTGCTAGTTNCGGATGA 

AGGTNCCACTGCCCTAGAGCCCCCTACGTCGATACCTTCAGCTTCACGTAAGGGCTCCAG 

CGGGGCCCCACAGACGAGCAGGATGCCTGTCCCCATGAGTGCCAAGAACAGACCCGGAAC 

CCTGGACAAACCCGGCAAGCAGTCCAAACTGCAAGAACCNCGCCAATATCGNCAGGGCTA 

ATGGAANTNCTMGAMTCTTGGNNGGGGACTNTTAAAGCCTACTTTCCCCTACTTACCT 

GCTTCTAAAGATTCCAAGGCCNTTCTTCCAAACTTTTGGG 

Sequence 2069 

CCCTTTCGAGCGGCCGCCCGGGCAGGTTCATGGATNNGAGCAGCTTCACCAACCCCTGCA 
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AAGTGACTCTGAAGAAGACGACAAGCCCTGCTCCAGTCACACCCGGAAGCTGACTGGTCC 

ACGCACAGCTGMGCATGAGGAMCTCATCGCGGGACTAATTTTCCTTAAMTTTANACT 

TGCACAGTAAGGACTTCAACTGACCTTCCTNAGACTGAGAACTGTTTCCAGTATATACAT 

CAAGTCACTGAGAGAACATCACCACCCTGAAGCCAGAGACTAACACTGCAGGACTCAGCA 

GGACTATTTAAGAAACAACTGAGGCATCAGACCAACTTTCCCCACAAGTCCTCGGATCTT 

TCCTGCCATGCTGATGCCATATATTCCAACGTGATCAACCTGGCTCCCCAGAAGGAGGAC 

GACTTTGCTGTCTACACCAACATGCCCCCTTTTCATCACCCCAAGAGGACATTGCCAGAC 

CMGTGGGMTATGTCTTCCATTGTATTTCCAACTGATGGGGAAAGCCTANATGAAGATG 

CTCMGMGNGGGGGGTCAAGACCCTGACCCCAGCNTGAATCTTTGGCATTACCCTTTGC 

TTT AAATTT AATGTG GTG G N G NTTT AAAAAAAAAAAA AA 

Sequence 2070 

CCCTTTCGAGCGGCCGCCCGGGCAGGTA C I I I 1 I I I I I I I 1 1 1 1 1 1 1 1 1 1 1 1CNGCGGGG 
NGNNCTACTTNANMTCTTTGGCNGGTTTNNCNGTTTTNGGTTTTCTNANCNCTTGGNCT 
GGTN CATTT GGTTTNGMNMTCNGTTNCTTC^ 

AANGGTTTTNGAAANCTTTNNCCTTCTTCNGTNGMGTNGTGGGTTTTTNANAANANAAA 
MTTGGGGTTNATCATTTTTTCTAGGCCNGAANGTTTNGNNCNTNCCTNTTTCANAATCT 
ANATTAATAC 
Sequence 2071 

CC CTTT CGAGCGGCCGCCCGGGCAGGTNCNGGTTANCAGACCCACAACACGAAGCTCCTG 
CCTTTTAAGACTACAAAGAGGCAGCTCAAAATTAGACTGCACAGGTAAGCGAGGAACTGC 
AGTCTAAGCCTGGACTCTGCCTTCTGCCCTCCCCCCGCGTACTCAAGCAATAAAAT 
Sequence 2072 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCTN I I I I I I r I I n NTTTNGCNGAGTGAGCTA 

CTNTAGGATCTTNTGCTGGTTNTACAGTTTTTGGTTTTCTTAGCACTTTGTCTTGTTCAT 

TNNGNTTN GMGA ATCTGNTTCTTCTGATTT^ 

TTTGAAAA CTTTTA CCTTCTTCTGTTGMGTTGTTGGTTTTTTAGAAGA 

GTTTATCATTTTTTCTAGGTCTGAMGTTTTGCGCATTC^ 

TAACTAAAAATCTTAAMCTTGNTTTTTGAGAATATTTACTCTTCTGACAAATTCTTCGA 
TTAMCTTTCCTCAGATTTTAAGTTAGCTAAAACAAAANTTCC 

TNTCGM TGCT NACTCTGTATCTTTCAAGTTTTNCAATTTTTTCGATGTCTTAGCATCAA 
AAACAGATTTTAAMCGTCTTCAMTTACTTTTAAATCTGTTCCTGCAGCTAAAACNGTC 

Sequence 2073 

CCCTTAGCGTGGTCNCGGCCGAGGTACGTGCTTATACAAGATGTCAATTATGTGGTCGTC 

CACATGCTGTATTACGTAMTTTAAAATTTGTAGAATTTGCTTCCGTGAACTAGCTCACA 

AAGGACAAATACCAGGTATTAAGAAAGCGAGTTGATAATATGATAATCACAGATCCAATA 

GCAGATATGATCACAAGAATCAAAAATGCCTTCACACGTAAACACAAAAATGTTATTATT 

CCTCATTCTAAGAAAAAAGAAAGAATCTTACAAATCTTCTTAGATGAAGGATATATAAAA 

GGATTTACTGTATCTGGTGMGTTAAAAAAGAAATTAATGTTGAGCTTAAATACAAAGGA 

AATACAAGTTCAATTGNTGGAATTAAAAGAGTTTCCAAGC 

Sequence 2074 

CCCTTTCGAGCGGNCGCCCGGGCAGGTGGGCAGGTACTTCAGCAAGTCCTCTTTCTCCTC 

AGCAGTAAGCTCAGCCGGCAGGTGCCTGACCAGAAGGGTTCGGTCGCCCCGAGGCGGGGA 

AAGCGAGGAGGAGCTCGTGCATCCCCTTGATATCGCAAGCGGCTGCTCGGGAGCTGCCAT 

TTTCCTTGGAGAAGCAAAAACAGAAATCGTGGGAAGAAGTCTCAGTCAAAATCGCGGCAT 

CAACACAAGCTGGGAGAMTATTTTTTCCGCCTCGCGCTAAGGATTCTGGAAACCAGGAA 

ATACCGAGAAAGAAAGTCACCTTCTCGCGAGAACTGCGCCACCGAAAAGCGGCAACCCTT 

CGAAGACTCTTCGGGGAAGGGCGCGGTGCTAATGATTTAAATTCCAAGGGGTCTNCGGAA 

AGACTTACAAAGCCAAAAATTGGCCCAAAAGATGTGCGANGGTTAACACAAGTTGTCAAT 

CAAAGAAAGGAAACAGGAACCCCAACCCCTTTAAGGA 

Sequence 2075 
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CCGCCCGGGCAGGTACAAATTGAGCTCTCTATTCATAACCTCAATGTATGTATTCCTGCC 

CATTAATATACTTTGCACCAGCAAMGCGATTTCCMCATATGTGTTTTGGAGGTAATTA 

AGTAACTCTGTATAAAAATAMTGCACTTTTCCCTCCTTTCCCCAGTGAATGGAAAACTT 

CCATACTTTCAAAATMTAATAAAAAAATAATTTTTAAGAGCMCAGCCCTCMCTCTTT 

GCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCTGCTAGTTTCTTCTTG 

TATGTCATGATAAMAGGGMTGTGGGTGTGTMCTTTTGTGTATGTCCCGTTTCCAAAT 

TTCCCTCTCCAAAAAGCCAACCAAATAAACAAACAAACAAACGAAAAAAACAGTGCAACA 

AAACACAAATAGCATTCCAACAGTT 

Sequence 2076 

TTTCGAGCGGCCGCCCGGGCAGGCACATAAMCATTATTCCTTCCTTGGCCTAAAAACTC 

ATCGCCACCTACATTAAAGCTAATATGCCTGATTACTGTTTTTAGAGMCTTATTTTATT 

AGGGCAGTTCCAAGCTCAAAAATACGCTAACTGGCACCTTGTTAGCtACATAAAAATGCA 

CCCTAGACCCGAMCTTACTAGACTCATTATAAMTTTTCTTTAAGGTGTCCACGCAGTC 

CCTGGTCACACTTGAAGCAGTCCGGAGAAATATCAGCCCTACCCCAGTAATCCCCAGAAG 

GAACTTACAC I M I III \ AATCTTTTCCTACAACTTCATATTTTATAAATAAAAAGACAA 

AAATGTCAGGCCTGTGAGCTGAAGCTTAGCCATTGTAACCCCTGTGACCTGCACATATCC 

GTCCAGGTGGCCTGCAGGAGCCAAGAAGTCTGGGAGCAGCCCGAAAAACCACAAAGAAGT 

GAAACAAGCCAGTTCCTGCCTTAACTAATTAACCCACCTTACGACATTCCACCATTATGA 

CTTGTCCACCATTATGACTTGTTCCTGCCCTGCCCCAACT . 

Sequence 2077 

CCCTTTCGAGCGGCCGCCCGGGCAGGTTTANGTCANAGTCTTCTNTTCTNTTCTNNNTGA 

GATGGAGTCTTGCTCTGTTGCCAGACTGGAGTGCAGTGGTGCGATCTGGGCTCACTGCAA 

TCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGTAACTGGGACTAC 

AGGGTGCGCGCCACCAAGCCCAGCTCATTTTNGTATTTATAGTAGAGATGGGGTTTCACG 

ATGTTGGCTAGGGATGGGTCTCGATCTNTNGGTCAGAGTCTNTTNCTGTAAAATATCCTT 

GGGTAAAAGAAGCAATTTTANACTGTAAACTGATGNCAANATGCTTTAAGGGAAGAAGGC 

N 

Sequence 2078 

TCCCTTNCTTTCTCGCACGTTCGGCCGGCTTTTNCCCGTCAAGCTCTAAAATCGGGGGGG 
CTCCCTTTAGGGGTTCCGAATTTAAGTGGCTTTACGGGAACCTTCGAACCCCAAAAAAA 
Sequence 2079 

CCCTTTCGAGCGGCCGCCCGGGCAGGTNCAGGGTCTGTCAGAAACTGTTGGAATCTTACA 

TAAAGTCAAGTCTCAGAAATGTCCGATGCTTCACCATATTCTTATATTCTATGCAATTGT 

TGTCTGTGCACTAATCATCTCGACCTTCTACATGAGATACAGAATTAATACTCTGGAGGA 

GCAGCTGGGGTTACTAACCTCCATTGTGGACACCCATAATACTGAACAGGCAGCACCATC 

TGGCCTGAGGTCACAAGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2080 

ACCNTATAACGGCCGCAGTGTGCTGGAATTCGCCCTTTCGAGCGGCCGCCCGGNCAGGTA 

CGCGGGGNTGGTTCCAACTTTTCTGCTNATCTGGGAGGTGNTGGGCGCGGACAGTCNAGA 

TGTCAGAGMAAAGCAGCCGGTANACTTAGGTCTGTTAGAGGAAGACGACGAGTTTGAAG 

AGTTCCCTGCCGAAGACTGGGCTGGCTTAGATGAAGATGAAGGATGCACATGTNCTGGGA 

GGATAATTGGGATGAATGACAATGTAGAGGGATGACTTCTCTAATCAGTTAACTGAGCTG 

AAACTAGAGAAACATGGGTTATAAGATGGGAGACTTCATAGCCATCCAGAAGAAGTGCTG 

AAGTAAACCTAAAACTTGACCCTGCTNAAATACATTGTAGGGGCAAGAAGAACCCAAGGA 

ATGGGGACACT 

Sequence 2081 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGNAAGTGTCGGCGCCGCCACTGTCCG 

GCCACAGCCTAACGCTCTTNGCTGTCGTTTGNGGNCTCGCGCAGGGCGGCCCCGGNTCTG 

GTGTTTGGCNGTCGGAATTAAACAACCACCATGTCCGAGCAAAAAGGCAAAGACCAAGAC 

CACCATAGAAGCGCCCTTAGCGTGCAACATCGAANGNGTTAGNCATGTTNGACCAANNCA 

CAGATTNGAGGAGTTCAAAAGAGGCCTNCAACATGAGTTGATCAAGAANANGAGATGGCT 
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TCATTNGANAANGGAAAAATGCCNCAAGGNNNNGCCTNNGCTNACTCTAGGGGAAATAAA 
CCCCNACCTGANTNNGAAAATCTTGAATGGCCCATGGATGAAATGAGGCCCCCA 
Sequence 2082 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGTGATATNTCACATATTTTTGAGAAAAATTCC 

CAAGCCAGGCGAATGTGGATTGGAATAAAGACATAGGCAGTGTATACCACCATAGCAATA 

ATGGTTAGTAAGATGGTGTTAAACATAGATCGCTCCCAGGGCTCTAAAACAGCACAGCAG 

CTAATGATTTGGTATTGATAGTAGAGCCAGGAGAAATATTCCTTCACACGCCTCAAATCC 

ATGGTTGGCTCCTTCAAGCTGCAGTAAGTTTGTCCTAAGAAAGTCCAGGTCTGG7TCTTC 

AGCCTTGCTCCTTCGCGAAATGATCCTGTGTGGGTTAGTTCTCCTCTCTGGGTTGCTGTT 

TCCTCATCTCCCAGTTGGGTGTATCTCCCTGCGGCTTAGGTGAGCGCCGAGGCTTTGGCT 

CCTCCCCAGCTGCTGCGAGCTTCCGCTGCCCCCGCGTACCTGCCCGGGCGGCCGCTCGAA 

AAGGG 

Sequence 2083 

CGGGCAGGTACCAAAGCCTTNTGNCCCCATTTTCCATCATACGAATAGTATTCCCTGTTG 

CTAAGCCGATGATACATTANCCTTTTCCCAATAGGNGNGAGNGGATGGACTGAATGGAGA 

A 

Sequence 2084 

CGCTCACTGCCCGCTTTCCAAGTNCGGGAAAACCTGTCGGTGCCAGCTTGCATAAATGAA 

ATGGGCCAACCGCGCCGGGGGAAGAGGGCGGTTNGCAGTAATTGGGCNGCTCATTCCAGC 

TTCCTCCGCTTCACTGGACTTNGGANGGCNCTCGGGTCCGTACNGGCTGGCGGGCNGAGC 

CGGGGNATCAAGCTCAANTTCAAAAGGCCGGGNAAATACCGGGATAACCCACCAAGAANT 

CAAGGGGGGATAAACGCAAGGGAAAGGAACATGGTGAAGCCAAAAAGGG 

Sequence 2085 

NCNGANCNGGGAGNGGGGAGAGCCCGCGGAGGNGCCCCAANCNCCCNAAAGCGAGNCGCA 
NNACGCGCGCNCACGGGCCGCGAAAGACAACGCAAGACGGGGAAAACCCCGCCGCNACCC 
AACANAACCGCCGNGCAAGCACAANCCCCANAG 
Sequence 2086 

TTTNNAGTTTTNGGGANCCACTAGTTCTNGAGCGGCGAGGTACTTATCAANAATGGGGTC 

CNTAMGACATAATCTTTGCCAGTAGATGCAGAGTTGCTGGTAGNTTATGACTAATTCCA 

AGACCGTCCGAAATCCCTGGGCTGTGTTGAAATGTGNNTTCATGCTCCCTCGCTCCCAAG 

CATAGACCGNCAGGAGCTCCAGGGCATACTGAGGNGGCAAGCTTCCAAGCTTCTTCTTAC 

AATTTTGGACCTGCCCGG 

Sequence 2087 

CCGGNCAGGTACACTCGGAAATGGAAGGCGANGTTTGAGTCTTTCTTCATCTCGGTATGG 
AAATCCACCTGCAGCTGTGGGTCCATGACGAAAGGGATGATCGGTGTCCCTGTGATTATC 
ACACAAGAACCAGTAGACAAGGACACAGGCCGTGTGTGTGGTACCT 
Sequence 2088 

CNGGACTCACGCGGCGGCGGGAAATGTNATNCGATCACAATTCCACACACATACCNCCGG 

GANTnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnATCAGGGGGATAANGCAGGAAAGAANATGNGGAGCAAAAAGG 

CCAAGNAAAAAGGCC 

Sequence 2089 

TTTNGGGGACCCACNANTNNGGNGAGCGGGGCCACCGCGGNGGAGCCCCAACCCGCCCNA 

CAGNGAGGCGCANNACGCGCGCCCACTGGCCGACGANNNACAACGACGNGACGGGGAAAA 

CCCNGGCGGNACCCAACAAAANCGCCNGGCAGCACAANCCCCNNTCGCCAGCGGGCGGAA 

NAAGCGAAGAGGCCCGCACCGAACAGCCCCTCCCAACAAGAGGCGCAGCCNGAAAGGCGA 

AAGGGACGCGCCCNGNAAGCGGCGCACAAAACGCGnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnAGCGCCCGCNCCCTNCGCGNANCAACCCCTNNC 
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CNTCNCGCCACGGACGCCCGGCTNTCCCCGNCAAGCNCTAAAAACGGGGGCCNCCCACAA 

AGGGGGCCGANANAAGAGCNNNAACGGGNACCCNCGACCCCAAAAAAACNNGGAAANAAG 

GGGGGAAGGGGNCAACGCAAGNGGGGCCAANCGCCCCGGANAAAACNGGAGANNACCGCC 

CCNCNGAACGGANGGAAGACCANCGGCCCNAAAAAAAGGGGNCCCACGGGGCCAAANCAG 

GGAACAACACACAAACCCCAAANCCGGGGCCANNNCCNNTGGAANCAAAAAAGGGANANG 

AGCCGAACACCGNCCCAATGGGGGA 

Sequence 2090 

NATTGCGAANTGGGCCGCTCTTCNCGCTNNCTCGCTCACTGACTCCGCTTGCGCTCGGTC 

CGNNCGGCTGCCGGCGAGCGGGTATCAAGCTCACTCAAAGGCGGGAAANACNGTTATTCC 

ACAAGAANCAAGGGGGAATAAACCGCCAGGGAAAAGAAACAATGGTGAACAAAAAGGCCC 

AGCAAAMGGGCCMGGAAACCCGAAAAAAANGCCCNCGGTTGGCTGGCGTTTNTTCCAA 

TAAGGCTCCGGCCC 

Sequence 2091 

CCCTTANTTTNGNCNTTNNCGANGNACCACACACATAGGTAGCCNGCATTCATGGAACAG 

GCACCGTGGGCTGGGCTGCACCACACCATCTTTCCATGTGTTATCTCTTTCTAGAGACTT 

CTTGAAMTTGGTAGGATTATCATATCATATGTTCTTGGAAACATCTGTTGACTATTTCT 

GTACATCATGGCTCGGACTTGGGTCAAGCTCTTGGCACCAATGTCCTGGCATGAGTGTTG 

GATGCCAGCAATCAGGTMGGGACAMTTTGTGGATTGACCCTTTGTCCTGCACAGCACC 

AGACACTCCCTGGGCCACTTTGATTTTGTCAAGCTTCACTGAAATATCTGTTCTGGCTGC 

TGAGGTGCTTGTCCCATGGCATCGAAGAGAACCCATACCGCGATATTTCTTTAGCNCGGA 

TCCCATCGGAAAAAGAAAGTANTCACCAGGGGGCCCTAAGTGGGTGGCAGCCAGGAAGAG 

AGCCCCATCATGGACTGNNGGAGGCCCCAAGGGGC 

Sequence 2092 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTAMGAGTTGTTTTTTGGCCGGGCGC 

NTTGGCTCATNCCTGTAATCCCAGCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTC 

TGGAGTTTGAGACCATCCTGGCTAACACAGTGAAATCCCGTCTCTACTAAAAATACAAAA 

AATTAGCCAGGCGTGGTGGCTGGCACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGG 

AGAATGGCGTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCNCCCCTGCACTC 

CAGCCTGGGCAACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAGTACCTGCCC 

Gnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnG 

TTMTTGCGCCCCTTGGCGTAATCATGGTCATAAGCTGTTTCCTGTGTGAAAATTGTTAT 

TCCGCTCACAATTCCACACCAACATACGAGCCCGGGAGCATTAAAGTGTAAAAGCCTGGG 

GTGCCTAAATGAGGGGAGCTAACTCAACATTTAATTGCGGTGCGCCTCACTTGCCCGCTT 

TTNCAATTCNGGGAAACCTTGCGTGNCCAGCTTGCANTTAATGAAATCGGCCCAC 

Sequence 2093 

CCCCGCGTCCGCCTCGNAAATTGTTGATGCTCTTTCCCCTCCCCGAGGTCTCGCATNCAA 

ANCCTGGTGGGCTGGCCTTGTGTGGCTGCTTCTCCAGGCCTGGTCAGNACCCAGCAGGCT 

CAGGGTCTGCTCCTGATGCTGNGCTCTGGGACAGGCACGCCACTGTGNGAAACACTAAGC 

NAGGTAATCGAGCATTTNGTGATCACAGACTCCAGCTTCCTGGTCCACCCAGCATGTAGT 

CAGCACTCTGACCTTNACACCAGAGCTCCACAGCGGCTAGGAGTTGACTTCCTGTGTCAT 

GACCTCAGGAAATAAATTTCCTTGACTTTAAAAAAAAAAA 

Sequence 2094 

TTCTGCTGAGACGCGTGTGGCTNCCTCCCCGCAACANCCAAAATGNTGAAGCTGATCGAG 

AGCAAGGAAGCTTTTCAGGAGGCCCTGGCCGCCGNGGGAGACAAGCTTGTCNTGGTGGAC 

TTCTCTGCTACGTGGTGTGGACCTTGCAAAATGATCANGCCCTTCTTCCATTCCCTCTGT 

GACAAGTNTTCCAATGTGGNGTTCCTTGAAGTGGATGTNGATGACTGCCAGGATGTTNCT 

GCANACTGTGAATTCNAATGCNTGCCAGACCTTCCAGNTCTATAAAANGGGNCAAAAGGN 

GGGGGNNNTCTACNGNGCTAACAAGGAAAAGCTTGAAGCCTNTATTACTGAATATGCCTA 

ATCATGCTCTGAAAAGTGGGACCAGCTNCCAAGCTGNTTNAAACCTCGTACCNTTNTTAA 

TTTGCTAAAAACTATGAAAGTGTGGAGAGGCTATCCCAACTGNCATCTGATTATTAGTGA 
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CAATAAAAAAANTAATTCTACCCCTTNANAAAAAAAAAAAAAAAAAA 
Sequence 2095 

TGTGTAGCACCTGNGGNGTCCTTGNGTGATTATTCTTGTNCGAGGTACTTAGGGCAAGTC 
ACATGCCCTCCATCCCNTGGCTCANAGATGAAGAGTAAATCCAAAACATGTGCCTCGCTC 
TTGGTCACTAACTGCTGNCCTG 
Sequence 2096 

TCGAGCGGCCGCCCGGGCAGGTACTTTNTTAATGCCTTNGTTGGAGTCCTNATCCTCATC 
TTTAAAAAAAAAAAAAACAGNTTANCCTAAGCCANATTCACTTTTTTTAGTTNACAAA^ 
GGATTAANTNGCCACANTGTGATTT 
Sequence 2097 

ATTNNCCCTTAATCATCTCACGCCCCATGTATGATTCTCAAAGNGCCTAGCGTGANCANC 

NGTCCCTNAGACCACACCAATTTCTTNATGTCNCNCTCAAGAAAGCCAAAATGACAATNA 

TAANGCCATCTCAANCNCAATANCCTACCANAACCACCCCNCGGNCTTATCTANACTTCA 

ACTCMACTCTCTGCCTCCTTACTNTCTGGGGAGCTTNAAAACCANNTNACTNATAACTT 

TAAAAACCTNTCTNTAAAATNTCANAAACCACANCCTCACCATTNNCACAACCACCCCCA 

ACACCAAANANNTTCCCAAAACAACC 

Sequence 2098 

CCCTTAGCGTGGTCGCGGCCGAGGTACACCAAGACCAATTGCTAAAATCTTGGATTATGG 

MAATTTAAGTATGAMGAAAGAAAAAACAAAAAGTTGAAAAAGAAAAACAATCTTTCAC 

MACMTAGAGAAATTCGTTTATCTTTTGGMTCMTTTAAGNGATATAAAAATCAAAGC 

AAAAAAAGCCAAAGMTTTTTATTAGATAACGACAGAGTAAAAGTGGCTCTTCGTCTTAG 

AGGGCGTGAAAATACAAGACCTGMCAAGGTAMTTMTTTTAAATTCTTTTTTTGATGA 

AGTAAMTCGATTGCAAMTTAAGTAAAGAAATGCMTCAGTTGGTMTTTTTTA 

Sequence 2099 

NGNCCTTNCGAGCGGCCGCCCGGGCAGGTACAAATTGAGCTCTCTATTCATAACCTCAAT 

GTATGTATTCCTGCCCATTMTATACTTTGCACCAGCAAAAGCGATTTCCAACATATGTG 

TTTTGGAGGTMTTMGTMCTCTGTATAMAATAAATGCACTTTTCCCTCCTTTCCCCA 

GTGAATGGAAAACnCCATACTTTCAAAATAATAATAMAAMTAATTTTTAAGAGCAAC 

AGCCCTCAACTCTTTGCTGGTGCCTGCCATACTGCCTTTCTTCACTCCATTCTTAGCTCT 

GCTAGTTTCTTCTTGTATGTCATGATAAAAAGGGAATGTGGGGTGTGTAA 

Sequence 2100 

NCCTTAGCGTGGTCGCGGCCGAGGTACACTGGAGGCTGGAGCCTGCAGATGGCATGGCTC 

TGCGGCTCACCTTGCTGCAGTTGGTGGTGGTGACAGAGACTGCAGCTTGACTGTAGTGAA 

TTTGGAAATTATCTGTCTGGAAGCTCTGAGTTTATCTTGGGACCTCAAGAGGAGAGGATC 

ACCCAACTCACAGCAATCAAACTCCAAATGGTGCTATAAACTGAACCACACATGGACACG 

TCAGTCTTCCGAGGACCCTTAGATCAACCCCAGGAGGAGCCCTAGCTGCTGTTCCCCATT 

CGACGCCCCTTTCCAGCAGG 

Sequence 2101 

NAGGGGCGGGAATTTTGGGNGGGCCCCCCTTTCTTANAATGCATTGCTTCCGNANGGCCC 

GGGGNCCCCCCCCCCAGTGGTGGATGGGATATTCTTNCCAAAAATTGGGGGGGGCCCTTT 

TTNNGGNCCNNAAAAAACCCCNGGGGCCGNCCCGGGGCCAAGGTTACCTTGGACTTGGAA 

MATTTGGGGCNTTTTNTTTTGGGGGNNCCTTTTNCCCNNNNNNNNNNNNNNNNNNNNNN 

NNNANGGGCCCCCGGGGTTTNTTNCCNTTTTCCCAAAANAGGCCCCNTTTNNNGNGNGGG 

GNGGGTTGGGGGNAAAAAAANNNNNNNN I MINIM f'NNNNNNNNNCCCCCCCCCCCCN 

TTAAAAAANAAAAAAAAA 

Sequence 2102 

CCCTTTCGAGCGGCCGCCCGGGCAGGCACTTA I M M I M M I M I M M M M i CTTTT 

ll l M I I IIIIIIIIIIIIIN II IIII III IIIllUlllll l llllllllllllll.i l 

TTTrNNTNMNAAAAAAAAAANTTTTTNTTNAANTNGGGNCNNAMCTNTTTAACNAANN 

AAAAAAAANNNTNNTAAAANGTTNTCNAAGNNGGGNNNNNCCNNANAGGNANAAAANGAA 

MNGNNTNATTTTTTNTTNAAAAAAAAANNTTTNNTAAAANTGTTGNNGGNGGGGGTAGG 
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TTAAAAAAAAAA 
Sequence 2103 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTCTGTCTCTGTAGTCTCTCCATTCTAAAGTT 

TAATTTGGAAAGGTCTGTCCTGAATTATCAAAGTAAGAATACTCAGATTTCCATAAGCTC 

TTCTCACCTCCTCTTAGCCCAACTCAMCTCCATCAGACCTTCTCACCTTGATTTTGAGC 

TGGMTGTTTCAATGAGCAAAATAAATTAGACAAANGTTAAAAAAAAAAAAAAAAAAAAA 

NGTACCTCGGCCGCGACCACGCTAAGGG 

Sequence 2104 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGACCCCGGTCCTCAAGGAATCAAAGTTTAAG 

GAAACAGGTGTAATTACCCCAGAAGAGTTTGTGGCAGCTGGAGATCACCTAGTCCACCAC 

TGTCCAACATGGNANTGGGCTACAGGGGAAGAATTGAAAGTGAAGGCATACCTACCAACA 

GGCAMCAATTTTTGGTAACCAAAAATGTGCCGTGCTATAAGCGGTGCAAACAGATGGAA 

TATTCAGATGAATTGGAAGCTATCATTGAAGAAGATGATGGTGATGGCGGATGGGTAGAT 

ACATATCACAACACAGGTATTACAGGAATAACGGAAGCCCGTTAAAGAGATCACACTGGA 

AA 

Sequence 2105 

CCCTTAGCGTGGNCGCGGCCGAGGCAC I I I I I I I I I I I I I I I I I I i 1 1 I INGCI MINI 

I I II I 1 1 I I I I I I I I I UNI I I II I NTTNTNCNTTTCN A I MM l ATNCI I I I I 1 1 I 1 1 I 

TTTTNAATGGCCAGGCTCCCAACATTTNAAAAAAACTGCNCCCCCCAATGGGTGAACAAA 

GTAAAGAGTAGTAACCTAAAGTTCACCTGAGTAAGCCACTGNGGAGCCTTAAGNGGNGAG 

GTCTTCCAATTTNANAGNGATGNGNCTTCMCTTGTATNATNA7TTTANGCGGAAAAACA 

TAA 

Sequence 2106 

TCGGCGTCGCGACCCCCGAGGACCTCCTCTNCTCGCTCTGTGGCATACACTAGTCCTGGG 

CACTCAACCGCGGAGAGCCCCCGACCCCGGGGTAGCGGCTGAGCCTCAGCCGGGACCGGN 

ACCGGANCCCGCGCGGAGCATGTNATCCGGGCTGGGGGCAGCTGGNACAGTGGGCTGGGT 

TGGCCCTCCT 

Sequence 2107 

MTTTGTGTGTTTGTGTTTGTTTGGGGTTTTTG I I I I I ATTTCTTACATTANAGTNCATA 

TTTTCTGGGATTTAAATTATAGGTGTATTTTCCTATTCTCTTGAGAAGNAGACTAAACAG 

TCTTTGCAATGATGACGGATGCACACAGAANAAAACATTAGAAGACATTACTTTCTATCC 

TCTCATGTGGTTGANCATTCTTACACGCCAAATGACTAMTTGGTGTTTCNTAGGAAGGA 

GCAGCTGTCACTTACAATGTGAAAATATTAATGTTTTAGGCCAGNGGGCCAAACCTTCAA 

GGGGGCCTNGTTGGNCMTTTATNGTCCCCTNATNCTTTTNAAMATTTGAANGGTTCAN 

NNAAANTTGTNNAACCACCAACAAAACNCCCCCTTTNANCATNGNGGGGGCAANGGGGTT 

GGGGACCTTNGNGACCTTAANTGNATTGGCCAATTACCGCCTTGGGGCCTTGNTNAANGC 

CTTCAACNCCNAMANAATTGGCCCACCMTTGGGGGGTCTTTTNGTNANACCTTNTTTG 

NACCAACCAACAACCTTAANCTTNGANCTTCACAAANGGCTTGGGGCNAGGGTANCAACA 

ACNTTCAAGGTTTATNCCGGCTTCNCAMNGGGCNCTTNMCCTTIvrrTTGGNGMTTTGA 

AGGCTTAAAAACCTTGGCCATGGAAATCCCANCTTTAAGGGCCCAATCCCCTT 

Sequence 2108 

ACCACGCGTCCGAGCTCGCTCAGCACTCCCAGGTCCTTAGCACTCCCAGGTCGTAGCTGG 

CG CAGTCAG TAGGAACTGTAACTATGTCTCTGATGCACCACGTGTTTAGACACAGCACAG 

TCCI MM I CTGTTCCTACGGTGGAAGTAGTTTCTCTTTGGGCATGCTGACAGCACTTTT 

TCATAGCCTCACCGATGAGCCCTTTCTGCGGGAGTGACTCCATGCCTGTATACAGAGTAT 

TTATACAGATGTTTTAGCATCTTCATATGCGGTGTTAACCCCTAGTTCTGTACAGCATAT 

TCTGTTCMGTATTTTTTTACAAGCTTGTGCTGTAGGCACATGCCTTCTGCTGCAGAAGT 

GGACACCCGTGGCACACCCACCCCGCCCCAGTGGGGTGCCATGCCTTCCTGGGACATTGC 

CACTTCTGCCCTGGAACTCATGCAGGTACGTAGTAGCTGCTATTGCCAGA 

Sequence 2109 

NCGCCTATCACATAGTCAAACCCAGTCCCTGGCCACTGACAGGAGCTCTGTCAGCTTTCT 
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TAATAACGTCCGGCTTAATCATATGATTTCACTTCTACTCTATCACCCTACTAACACTAG 
GCCTACTAACCAACACACTAACCTTATATCAATGATGACAGTGACATCGTACTGAGAAAG 
CACATACCAAGGCCACCACCACAATACTCCCGTCCAAAAAAAGCNCTTCCGGTACCGGAA 
TTMCAACAT ATTTTA TCANTTCTC^ 

TCTTGAGCANTTTTTANCCACATCTAAAGGCCCTCAGCNCCACAACAACACCTGGCTTTA 

AGGNGGGGCNCAATTTGNACCCANCCCANACAAGGGCGATTCANCCCCCCCCTGNAACCC 

CCCCATTAGTAAAGGTACNCCCCTNNCTTAAMACMCCCTTCTTGGNNATTTAACTTTG 

NCAATCCTTGGGGGGGTNAAACAAANTCAACCCTGGAAGGCTTCACCNACAAGNCCCTNC 

ATTAAAATTAAATNGACCCCGAAAAAANCAAAACCAAANTCCCAAAGGCNCCCTGCCTCA 

TTCACCAATACCTAACTTAGGGTTACCCTTAT 

Sequence 2110 

CGTCCGGGACCTTTATGTCTTGTNAAGATGTCTAGGCCTGGCCGGGCGCGGTGGCTCACA 

CCTGTAATCCCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACGAGGGCAGGAGTTTGAG 

ACCAGCCTGATCAACATGGNGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCCGGGC 

ATCGTGGCACATGCCTGTAATCCCAGCTACTCGGGAGGCCGAGGCAAGAGAATNGNTTGA 

ACCCAGGAGGTGGAAGTTGCAGNGAGCCAAAATCACGCCACTGCACTCCAGCCTGGGCAG 

CANAGTGAGACTCCGNCTNAAAAACAAAACAAAACAAAAAAGCAAAACCAGATGTCTAGG 

CCMTGATMTTATTTTTGATGCATTGTGGATTANGNTCTTTTGTTMCCCCACTGTCTT 

GGGG MTGATGCCTGCTGGGAAATTGAGTTTTTTGACTGAAACATGGAACCTTNCTGCTT 

TTTTTCTGGNTCCTATGAAGGTTTTGGAACATNTGAAAACACAAAAACTCACCTTGAAAT 

TTGAGCAGGTCGATGATGGCAAAAAATTATTT 

Sequence 2111 

GCGTCCGCTGATCTGCTTTGGGACGGCCTTTATATACTTCCTCCTTTCCAGGCCTTCCAC 

CACCAGTGACCACTATTCGACATCTGGCCCACTCTCAGTCATCCTCCTGCTTATGCTTGT 

CTNCTCCTTGAAGGCTTCCCACTGCATGTAGGACAAAGGTCAGATTTTGTAACAGGCCAG 

GCCTGGNCTTNATAGTCTGGNATCCACTAATTTATGGTCTNAGTCTNATCCCTTGGAGGA 

TTACCTCTGNCCTTNGNAAGCTCTGTGCTCCNG 

Sequence 2112 

TTCATAACMTTCTCCTACAAATCACCTTAATTCTGACATTCATGGCCACAGAACTAATT 

ATATNCTACATTCTATTTGAAACCACCCTTATCCCCACCCTAATTATCAGGTACTAAATG 

AGGCAGCCAAGCAAAACGCCTTAACGCMGNACTTACTTTCTATTCTACACACTAACCGG 

CTCTCTACCCCTACTCATCATACTAAGCGCACACCTACAAACAACACAGGCTCACNTAAA 

CATCACAGCTANCTAACACTTCACGNGACCAAAAAAACNTAACAACCCACCTGGGNNCCC 

CACAGCANTTACCCTGGGCCTAGCNANTGGCATATATAAGCGCTTTTATGGACCAAAAAA 

TACCCCCTTNTTATNGGGGTCTACACNCTTATTGACTTCCCNCANAAGNCCCATGNTTTG 

AAAGCCCCNCATTTGCAAGGGGNTCAAATANGTCCTTTGCACGNCAGNTACCTTCCTTAA 

AAACTAGGGCNGGCTATGGGGCAATAAMTACCGGGNTTTAACTCCCCATTTCTTCAAAN 

ACCCCTTAMCGGGAAATTANCATAAGCCNTACCCCATTTCCCTCATTATTAATCCCTTA 

TGGGGGCNATAANTCATAACAAAGGCCNTAAACANTGCCTCCCGACAA 

Sequence 2113 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCTACTTTTTAAACMTTTCAACTGCA 

GCTCTCTTTCACTMGTNAGATGGGNAMGCATGCCATTTCTGT^ 

CTTTTTAGMACACACATGCTTCCACTGCCATCTGACACTTCTCCACACGCTTTCATNTT 

GTAMCCTGAATTCTATTTNGAGTACCTATCAAATACTTTCTGGAGGNGGGGCACGCTCC 

GCTCGGTCATGATGCTGATCCACTTGGGAACATCAGTTCCTTTCCTCTTCACTCCAGCGT 

CATAGAGATCCCGAGCATCTTGGNNAATCAGTTCATAATCAATGACAGAGCCATCCTCTG 

CTCTTCTACCCTTTGCCA 

Sequence 2114 

GTGGGGGGGGGGGGGGNAANCTCGTTGGGACNCGCCCCNAATTNNNANNNAACCNNNGNN 
CNNNCTNGAGGGCGAAGGNATNNATAAGCTTGAGGGGGGGGGGCTGGANACCGANGNATC 
CACTAGTTCTAGAGNGGGCCGCCNACCGGGGTGGGNGCTNCAGCTTTTTGTTCCCTTTAG 
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TGGAGGGGTTMTTGCGGCGNTTGGGCGNNAATNATGGGNCATTAGGTGGTTTCCTGGNG 
TGGAAATNGTNATCCGGGGGGGGAGGTNCANACAAAATAGCNGAGCCCGGNNAGNATAAN 
AGTGTAAAAGCC 
Sequence 21 15 

GCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGTTGTGTCCCTTTCTCTTCAAAGAT 
CCTGAGCAAAACANNGATACGCTTTCCATTACCTNATGGGGGGGNCNNGGACCCAGCTCT 
TGGCATTGCTANGGNNGGCTANNATCNNGCCACNTGAGGNTGTGGANNN 
Sequence 2116 

GTMCTCACATTAATTTGNGTATGCGCTCACTGCCCCGCTTTCCAGTCGGGAAAACNCTG 

GTCGTGCCAGCCTGCATTAATGAAATCGGCNCAACCGCCGCCGGNNGAGAGGGCCGGNTT 

TGCGTATATGGGCAGCTCTTCCGCTTTCCTCGGNTCACTTGACTCGCTTGCGCTCGGGTN 

CGTTTCGGGCTTGCGGGCGAAGGCGNGTATTCAGGCTCACTCAAAGGGCGGGTNAATACG 

GGTTATTCCAACAAGAATTCAGGGGGGATAANCGCANGNAAAGAACCATTGTTGAAGCCA 

AANAGGGCCCAAGCAAAAA 

Sequence 2117 

ACCTGTCGTGCCAGCTGCCATTAAATGAATCGGGCCAAACNCGCCGGGGNAGAGGGCGGT 

TTGCGGTATTGGGGNCGCCTCTTCCCGCTTTNCTCGCTCACTGACTCGCTGCGCCTCGGT 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTTCACTTCAAAGGGCGGGTAATTACCGGTT 

TATCCACCAGNAATCAGGGGGNATAACCGCAAGGGAAAGAACATTGTGAAGCCAAAAGGG 

CCCANCAAAAAGGCCANNGAACCCGTAAAAAAGGGCCGCNGTTTGCTTGGCCG 

Sequence 2118 

CCGCGGTGGCGGCCGCCACGCTGGTTTTGCATCTTCAGGAGACGCTCGTAGCCCTCGCGC 

TTNTCCTCGGCCAGTTCGCGGAAGAAGTGGCTCACGCCTTCCAGAGCCACATCATCGCGG 

NCGAAATAGAAGCCCANAGAGAGGTAGGTGTAGGAGGCCTGCAGGTACAACTTGTTGGCC 

TACATAAAACACCTAGATGGTAACAACGAGGCAGCCCTGGAATGCTTACGGCAAGCTGAA 

GAGTTAATCCAGCAAGAACATGCTGACCAAGCAGAAATCAGAAGTCTAGTCACTTGGGGA 

AACTACGCCTGGGTCTACTATCACTTGGGCAGACTCTCAGATGCTCAGATTTATGTAGAT 

MGGTGAMCAMCCTGCAAGAAATTTTCAAATCCATACAGTATTGAGTATTCTGAACTT 
G 

Sequence 21 19 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACCTAACCTACCTTTAAGACTGGGATAACTATT 

GGAAACAATAGCTAATACCGGATATAGTTATTTATCGCATGATGAGTAATAGAAAGGAGC 

TTCACAGCTTCACTTAAAAATGGGGGTGCGGAACATTAGTTAGTTGGTAGGGGTAATGGC 

CTACCAAGACGATGATGTTTAGCCGGGCCGAGAGGCTGTACCTGCCCG 

Sequence 2120 

TGCCCGGGCAGGTAC 1 1 I I 1 1 I I I I I! I I I I I I I I I 1 1 I GGG ATGGAGTCTTGCTCTGTT 

GCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTNTTCCTTCCAGGTTC 

ACGCCATTC TCCTGC CT CANCC TNCCAAGTAGCTGGGACTACAGGTGCCAGCCACCACGC 

CTGGGCTMTTT7TTGTATTTTTAGTAGAGACGGGATTTCACTTGTGTTAGCCAGGGATG 

GTCTCAAACTCCAGACTCTCGTGATCCGCCCACCTTTGGCCTCCCAAAGTGCTGGGGAAT 

TACANGGCATGAGCNCACTTGCGCCCNGGCCAAAAAAACAAACTTTTTTTAAAAAAATGG 

TACCTNGGCTCGCTCTAAGAACTAGGGGGGATCCCCCGGNGCNTGCAGGGAATTCCGATA 

TCAAAGCNTTATCGATTACCCGGTCTGAACCCTTNGAGGGGGGGGGGGCCCGGGTACCCC 

AGCI I I I I IGGT 

Sequence 2121 

GGTACCTTGTCTGGAGAATGCAGTGACAGCACCGGCCCATGCTTGAGAACCCANGCGGCT 

GTGCAGAGGGCAGCCGACCACTATAGCCAGCAGATGGCCCAGCAACTGAGGCTCCCCACA 

GACACGCTCCAGGAGCTGCTGGACGTGCATGCAGCCTGTGAGAGGGAAGCCATTGCAGTC 

TTCATGGAGCACTCCTTCAAGGATGAAAACCATGAATTCCAGAAGAAGCTTGTGGACACC 

ATAGAGAAAAAGAAGGGAGACTTTGTGCTGCANAATGAAGAGGCATCTGCCAAATATTGC 

CAGGCTGAGCTTAAGCGGCTTTCANAGCACCTGACAGAAAGCAT 
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Sequence 2122 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCAGGTACCCTTGGAAGATGGGAA 
AGGTGAGGGAAATATNNGAAGCAGGGTCAGAACATCCACTAAGAACATAGCACCTNAGTA 
NAGCTTACATTATATGAGCCAGGGTAGAGTTANTACTGAAT 
Sequence 2123 

TACTA TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTAC I I I I I I I I I 1 1 1 1 1 I 

TTTTTGCAACTCGAGACGTTAGTTTACTCCAGAGTCATGGAGACAGTGAATGGAGTGTGT 

GTGTGTGTGTGCGTGTGCGTGAGTGAGAGAGAGAGAGAGGAAGAAGGGAGGGAGGGATGG 

TTACAGGAGTGTGGGAAAAGTCAACAGGAAAAGGAGGGAAGGTTCGCTGTNAACTGACAG 

AAACTGTGACATCACAGGCATGTGCGGAGTCCCCGCCCCCCCAGGCACTGTGCCAAGGCC 

AGTACCTGCCCGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCGGNCTGCAGGGAATTCT 

GATATTTAAAGCTTATNNATACCGACAACCTCNGAGnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCGCTTNGCCGTANTCATNGCCAATAGCNTGTTT 

NCTGTGTGAAAANTNGTTTATTCCCGCTCACAATTTNCACACAACATACNAACCCGGGNA 

GCATAAAAGTGTAAAAG 

Sequence 2124 

GCGMTTGGATCTCNCCGCGGTGGCGGCCGAGGTACCTTTTTAAATCTAGCCCAGTATAA 

ACATTA GCCT GCTTAATATTTAGACATTTATAGGTAGAATTCTGAGCACTCAACTCATGT 

TTGGCA7TTTAAAGTAAAAACAAGTGTGACTTCGAGGACCAAAGAAATTGTCAGCTATAC 

ATTTATCTTTATGAACTCATTTATATTCCTTTTTAATGACTCGTTGTTCTAACATTTC 

AGMGTGTTCTTATAAAGGTCTAATGTATCCACAGGCTGTTGTCTTATTAGTAAATGCAA 

AGTMTGACTTTGTCTGTTTTACTCTAGTCTTTAGTACTGGTTGTCAGGATTCAGCCGAA 

TGGCTTGCCTCAGAGGGTCAATGGCGTTCTGAGATGGTGGCCAGTTGTCCAGACGGTAGC 

TTGCTATTGCCAGTCCAGAGGTGAATTCTGGGTTCTTTGGCT 

Sequence 2125 

GCGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGTGAAAGTTCTTGATGAGGG 

TCTCAATGGCCCTCTCCACATCACTGAATTCCTGAGCATCCTCTGCGTTGGCTGACCGAC 

ACTGTGCCATGGTGCCCACTGTGTCTGGTCCTTTGGTGAGAGTTCTGTTGTCCTATAGCT 

GGCCCCAGAGGAGCTGATGGCTCATGATCTGTTGGCAGCCGCTGAGACAAGACAGGAGGC 

CCCGCGTACCTGCCCG 

Sequence 2126 

CCGCGGTGGCGGCCGAGGTACCCGGTGCGCATAAGAGGAAGATTTCTGAAGAGTGCAGCT 

GCCTGAACCNANCCCTGCCGAACAGNTGANAATTGCACTGCANCCATGANTGAGAACAAT 

TAGAATNCCTTGGNGGGCAGCCTACGGNANCTNAAATGCCATTTAACCTGGAACTTGATG 

Sequence 2127 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACGANGTCATATGTTATTTACAATT 

GGGTTTGTGTGGGATGGGAAGTAGGGCGGATGAGCCAGTACTTNTGCAATGAAGATGCAA 

TAGTCNTTGTCCTNTCCCACTGTCNCCTNTTTCCTCACCCNATGGCAGCTTACATGACCT 

ATTCCCAAAGGGTCCACCGAGNCCTGAACTCAGCTTCATCACCAACATTCCTCGCCTTNA 

GNAAGAATTCAACACTGTATAAGGGAGTNGAGGCANAGACTTGGGNCAGGGNGAGGGTGG 

NNAACACNNAAGCACACTNTCTTGTCATCAACCCAAGTTCAGAGACAAGGCCTCCNCANA 

TGNGGAAGATGATGCCCTTAGACACCNCAACCTGNTTGTCNNCNTTCNTNGAAGCNCAAA 

GCAGCCTGNATCTCAACTGAAGAGAAGGGG 

Sequence 2128 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGGGGMTGGAA 

TGGAATGGAATGCAATGGAATGGAATCTTCCGGAATGGAATGGAATGGAATGGAACGGAA 

TGGAACGGAATGGAACGGAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTC 

CGGAATGGAATGGAATGGAATGGAATCAACCCGAGTGCAATGGAATGGAGTGGAATGGAA 

TGGAATGGAATGGAACAACCCGAATGGAATCGAATGTAATGGAGTTGAATAGAATCAATC 

CGAATGTAATGGAATGGAATGGAACCGGAATGGAATTGAATGGAATGGAATGGAATGCAA 
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TGGAATGGAATCAACCCGAGTGCAATGGAATGGAGAGGAATGGAATGGAATGGAAGGGAG 
ACTACCCGAATGGAATGGAATGTAATNGAGTGTAAGGGAATTGAATAGAATCAATCCCAA 
TGTAATGGAATGGAATGGAATGGAATGCAATGGAATGGAATCTTTCCGAATGGAATTG 
Sequence 2129 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACNCGGGGNACTGAAANTCCA 

CACGACANAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTG 

CCTAATTTACAGCACCCATGAGGAAAGGCCACTTANGGATGCAGCAAGAAGGAGCCATCT 

GCMTCCAGGMGAMTTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACT 

TCTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTC 

Sequence 2130 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAACTGAAAATCCAC 

AAGACAGAATAGCCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGC 

CTAATTTACAGCANCCATGAGGAAAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTG 

CAATCCAGGAAGMATTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTT 

CTAGCCTCGAGAACTTACAAATGGTGATGATCATCAGGTCAAGGATAGTCTGGAGCAATT 

GAGATGTCACTTTACATGGGGAGTTATCCATTGATGACGATGAAATGCC 

Sequence 2131 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATCATTAATGTAT 

TATATACACTGATACTTTAAAACTTGTGTGGAAAAAACTAACTTATAATTTTGTATCACA 

CACCCTGGATATGTGTTCTGTTTCTAAGCGACATTTGTGAGAGATTATTGTAAAATGAGA 

GCGAGCAAATAAMCTT AATTT MTCTTTGCAGATACATACTTATGGGAMTTTGAACAA 

ATGAGTG AAACTCT GTGTTTTTAGTAGGCTGTGATAAACATTTCCGGGCACTTTGCAAAA 

GGACTTTC 1 1 J 1 1 1 GCCGGGNGCTTTMTNANTTAATAAAAATTTTTTTAAMGTTAAAA 

A AAAT NGTGGNAAANAAAAC I I I I I I I I I M I NTTTTTAAAAANNAGGNTTNNANNAAAC 

NTTTTNTNTTTNG CCGN ANNAAANCCCCCCCCCCCGGTTTNCNGGGGAAANAAAAAAAAT 

NNGGGCCCNC CNTTTTN TTrTTTTCNGGGGGGGGGGGGGGGGGGGGN l ITTTTTTTTT GN 

GMM GNGNTGTTTTT TNNCCCCCCCCCCCCCTCTTTTTAANANMAAMATTAATTGGG 

GGNNNI I I I I I I I I I INNATTNNNNACCCCCNCNAATTNGGT^^TTTTTAN^JTNAANANC 

CCCCCNGGCCNTTNNTNATANATGCCCCCCNCCCCCCC 

Sequence 2132 

GAGGTGGTTACATTCGTCGAAGGACACCAGCTGCGGAATTTGCGGNTTTGGCAGATTGAA 

ATCATGGCNGGTCCAGAAAGTGATGCGCAATACCAGTTCACTGGTATTAAAAAATATTTC 

AACTCTTATACTCTCACAGGTAGAATGAACTGTGTACAGAATCCATTCTCATTCTTTACT 

TGCTACATTATGACCATGAGGAGGGCANAGTAGAGGTGAACTCTCTGTATACTTGCTGAA 

AGTCTTCTTGTACCT 

Sequence 2133 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCAGTAAGAGATGGG 

GTTTCACTATGTTGGTCACTATGGTCTTGATCTCTTGATCTCGTGATCTACCCACCTTGG 

GTTCCCAAAATGTTGGGATTACAGGTGTGAGCCACTGCACCAGGCAAACTGCGATCTTTT 

AGTGGTGCCTCTTCTCTCTTTTGACTTAAGGATGTTGTCCCTTAAGGAAACCTGGAGGCT 

ACTACTGTGATACACTACTTGAGAGATGGATTGTTGCTCTTTCTTCTACAGTCTTACANG 

GAGT AGATT ATAMGACGGAAGATGTTACCATTTGCNTTAATTGTTGGAAGCTGANAGCT 

TT TAAl I I I iGGTTNCMCTGTTTTTGNGGGANNTCCCGNMAAATTTTNNNNNAATTTT 

TNm i i I i i NGGGAAAAGGGGGCNCNTTTTTAAAAACAGGGGGGGGNNGAAAAAATTTGN 

GGGNGGGGGGCCCCNNCCAAAAAMAAAAAAAMGGGGGTTTTTTNTTNCCNGNNNCCGG 

GGGGGGGGNNGGGNCNNTTTTTAAANCC I II I I I I I I GNTTNCCNCCNAAAAAANNAACC 

CCTNNNGGGGNGTTTTTTTTTTT 



Sequence 2134 

CCGGGCAGGTACAAGAGATAGAAAGACCAGTCCTTGCTGAAAGACAAGTCTGAATGCTCC 
AC l I ! I iCAATTCTCTCTCCATTCTTCAGTAAGTCAACTTCAATGTCGGATGGATGAAAC 
CCAGACACATAGCAATTCAGGAAATTTGACTTTCCATTCTNTGCTGGATGACGTGAGTAA 
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ACCTGAATCTTTGGAGTACAGGACAATNAAGACTACTCCTATNTGCGGAACAACTAGCTT 

TCTATTTAGTTCTAGAATGTTGAAACTGACCGATTGGCTGACATAAAAGTCACATTTTAC 

AAAAAAGTGTCTCCAMTGCTTTGACTAGGGGAAAMCCCCTTTTCAATTAGAGGGAGCC 

ATTNTGCMCA MTTTCCCACAAA TAATTCGCTTATTCCMGGGGCAANGGCACCATTTG 

ATATNGGGAMTTTTTTTGTTTTTNGNGCCAAAATTTAAGGGNA 

Sequence 2135 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCCAAGACTCAGCACTAGTCTGATG 

ACCTGCTAATTCACTGACAGCATAGGGCTGTCTGTTGTTTTTGCGCAAGTTGGTGTGAAC 

AAAGTTCACAATATCTGGTCGAATAGGAGCCTTGMTACAGCAGGCAAAGTGACATTTTT 

GCCAGATGACT CCCCC TTTTCGGAGTACCTTGTTCAAAAAACACCGCTGAGTCACTTCCA 

GGTGCTGTTMGTTTTCTTTAGTGAAGATGTCTATACCAGAGGGAGCATAGTTCCAGATG 

ATTTCCTCAGCGGCAATGTAGTAGTGTCTMCATGCTTCCCACGGATATTATCCTTTTGA 

TGAAGACTTTGTTACACTCCTGGACCTGGAAAAAGGCTTGCAAACCGGC 

Sequence 21 36 

TNGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTGAATGTAGAAA 

CAATGAGGATGGACCTGGTTTAATAATGGAAGAACAGCACAAGTGTTCTTCGAAGAGCCT 

TGAACATAAAACACAGACACCTCCTGTGGAGGAGAATGTAACTCAAAAAATTAGTGCCTG 

GNAAATTTGGGCTGATGAGCTTTCTGGATCCTMGCCACCTACCGAATThrTTGGANGGTT 

GGCTGNNGGTGTGGGGAAACACAAGTCTTTTCCAA7TTTTACAAAACGAACCAATTGACC 

CCAGGGACTTNTTTGGTTTATTGGGTGGGG 

Sequence 2137 

CCGCGGTGGCGGCCGNNGTNCNCGGNGCCNGAAGAGGAAGATTTCTGAAGAGTGCNGCTG 

TCTGAACCGAGCCCTGCCNAACAGCTGANGAATTGNACTGCAACCATGACTGAGAACANT 

AAGAANTCCTTGGAGAGCACCCTACGGCAACTAAAATGCCATTTCACCTTGNAACTTGAT 

GGAGGGGAGAAAACT 

Sequence 2138 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNCNGGCCAGGTACCANATGAANNG 
NNAAGACAAGGCCATNCNCCACTTTATAGAGGGNGTNAAANTAAACCANAGNTCCNGGGA 
GAAAGAAANG 
Sequence 2139 

CCGGGCAGGTAC I I I I I I I I 1 1 I I I I I I I I I II I GTTTNATCATATTTAGGTTATTTATT 

MTGAAAGGTATNTGANATTTTCAGGAATACAAATTTTGCACCCTGATGACCTCAAATGC 

GTGCA ACAAGATGTTTAATACANGAAAATAACACAAAAACTGTTGTTACAGTGGTTAGAA 

Mill AACTTTAAMAACCATGAATTTGTATTGNTTTAATTGCACAATAAAATAATGTTG 

ATATATACTTMGCTTAMTTMTTNCAACANGGNAAACATTTTCCAACCCAGAGGTGTG 

GCCTGATGTTGGGGTTCMNTCTGGACTTNTATTTTTTGGGTANCACACTCMCTTTTGA 

ATTGTNTTAAGGGNTTATTTCNANCCATTCTAACTCTANGAAAAAATNTTNAAATTNCGT 

TTCCCAMGNCATTANCCGNGGMTTTmGTTGATTTTCTNCMGAAAAM 

GAGGTTAAAAGGAAGGNGTNTMTATCACTTMTTATCCCAACATTTTCTAAAGGGGGGG 

GGAAAAC 

Sequence 2140 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTATCAGCTAATTGGGCTCCTTT 

GAAATGCTGAGGATCTGCTTACGCAGGTTAACTCTTTGAGGAAGGGGGTNGGGTAAGTAG 

CCCTTAATGTCTTGTAAATCAAGGGGCCAGATGGCGTTCGTCAGGCTTTCCCAGCTAAGG 

GAGAGTCTACTCATATGGGAAACAAGCCTAGGTAATTAAGGAGACAAAAAGGGAAAATTT 

AAAAATAGGGTTAGTAAAAACAAGGTTAAAATAGGGTTAGTAAAAAAACAAGGTCAGGCA 

TTACAAAA GGAT TCTCCTGTCTCAGCCTCCACTGGGATTACAGGCTCGCACCACCAAGCC 

CAGTTMCTTTTGTATTTTTAGTACCTGCCCGGGCGGC 

Sequence 2141 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI H I I M I I I II I I I I I I I I 1 1 
NTNAGCAAGCACGTGCACTTTATTGAATGACACTGTAAACAGGTGTGTGGGTATAAACTG 
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CTGTATCTAGGGGCAGGACCAAGGGG 
Sequence 21 42 

TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTACCCTTGTCCCTCTCCTAA 

AGGGAGCACAAGGAAACTGAAGAGACTGAAAAAGAAGAGAGTTTGTAGCTGCAAAAGAAT 

AGGGATAGCAAGGAAACCCAGAACTGCATTCCCCTAAGTGGGGCCATCCCATGTGATTGA 

ATTGTCCATAGCTTGCCTATGGTGAGAAATGTGCATGCTCCGTGAGCTGGTCTCTTGAAA 

CAGGACTTATGCTTCCTCTATATTCTGGTTAAATTTTCCAAACACATAAGTTCACTGAGC 

ACAGATTTCTTATCCAGAGACAAGTAGAATCTAACCGCAGACTGTTGGCAGAGTTTCCAG 

GCACTTAGCCATGTTCCCTTCCTGACTCAAATCCCCAAAGGCCTTCACTCTCACTGAGAA 

TCACACTACTGTCCCATAGATAAGGCAGGCATTGAAGCACCTGTCGTGATCCTCTAGGGG 

GGAGAATGAAAGGTTATTTTCCTGCATTGCATCATCATAGCTTTTAATA 

Sequence 2143 

AGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNGGNGCNNNGCTCTACAGAATAGAGGCI 
ATNCTTTAGCTTAAGCCTGTCTGCTGACCAGAGAATGGAATTNTGCGTGGNCTCANGGAA 
CAAAAGGAAACTAGGCAGGGAAGGGGAAGAAAAGTGC 
Sequence 2144 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGNGGNACTTNI I I I I I I II 1 1 1 1 I 1 1 1 1 1 1 

TTTTATMCTGATMTTCTTTATTACTAAAGGTTTATTTACATAGNGTTTANNGCNTAAT 

AAAAAATAAATTACAATACAAAAGTGCTNTTTAGGAAGGAGACACTAAACAACAGGCCCA 

ThnT ACCCC TTGCTNTTTNGCCMGACATMGCTACAAATTTTGCCGGAAAAACTGNCCA 

TACATTTTTAACTACTTCTTCATTATTCTTATGGACCATCATCCAGGACATNTGTTTGAA 

GAAATATCCAGTTATMTATTTTCAAAGGNTNGAATTGNGAAGAAAAAAATATAAAATGT 

GATTAAAGGATNTNTAGCCTTCAG ATGTAATTTACNGGGTTTAAAAATTGGCCCCTTTAA 

AACTTTTGCTTTTTTAGACNAGNNNTAAAAAGGCACAATGGAGACNCCATATTGTTANTG 

GATTTTCANMCNGGGGGNTCTTNTGNGNAATNCNTACCANTTTTTTT 

Sequence 2145 

AGGCACNAGANGANTTTNC I I I I I I I I I I I I I I I I I 1 1 NCTGNNATACAAAGAGCAGATT 

TTTATTGMCTTGGGCNATMCTATATTNCCATACAATNTAMTATTCATGMTAGTTTC 

CCAA GTCTGGAG CGACCACATAGGGAGAAAATGTAMTGTCTCAATTTTTGGTTNCACNA 

AAATGI I I I I I 1 1 ATATCAAAATTTGNTTNTNAAAGCTTGGNGGATTAGCTTAAAAAANA 

AAAAAMGTTTCCTTGAAATCNGGGAAACAAGACATTTTAAATGAATCAANCAAAATTTC 

AAATTAAAAAAATTATGAAAATATTATCCTCATTAGTTTCA7TTAGTGCCCATGGAAANT 

TAATTATTCTCTCTGCNTTGATCTTTGGGGGGACAAGTTTCAATGGAAAGCCTGTCAGNT 

TAANTTCATTAAAAGTTT 

Sequence 2146 

GCTCCCCGCGGTGGCGGCCGAGGTACAAAGGAATAAAAATGTTACAACCTAAAAAAACAA 

AACACCGTAAMCATTTAGAGTMAACATGACAAACGTGCTGCTTTTAGAGGAAAAGAAG 

TTTCATTTGAAGAATTTGGACTTCAAGCTACAACATCTCAATGAATAAGTGCAAGACAAA 

TTGAATCATGCAAGGATTGCCGATTACAAGAAGAATGGGTCGTGAAGGACACGTTAACAT - 

AAGAATCTTCCCTCATCTATCTCTTACTAAAAAGCCTATTGGAGTTAGAATGGGATCTGG 

TAMGGTTCAGCTGAAAMTGAGTAGCTGTTGTTAAAGAAAATACAGTTATGTTTGAAGT 

TGGTGGAGTTAAAGAAGAAATAGCTTCGTGATGCACTTCGTTTAGGTGGACACAAACTAC 

CNTGTTAMTGGMNGMTMTTNNGCANGGGAGGAATAATGGAATTCAAAGATTTAAAG 

AAAAAMCTACTGAGCATTTTAGAACCGTTATTAANTTGMTATTAAATCAGAGCTTTTT 

ACCCCTTCGTTTTAGAAATTAAACCTCAACCAACTNGGATCATNCCTCNAAAATTCAAAC 

MGGTTAGAAMNGGTTTTNCCCGGAATTTTTACNNCAATTTAGCCACCCAAAAAAAAAA 

Sequence 2147 

TCCCCGCGGGGCGGCCGAGGTACATNTGCCTGNCTNCCTNCTGTCCTTCCTTTTTATTAT 

AAGGATACATTTATAGNACCCCATAGAAGGAAAAGATAAATTTCATAGGCTGNTAAAAGA 

GGCTAGGCCTAAGTTATAATGCCTCCTCCTCACAGNCCAATTTNCCCAAGGGGCNTTANC 
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t 

ACCAGAGCAGNTTTTCTAGCTTGNGGACAATNCCNNCAGGCTTGAGTGATAATGNCCCNG 

TNGCGGTAGCTCTCCACTTGNTNAAGGACCAAANACACCTTAGCAG 

Sequence 2148 

GGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I II 1 1 i I I M I II I 
I I I I I I (GACTTTATATATMTACTTTATTTTATACAATTCAGTATATCATGTTAACATA 
TTTCACTTTTAATTTTATGATATGTC 
TATCATA ATTTTTTTAAAGTTTGW 

TTTTTTrCCTGTAGCTTCTGAGMCACATTTTTAGATACCCGGCTTCTAGTTATACCTG 
AGCTCCACAGTGTAGACATGTTTTGGCCMCCTTGTTTTATCGGTGTATGAMTTTGTGC 
TMTAGGATGGATCCAATTGTCATTACATTAGAAAACTAAATGGGAAACATTTCTT 
Sequence 2149 

TCCTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I I I I I I I 
I M 1 1 I I I II I I I I GACT TTATATATMTACTTTATTTTATACAATTCAGTATATCATGT 
TMCATATTTCACTTTTAAT 

ATTATA CTATCATA A I I I M I \ AMGTTTGTATCTTTCATTAATAAGGAGGTTCCTTATT 

AATGGAI I I II I I ICCTGTAGCTTCTGAGAACACATTTTTAGATACCCGGCTTCTAGTTA 

TACCTGMGCTCCACAGTGTANACATGTTTTGGCCMCCTTGTTTTATCGGNGTATGAAA 

TTTGTGCTAATAGGATGGATCCMTTGTCATTACATTAGAAACTMTNGGGMCATTTTT 

TTTTACATTNGGGGCAACNTACCAAAATTTT 

Sequence 2150 

CCGGGCAGGTACACGGGCAAAGGGGCTTGAGAAGGCCCGGNGGCGAAGCCGAAGAGAAGC 

AACTGTGCCCCGGAGAAGAGAAGCTCGCCCATTCCAGACTGGGAACCAGCTTTCAGTGAA 

GATGGCAGGGCCAGAACTGTTGCTCGACTCCAACATCCGCCTCTGGGTGGTCTTACCCAT 

CGTTATCATCACTTTCTTCGTAGGCATGATCCGCCACTACGTGTCCATCCTGCTGCAGAG 

CGACAAGAAGCTCACCCAGGAACAAGTATCTGACAGTCAAGTCCTAATTCGAAGCAGAGT 

CCTCAGGGAMATGGAAAATACATTCCCAAACAGTCTTTCTTGACACGAAMTATTATTT 

CMCAACCCAGAGGATGGATTTTTCAAAAAAACTAAACGGAAGGTAGTGCCACCTTCTCC 

TATGACTGATCCTACTATGTTGACAGACATGATGAAAGGGAATGTAACAAATGTCCTCCC 

TATGATTCTTATTGGTGGATGGATCAACATGACATTCTCAGGCTTTGTCACAACCAAGGT 

CCATTTCCACTGACCCTTCGTTTTAAGC 

Sequence 2151 

CCGGGCAGGTACGCGGGGNANTGCNANANACNCAAANCNNGNTANTACANTGCATCAAAC 

ATGTTCMGATTNNCCMTTGACGGGATTGGATTNAMGATATNCCACCACTTTTAGCAA 

GATGGNGAAGTGCTAAATNACACAATTAATCAACTGGCTGAGTTAGCTAAAGATGCATAT 

GTTATTAT AGGT CCANACGCAAGANGTTTCTTNCTTGGGACACCTACTGCANCTNTTTTA 

AAAAAACCTTTTATTATGGTAAGAAAACCTAAA 

Sequence 2152 

AGGTACAGCCTCTCGGCCCGGCTAAACATCATCGTCTTGGNNGGCCATTACCCTACCAAC 

TMCTAATGTTCCGCACCCCCATTTTTMGTGMGCTGTGAAGCTCCTTTCTATTACTCA 

TCATGCGATAMTMCTATATCCGGTATTAGCTATTGTTTCCAATAGTTATCCCAGTCTT 

AAAGGTAGGTTAGGTACCTGCCCG 

Sequence 21 53 

AGGTACACTTCTAGCACCTAGCAGAGAGAGGCTTCACTACATNATGCTTCCTGACATCTC 

TCCCTTTGMGAGCAGTCAGACTCCTGCTTTGCTCTTCAGACTTMTTTGGGGGTTTAAC 

AGGTGAGGTTGCTGGGGGMCTCTTTTACAACATCTCTCTGAAAGAATCCGGGCTGCCAG 

TTTCATTTGGTTTGGGTGTCAGTAGCATGATGGAAAGACAAAAAAACACAACTTGACATC 

TGCAGAMTGGGTTCAMTTTTACCTGCAACTCACCAATTCTGTGGCCTTGGTTCAGCAA 

TTAAACTCCCTAAMTTCAGTTTTTTCTTTGTAAMTGGGGTTATGAACAGTACCAAATG 

AAGTGTGAAGACAAGGCCATCCACCACTTTATAGAGGGTGTAAAAATAAACCAGAAATCA 

AGGGAGAAAGAAAAGATGAAAGACAAACTGCAAAAAATTGCCAAAATGCGACTTTCTAAA 

MTGGAGCAGATTCTGAGGCTTTGCATGTCTTGGCATTCCTTCAGGAGCTGAATGAAAAA 
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ATGCAACAAGCAGATGAAGACTCTGAGAGGGGTTTGGAGTCTGGAAGCCTCATCCCTTCA 

GCATCAAGCTGGAATGGGG 

Sequence 2154 

GNCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTGTGCACGTNGGTTAN 
AGGCTGCGTGGCAGGANNGNGNTCAGATTNTCCCCTGCACNGNAATTGGGCTTTNAGGGG 
GAAATGGTGGGGCCATCT 
Sequence 2155 

GAGCTCCCCGCCGTGGCGGCCGCCCGGGCAAGGTACANGTGTGTGTGGGTGAAATGGAGA 

TTTGGAATTGAACTCTCTGCCTGTAAATGTTCCCCAAATAATTGTTGTGTGTATGATACC 

GTGTATAATAAAAGTATTCTTGTTAGAATCTGAAAAAAAAAAAATGTAAAAAAANNNTAA 

AATAAAGGTTCCTTGGCCCGCTNTANAACTAGTTGGATCCCCCGGG 

Sequence 2156 

CGGCCGCCCGGGCANGGTACGCGGGGACACAAGACATCATCTTGAAGGAAGGATGGCTTT 

GGCCAGACCAAGACCGAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGCTATGCACA 

CTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGTCACTGGTG 

CAGTCACGACAGTAGGTGTGGCAGCAGGCCTGCTGGCTGCCGCAAGCCTTGTGGGGATCC 

TGCTGGCCAGAAGCAAGCGGGAAAGGCAATAAATCCAAGAAATTGTCCCAACAACCACCA 

ATTNT TACG GAGGMTATTATTTAGCCAGCMGGAGTGGAGTTTTGGTTTAC7TGATTTT 

ACT GNTT TTNTGGTTCATGGAATCTTTATTTTTAATTGGAGTTAAAMNC 

GTNTTTTGGAAATTGCACCTTNTTATNGMTTNTTTTTAAAGACACMNTTNG 

CCNAAAAAAAAAAAAA 

Sequence 2157 

CCGGGCAGGTACCATTTGGAGTTTAATTGCTTCGCTCCGATGAATGAATTCCTGGCCAAT 

GCACCAAAATGATACGGCTCCGATGACTGGAGGAACACCAGGGTCCTTGGTCTCGCACCA 

GTTTAGATAAMTGACACAGACACACATGTMTGGTTTTAAGGAGTGGAGAGTTTATTAG 

GCAAGAAGGAAGGAAGAAGAAAACAGCTCCCCCATACAGAGACAGAGGGAGGGGGGATTA 

GAACAAACAGAAACTTCCCCGCGTACCT 

Sequence 2158 

CCGGGCAGGTACAGCGTCATATAGGCTTTGCCTTTAATGATCTCTTACGGTTAGAAAACA 
CAATAAAAACAAACTGTTCGGCTACTGGACAGGTTGTATATTACCAGATCATCACTAGCC 
AGATGTNACG TTGG CACATTTGAAGTCCTTTATTGAMTTCATAAATAAAAGAATTGTTC 
TTTCTTTGTGGTTTT MTAAGAGTTCMGGAATTGNTCAGAGTCTTGTAAMTGTTATTT 
TA ATM TCCCTTTTAMTTTTTTATCNTGTTGCTGTT^ 

TATTTTAGATTTGCCTAATGNCCANTTCATTTCAAGGNAAAATTGCCCAAAGAGGGGTAT 
TTCC CCTT NGGG GGAAA ANNGGGGGNCCNTCTTTACCAAGTGGTAAAA NI 1 1 I I I I I CCC 
TCCCTTTTMCCCTTTTTGCTTTMTCATTCCMN^ 

MTNCCCCTTTGGTTGMGNGGCAANGTTTTGTTTGGAACCTTGGAAGTTT 
Sequence 2159 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCATCCTACCCGG 

CCTGGGCTGACCCATGGGGMGGCTGGCTMTTTCAGTGCTTCTGCTTGGTTGTTCAGGG 

CCATT TCAG GTTTGGGTGTTTTCTGGGGATGTTAACATGGGATTCAGGCTCAACTCACAA 

GAMCTTTTCCATCTCATGATGGATGCTGTTGGGCATGTCCAATGTATGACTTCATGAGT 

TACACAGATGCTMTTCGTAGGGGCACTTGGMTCACATGGTTGTTTTGTGTCCCATGGT 

CAAGCATTCTATCTTATCAGGGCCTACAGTAACATGCCAAAAGTTGCTTCCAACATATTT 

CTCTGCTTTGGATG 

Sequence 2160 

AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCCCCAGTCGTGGCCCTCTGGACAA 
GTGGCGGGCCCTGCACTCATGAGGGCTTCCAATGTGCTGCCCCCCTCTTAATACTCATTG 
TCMTTTGAGAAAMGGACATATGAGTTTTTGCATTTATTMTGAMCTTCCTTTGAAAA 
ACTGCTTTGMTTATGATCTCTGATTCATTGTCCATTTTACTACCAAATATTAACTAAGG 
CCTTATTAATTTTTATATAAATTATATCTTG NCCTAAAAAAAAAAAAAAA AAAA 
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Sequence 21 61 

ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCACGTAAAGACGTTAGGTCA 

AGGTGTAGCCCATGAGGTGGCAAGAAATGGGCTACATTTTCTACCCCAGAAAACTACGAT. 

AGCCCTTATGAAACTTAAGGGTCGTTGAAAANAAANGANTCANAANTNTANAAAAAAATN 

ANGNNCCTN 

Sequence 21 62 

NCCGGNCAGGTACGCGGGGCACAGCGGCTTNCTTGATCCTTGCCACCCGCGACTGAACAC 

CGACAGCAGCATGNCTCACCATGAAGTTGCTGATGGTCCTAATGCTNGNGGTCCCTTTNC 

CMCNACTGTTTACGCANGGCTTCTGNCATGCTCCCTTTATTTGGAGMTTGNGANTTTT 

T CAAGN ACAATTNCAATTNCACCAAGTTGTTCTAATNACCTGAAATATCCAACAGANACN 

NTTTTTNMNGMGTTTTCANTAANACCNNACMNTGGCCNACTTTACNAAAATTGGCCA 

TTATTATTGAAATNTGTAANNGNANATTGTTT 

Sequence 21 63 

GACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGACAACACAAA 

TTACTGCAACTCCAACAACTATGGCAATAACAATCACAGCAATGACCCAAACATCCAAAC 

TGTCMGTATTCCTTCCTCAGGTTTAGGATATCCTGGATAATTTGAGACTGGAGGCTTGT 

AAGTAGGCCTAGGACCTGAGGCACTGGACGCTGGAGATTTTGTAGTGGAAGAAGTCGACA 

CTTTAAGACACTTTGGAACTGGGGGATCCCAAGTACTTGTGGAAGGTGGACACCATCACA 

TCCAGGGCCTTCTCCAGAGGGCACGCCATGACAGCAGTCAGGATCAAACCAAGAAAGAAG 

AAGCGCTTAGGAGAGAGGGGTTGTAGAGAGGGGGAAAGAATTCCCGCGTACCTTGCCCCG 

GGCGGCCCGCTCTAGMCTAGGTGGGATTCCCCCNGGCTTGCCGGGMTTCCNATTTTCA 

AAGCTTTNTCGGATNACCCGGCGACCCTTNCAAGGGGGGG 

Sequence 21 64 

ACTATA GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTA CI I I I I I I I I N I I I I I 

Tl I II IGCGAGATGGNGTCTTGCTCTGTTGCCCAAGCTGGAGNGNAGGGGCGCAATCTTG 

GCTCACTGCAACCTCTTCCTTCCAGGTTCACGCCATTCTCCTGCCTNAGCCTCCCAAGTA 

GCTGGGACTACAGGNGCCAGCCACCACGCCTGGCTAATTTTTTGTATTTTTAGTAGAGAC 

GGGATTTCACTGTGTTAGCCAGGATGGTCTCAAACTCCAGACCTGGNGATCCGCCCACCT 

GGGG CTCCCNAAGNGCTGGGATTACNGGCATGAGCCACTGGGCCCNGGCAAAAAAACAAC 

TTTITAAAAAATGGACCCTGGCCCNGGGCGGGGCCGTTTTTAAAAACTAANGGGGGATNC 

CCCCCNGGCCTNGCANGGGMTTTCCAANNATCAMGCTTTTNTTTGGATAACCCGCCNN 

CCCCNNTNNGGGGGGGGGGG 

Sequence 21 65 

AGGTACMGATAGTCATCTCAGTAAMGGTCTATTATCTMCTTGCCAAACTTGTTTACT 

GAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGNGCACAGCTTGNAAAGCANTTAGAG 

TMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGCCTGGCTGAAGGCCCANG 

AGGGAAGGAAATATAAGAACCAACAATAAAAATAGCAANTNGCAATAAGGAAGAATGCCA 

TCCCCATGGTAGGCANCACNCAATANATTTNTGGAAACCCACCTNTTCCCGGGATCANGG 

CCTTTCCATTTGCTCACCGGATGCNTNACCGCCTGGGGCCGGGCCCGNCTTCTAAGAACN 

TAGNTGGGATCCCCCCGGGGCTTGGCMGGGMMTTTCTAATMTTCAAAGCCTTTAAN 

TNGG AATACCC CGTNCGGAACCCCTTCCGAGGGGGGGGGGGGGCCCCCGGGTTACCCCCA 

AGGCI I 1 1 I I I GGNNTTCC 

Sequence 2166 

CCGGGCAGGTACGCGGNGACGAAGTTCCGGTCCAGGTCTCTGACTTCGGGCTTGTTCGCT 
GGTGGCNGTCGGAGCCGAGCCGGACTGGTCAGGATGANCACGGACGTGCAGNTCGCCATC 
TTCGACAACATG 
Sequence 2167 

AG GTACT CCTGGGCAACAGCGAGACTCCGTNTCAAAAAAAAAAAAAAAAAAAGAAACCAT 
TTATTTTAAAMTGATTAGATTGCTATGCCTCAACTCATAGAAGATGAACCCTTCAAGAA 
AACGTGAAGNAGGAACCGGGNGGGCCANNAAATGAAAACAGGCAAGTANAGNTATTANTT 
NGGAAAAACATTTTNTCAACACCAAATGTTAAAMGACTTTCCTTTTGNTAAAACCTGGG 
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ATTATGAGNMGMCTTTTTCMTTGGGGTTANNTTTCTAGGGATANACTCMGTATNTTT 
GCANCCACCTTMNAAMCTTGCCATTAGAAAAMCCTGNAAMGGTTTATTGTTTCCCA 
GNATAACTTTTCCGTTGTTTTACCCAAA 1 1 I I I I NTTTAAGAACTTTG 
Sequence 2168 

AGGTACTCCAGGGCTTCATTCATATTTCCTTCCAATTTGTAGACGAACCCAAGGAGGCTC 

MGCTTTCCAGATCTMTGCCTTTCTCCGAAGTTTCCTTAAMCCAATTTCTTCAMGA^ 

TTTGATACTTTTATCCCTTTGTTMTGATAGCCCTGTTCTAT^ 

MTGGATMTTTGCATTGACTGTCAGATTTCTTTTGAAATTCCTGNAAACNCGACNCATN 
AGTTGG AAAATTGN TATGTCTCTGCATTGTTTTCTTCTACCACNTGGNTTTCATCGCNAT 
TAAACAAI mm IGAAAAMTTTCTCTTTCAAGCCTTTNTCTGTGGATTGGCCTGCCT 
TCTAATAT TACCMTTTC CTTGGCCANGGGTTCTTAGGATGTAGCCCACCCTCAAAAATG 
TGGGGCCI I I I 1 1 1 I I ICCCACCTGGCAAGAATTCAAANAATCGGAAAAATAATGGGGCC 
NTG 

Sequence 2169 

CGAGGTACATTTTNAMGAGTTGTTTTTTGGCCGGNNTTCAGTGGCTCANGCCTGAAAT 

CCAGCACTTTGGGAGGCCGAGGTGGGCGGANCACGAGGGCTGGAGATNGAGACCATCCTG 

GCTAACAAAAGAAGAAANCCCGTCTCTACNAAAAATACAANAAAATTAGCCAGGCGNGGG 

NGGCTGNGCACNCTAGTAGGCCCCAGGCCTACTTTNGGCAGGCCTGAGTGCAGGNAGTAA 

TGGNCCGCGCANCCCTGNCAAGNGAAANTAAGGNTNTGCNGGTCGGAGNCCCAAAGNANT 

GCGGNCCCCTTGCCACNCCCAAGGCNCTNGGGCCAAACCAGAAGGCNAAAGGANCTCCCT 

ATTCCTCNAANAACNCAAGATATACCNCAANNGCGAGGGAGGTTTNNGGTTNCTTACCTC 

ANTTGGGCNNCAAATTNATAAGGGNTCAATANNANCNAGGNNTAACCAATAAANNACCCN 

CACAGGGGNNTCCTTTAACCAAACCCNTAAAANNCACCTGGGGTCAAGTCCNAAATAAAA 

A 

Sequence 2170 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACCTGCTGGGAGAAGTT 

TCTGAACAACNATATCTGGGATACACAGAAAAGTNTGGAAGANGAGAAAGAAANGCCTAA 

ATNGGAATGAGATCCAAGACTAAACGCNAGAGCTAGATTGAGCCGCATTTGAAANCTCCT 

TCCCNTTGGGGCNTTGGCAGAGGGGGAGAAAAGGCTTCAAAGGAACTNGGTGGCATNANC 

ACCCCCCTCCCCCAATGAGGACACCT 

Sequence 2171 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCACTGAGCTCAAGA 

CACACTTGGTGCTTGCAGTGCTGATACCTGCTCTTTGCCGGCCCCCGCGTACCATCCCAT 

GTGGAATCTGTGAGTGTCCTCTTAAGTAGCGTGGGCTAGCCAATCTGCTCGTTNATGGGT 

GTATTTGTAAACTCCGAATTCCATATGTAATANGGATGCAAGTCTAAGCGTTTCATGTGG 

ACATAAATGTATCTAMTMAACTTTCCCTAGCACTGTGGCTGACCTCACCCTTACTTTT 

ATACTTTAGTATGAAACTGATGAGAACTTTGGTAGNGAGT A I I I I I M I ATATATATACA 

TATATATGTATCTATCTATATATATATCTCAAGCATCTTTCAGGTCTTTGTGTNGTGGNT 

TTTNTTAAAGCCCCTGTTGTAAAAAAAATTACTATTGTGGGATGGGCAGTCTCTCACATC 

ACAGATGTNGAAAAGNATAATTTTTATANTNNGTATTTNCAAANAAAATAAAATCTGGGG 

MAGGTNNCATTCCTNTACTGGNGGNCCAAMAATCAATNGGTTTTGTNTGNCCAAAAAA 

AAATATTAAAATTAMMTATANTTNTMTTGAACTTAAAAAAAAA 

Sequence 2172 

AGGGTACATTTTTAAAGAGTTGTTTTTTGGCCGGGCGCAGTGGCTCATGCNTGTAATCCC 

AGNACTTTGGGNGGCCGNGGNGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGC 

TAACACAGTGAAATCCCGTCTNTACTAAMATACAAAAAAATTTAGCCAGGCGTGGTGGG 

CTGNNAACCAGTAGTNNCAGNTACTTGGGAGGCTGAGGNANNGANAATGGGCCNTGAACC 

TGGAAGGAAGAGGTTGCANTGAGCCAAAAANTGCGCCCCTTGCAACTNCAGCCTTGGGCA 

ACAAGANCAAGACTCCCATCTCAAGAAAAAAAAAACAAATAAGNAAAAGTACCTTGCCCG 

GGCGGCCCGCTCTANAAACTAGNGGATCCGCCCGGGGCTGNAGGGAAANTCTATANCAAA 

GCTTANNGAATTCCGCCNACCTNGGAGGGGGG 
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Sequence 2173 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATCCCAGCCCCCCAGGTTCGC 

CTAATAGTCTTGGCTACTTCCCTACAGCTGCTAATCTTAGCGGTGTCCCTCCACAGCCTG 

GCACGGTGGTCAGAATGCAGGGCCTGGCCTACAATACTGGAGTTAAGGAAATTCTTAACT 

TCTTCCAAGGTTACCAGTATGCAACCGAGGATGGACTTATACACACAAATGACCAGGCCA 

GGACTCTACCCAMGAATGGGTTTGTATTTAAGGGCCCCAGCAGTTAGAACATCCTCAGA 

AAAGAAGTGTTTGAAAGATGTATGGTGATCTTGAAACCTCCAGACACAAGAAAACTTCTA 

GCAMTTCAGGGGAAGTTTGTCTACACTCAGGCTGCAGTATTTTCAGCAMCTTGATTGG 

Sequence 21 74 

CCGCGGTGGCGGCCGAGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGT 

AGT7TGTTGGGCCGGGTTCTGAGGCCTTGCTTCTCTTTACTTTTCCACTCTAGGCCACGA 

TGCCGCAGTACGCGGGGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGAT 

GGCAGCCACCACGGGCTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACT 

CGAAGAAGGCCAGAAAGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGA 

AGACATGACACTTACAAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGA 

AAACCGAATATACAGCCTTAAAATAGAATGTGGGACCTAAATACCCAGAAGCACCCCCCT 

TTGTAAGATTTGTAACAAAAATTAATATGAATGGAGTAAATAG 

Sequence 2175 

CCGCGGTGGCGGCCGAGGTACCTTAAAGTCCTCCTGGCTCTGAAGCTTCATAAGATGCGT 
GAAGAAGGTGAAGAG GAAGGNGAAGGAGTCGAGCGGCCGCCCGGGCAGGTAC l I I 1 1 1 II 
I 1 1 I I I I 1 I I 1 1 I I I I 1 I GGGGTTTATCATATTTAGGTTATTTATTAATGAAAATATATG 
ACATTTTCAGGAATACAMTTTTGCACCCTGTGACCTCAAATGCGTGCAACAAGATGTTT 
AATACAGAAAATAACACAAAAACTGTTGTTACAGGGGTTAGAATTTTTAACTTTA 
Sequence 2176 

CCGCGGTGGCGGCCGAGGTACTTCTTACAGTCTTCAGGAAATTCATTAAATCAGTGCCTC 

CAG7TCCTTTGGCTTCCAGTTTTGMGGGTCTTCAGAGGTCTTATTCTCCTTTGGCTGCT 

GGCTTGCAGGAATCAGGATGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAA 

CCCGGTATGGTGGTCACCTTTGCTCCAGTCAACGTTACAACGGAAGTAAAATCTGTCGAA 

ATGCACCATGAAGCTTTGAGTGAAGCTCTTCCTGGGGACAATGTGGGCTTCAATGTC 

Sequence 2177 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTTGTAGAGTGGTGCTGCTTTAATTCATAAATC 

ACAAATAAAAGCCAATTAGCTCTATAACTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 

AAAAAAAAAAAAAAAAGGAAAAAAAAAAAAAAAAAAAAAAAAAGGTCCGGNNCANTCGNG 

GNGGCNTAACCNCNCTNGGGGNGGNTNTAGGGNTGGAAAACNCTNAAAATNNNTGTTTTN 

AN NG G G NTAAAG N NTTG AAATTTTG CC AG NTTT^A AAGTC ATTAAAG NCM AAGTTTTT 

TTAAGGGGTN I I I I I I 1 1 1 1 AANAAAACCCATTAACNTTCCATTGGGCCTTTANAAANGG 

CCCCCGGGGGTTTTNGGNNAMNTTNTTTTT^ 

GGGGCCCCCMNTTTTTNGNTGNCCCCNCCCNCCCCCANTTGGGGGGGNNGGGNGGGGAN 
ANCCCCCN I I H I I I I I 1 1 
Sequence 2178 

GGGCGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I M I I I I I M 1 1 l I I 
M 1 1 I I I I I I I I I GCCAATTGTTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTG 
CAATCCAGCTAGGCNTGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCG 
AGAGCACAAAGATTCTAGGATACTGCGAGCAAATGGGGTGGNGGGGTGCTCTCCTGAGCT 
ACANAAGGAATGATCTGGTGGCMNAMTTAACCANTGCTTTTATTATCNANGCACTTNN 
CACTATGCACTTTTTCTGAAATATnTTGTAACACI Mil I GTATTTTTGCCATTTGAA 
Sequence 2179 

CCGGGCAGGTACAAATGATGAAACGGAAAGACAMGGAAATTTTCCATTTTTGAAGAAAA 
AGTGTTCAGTGTTAATGGAGCCNGGGGAAACCACATGGACTTTGGTCAGCTCTATCAGTT 
CTTAMCACCAAAGGATGTGGGGATGTTTTCCAGATGTTCTTTGGTGTANAAGGACAATG 



WO 01/070979 



PCT/US01/09126 



TABLE 1 

363/467 

ACATCAAGAGTNGTTGAANGTATCTTGCCACTGNTGGCCTTTTGAI I 1 I 1 INTCCCACTT 

TTTCTTGAMGATTMGTAATTTTATTTTAGTTCCATTCTAGMTGTTGGGGAGTGNGGC 

ACAAGAAAAAATANTATANCTGAAATGCATCTGTTAAAAATGTNATGATTGNAAGCATAA 

CTGAGTTTCA 

Sequence 2180 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGTAGCATGCCAGGAAAGGAAGAGAC 

TGGGATGGTTTTTTATCTGTCGCTTTCTTAAATCAAGGGCCGCCGGGCCGGAGATGGATG 

GAGGGACCGGGGATTTGGGAACTCGAAAACGAGCTGAGGGAAGGGAGCCTGTGGAAATAG 

ACTGGAGTCTGGGTAGTGTCGTTTCCTAGAGAATGGTCTCGAAGTAACTTCTCGGTAAAG 

TCTTCACGGMTTTCCAGACCACGCTTGCCCACTGGGAGGC7TTTAGGACCCGAGACGTG 

TGCAGGCTTTTCCAGCCAAAATGMGTTTAATCCCTTTGTGACTTCCGACCGAAGCAAGA 

ATCGCAAAAGGCATTTCAATGCACCTTCCCACATTCGAAGGAAGATTATGTCTTCCCCTC 

TT 

Sequence 2181 

I I I I 1 1 I I I I I I I INGTTACATAAATTAACCCATTTATTATAGGCCAGTGATGTCTCAAA 

GAGTAGAGGAGCGTCTACTGGTCTTTCMCTCCTTCAGTCTTCTGATGGCGGACTTTACC 

GNGACAGCGGAAGTGGTATTGNACCTGATTTTATTTCCAGTTTTGATCCGAATCCACTGG 

GGMTGGGACGATTTTGCTTTTGTTTCTTGGCCAGGAATCGCTTAATCCT 

Sequence 2182 

AGGTACTCATCGGCCAGCACGGAGATGCACAGGTTAAATGGTTTACCATCCTGAAAGGGC 
ATATTGNGGCATGTCACCTCATACTGCCAAGCCCCATTCACGCGGCTGTTCATGACCACC 
CAATGACCAAAGTACCTGCCCG 
Sequence 2183 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGCCCGAAGCGTT 

TACTTTGAAAAAATTAAGAGTGTTCAAAGNAGGCCCGAGCCGCCTGGATACCGCAGGTAG 

GAATAATGGMTMGGACCGCGGTTCTATTTTGNTGGTTTTCGGAACTGAGGCCATGATT 

AAGAGGGA 

Sequence 2184 

AGGTACGCGGGGACTCGCGTCGGTTGGCGACTCCCGGACGTAGGTAGTTTGTTGGGCCGG 

GTTCTGAGGCCTTGCTTCTCTTTACTTTTCCACTCTAGGCCACGATGCCGCAGTACGCGG 

GGGGGTGAAGAAGGGGCCGGCCTTCAAGCAACAGCGACGCAAGATGGCAGCCACCACGGG 

CTCGGGAGTAAAAGTCCCTCGCAATTTCCGACTGTTGGAAGAACTCGAAGAAGGCCAGAA 

AGGAGTAGGAGATGGCACAGTTAGCTGGGGTCTAGAAGATGACGAAGACATGACACTTAC 

AAGATGGACAGGGATGATAATTGGGCCTCCAAGAACAATTTATGAAAACCGAATATACAG 

CCTTAAAATAGAATGTGGACCTAAATACCCAGAAGCACCCCCCTTTGTAAGATTTGTAAC 

AAAAATTAATATGAATGGAGTAAATAGTTCTAATGGAGTGGTGGACCCAAGAGCCATATC 

AGTGCTAGCAAAAATG 

Sequence 21 85 

CCGCGGNGGCGGCCGCCCGGGCCGGTACGCGGGGAACTGAAAATCCACAAGACAGANTAN 

CCAGATCTCAGAGGAGCCTGGCTAAGCAAAACCCTGCAGAACGGCTGCCTAATTTACAGC 

AACCATGAGGAAAGGCCACTTAAGGATGCAGCAAGAAGGAGCCATCTGCAATCCAGGAAG 

AMTTCCTTGCCAGGAACCAAATTGGTTGTCACCTTCATCTAGGACTTCTAGCCTCGAGA 

ACTTACAAATGGTGATGATCATNAGGTCAAGGATAGTCTGGAGCAATTGAGATGTCACTT 

TAC 

Sequence 2186 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I 1 1 I I I I I I I I I M II I M I I 1 1 1 I I NGGGNA 

AAGAAACCTTTAATGAGGATTCAAGGTTAATAAGGAAGACNCAGAGGGCCAGCACTCAGC 

CCCAACCTNCTCACGTGTACCAAGAAAAATAAAGAAGAGGCTGCAGAATATGCTAAACTT 

TTGGCCAAGAGAATGAAGGAGGCTAAGGAGAAGCGCCAGGAACAAATTGCGAAGAGACNC 

ANACTTTCCTNTCTGCGAGCTTNTACTTNTAAGTCTGMTCCAGTCAGAAATAAGATTTT 

TTGNGTAACAAATAAATAAGATCAGAGTCGGTAAAAAAAA 
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Sequence 2187 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAAAGACTATTGTAGAGACTATCCGGGTTAGTT 

TGCAAGGGAAGCCAATGATGAGTAATTTGAAAGAAATTCACCTGGTGAGCAATGAGGACC 

CTACTGTTGCTGCCTTTAAAGCTGCTTCAGAATTCATCCTAGGGAAGAGTGAGCTGGGAC 

AAGAAACCACCCCTTCTTTCAATGCAATGGTCGTGAACAACCTGACCCTCCAGATTGTCC 

AGGGCCACATTGAATGGCAGACGGCAGATGTAATTGTTAATTCTGTAAACCCACATGATA 

TTACAGTTGGACCTGTGGCAAAGTCAATTCTACAACAAGCAGGAGTTGAAATGAAATCGG 

AATTTCTTGCCACAAAGGCTAMCAGTTTCAACGGTCCCAGTTGGTACCTCGGCCGCTCT 

AGAACTAG 

Sequence 2188 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCCAGGACAAA 

TCTCTCCACTTGGAAGGAGATCCCAATCCTTCTGCAGCCCCAACATCCACCTGCGCACCT 

AGGAAAATGCCCAAAAGGATTTCAATATCCAAACAACTGGCTTCAGTGAAAGCTCTGAGG 

MGTGCTCAGATCTGGAAAAAGCTATTGCCACCACTGCTCTGATTTTCAGAAATTCTTCT 

GACTCTGATGGTAAACTTGAAAAAGCTATTGCCAAAGATCTGCTGCAAACCCAATTTAGG 

MTTTCGCAGAGGGACMGAMCX)AAGCCAAMTACAGAGAGATCCTTTCTGAACTTGAT 

GAGCACACAGAAAATAAGCTAGATTTTGAAGACTTCATGATCTTGCTCTTAAGCATCACT 

GTCATGTCAGATCTGCTACAAAATATACGGAATGTAAAAATTATGAAATGAACAGTTTTA 

AATATGCTGTATAAAATAATGGCAAAAGACAGTGTTATTAAAATGTTTCCATCTCAAAAA 

AAAAAAAAAAAAAAA 

Sequence 21 89 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAAGTTGAAAGTCACA 
MACCATAGMAAGTCTCAGTTTCCTACTGTAGAATTCTTGGCATTTCAGTAGTCACAGT 
TAMCTTGTTCTAGGTTTTTCAGATGCTCTTTTCCAGGACTTCATATGTTTTGATATT^ 
TAAAAATTCTTTCTTTCAAC I I I I I I G ATTATAAGCCTTGCTTGCTTCG ATTTGGGATTT 
AGGCATCGTTGTCCTTTATTTTCTTTCAGGGCMTAATC^ 

TTACTTGGGTACCTAAAACCCGAAGAACCTTCTGTAAGAAGTGTGGCAAGCATCAGCCTC 

ACAAAGTGACACAGTATAAGAAGGGCAAGGATTCTTTGTATGCCCAGGGAAGGAGGCGCT 

ATGATCGGAAGCAGAGTGGCTATGGTGGGCAGACAAAGCCAATTTTCCGGAAGAAGGCTA 

AAGACCACAAAGAAAGA 

Sequence 2190 

CCGCGGTGGCGGCCGAGGNAC I I I I I I I I M I I I I I i I I II I I I I I I C M I I I I I 1 1 I I I 

TGCNANCACANAGGTCAGCAACAAGTTTATTTTGCAGCTAGCAAGGNAACAGGGTAGGGC 

ATGGTTNCATGTTTAGGTCAACTTCCTTTGTCGNGGTTGATTGGTTTGTCTTTATGGGGG 

GGGGNGGGNNAGNNGAANTCCAATNTGAATCCCACAGAAACCAGGGGCTGTANAAACANA 

ACCTGANAAGAACGAGCACTCAACCNAGCTNACTNNGGTTCANNANAAAAAATNCCAANA 

AGGAAAGCCAA 

Sequence 2191 

CCGCGGTGGCGGCCGAGGTACAAAGATAAGTCATCTCAGTAAAAGGTCTATTATCTAACT 
TGCCAAACTTGTTTACTGAGAGCCCTAAGGAACTAAAACTGCCATAATGCCGTGCACAGC 
TTGAAAAGCAATTAGAGTMGCMGATTAGTTTTTCCTCCCTTCCAGTTCCTCAGCAGGC 
CTGGCTGMGGCCCAGGAGGGANGGANTTTTNANACCCCCTTTAAANNTNNCTTTCGCNT 
TAAGAAGAGNCCCCCCGNGGGNGNCNCCC I I 1 1 1 1 1 I I GGAANCNCCCTTTTCCCCGGGA 
ATNAGGG I I I I I I I I I I I 1 1 I NAAAAANTCCCCCCCNNGGGGGGGNCCNNCNNTNTTTTT 
I I I I I I II I INGAANCCCCCCCCCCCCCNGGTTNNNNCNNTCCTTCTTTNACAAAAAAAA 
AAAAAATTNCCNCNCCCCCAAAAAAAAANNNN I I I I I I I I NTCCCCN NCCCCCN N NTTTT 
TANAAAAAAAAAAAAAAN I I I I I I I I I I II I I I 1 1 I I I » I N 
Sequence 2192 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCCGAGA 

GAAGCAGTAGTCAATAAAGAGAGTGCCGTATTTCGCAGATTGGAGCTGAGCTGTGGCTGC 

CAGAAGATAGCGAACGAATGGAAACTGAAAGTGGAAATCAGGAAAAGGTAATGGAAGAAG 
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AAAGCACTGAAAAGAAAAAAGAAGTTGAAAAAAAGAAACGGTCACGAGTAAACAGGTGCT 

TGCAGATATTGCTMGCAAGTGGACTTCTGGTTTGGGGATGCAAATCTTCACAAGGATAG 

ATTTCTTCGAGAACAGATAGAAAAATCTAGAGATGGATATGTTGATATATCACTACTTGT 

GTCTTTTAACAAAATGAAAAAATTGACTACTGATGGGAAGTTAATTGCCAGAGCATTGAG 

MGTTCAGCTGTTGTAGAGCTTGATTTGGAAGGCACCAGAATCCGGAGGAAAAAACCTCT 

GGGGGAAAGACCAAAGGA 

Sequence 2193 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGTCCATGGGTCCACAA 

AATCCTCTTCAGCTTCTGTGGCATCTGGGCCATGATTACTGGTAGGTGCTGGGTTCCCTG 

GAGGACAGTCAGCCTTGTMTCCTCCCCCGCGGCAGCTTTGTAGCTCATTTTTATGACAC 

CAACATCTTTACTGCAGAATCAATTTCATCCCAGGTTGTGGAAGACACTGCAGAGGTGGC 

CACTGG TCACGCTGGGCAGTTGAGCTGCTGAGAGTCCCCCGCGTACTATCTTCACTCTTT 

TTTTTTCAGAAGCCMTGTTCTCTAMTCTGCAGCTTCATTCCACAGCTTTACAGAATCA 

TAATCTCTTGMTATATTTCCAATGTTATTAAAAAATAAAAAATCATACAAGATATATTT 

AGCACATTAAAACTTAAGAGGTTACAGTATAACTGTCCAGACCTCCAGGTACCTGCCCGG 

GC 

Sequence 2194 

GTGGCGGCCGCCCGCCAGGTACATTTTTAAAGAGTTGTTTTTNGGCCGNGCGCAGTGGCT 

CATGCCTGTAATCCCAGCACTTNGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGANTT 

TGAGACCATNCTGNCTAACACAGTGAATCCCGTCTCTACTAAAAATACAAAAAATTANCC 

AGGCGTGGTGGCTGGCACXTGTAGTCCCAGCTACTTGGGAGGCTGANGCAGGANAATGGC 

GTGAACCTGGAAGGAAGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGG 

GCAACANAGCANGACTCCATCTNANANAAAAAAAAAAAAAANCAANGAGNNGTTTTCTNA 

TGTTCATTATAGGGCATTACAGTTACATNGNCCGAAGGTCTTACAACTAATCACTGGGTA 

GCAATANATGCTTCNGGCCCACATGATGCTGATTAGTTCNCANTTTTCATNCAGTTGACA 

ATATTAACCCCCATTNCTCCCTCCCTGCCCAAGGNTCATAAANTNGTGACTGCCTAACAA 

CCA AAATT NGGMGGCCCANTCTTNATTTTCANTCAGACTTCTTGGAACCTGAAAGATTA 

AACNTTTTGGCTMCCCTGGAMTATCTTTTTATCTCACTTATAGCNTTNAAGGCCATTG 

NATNAAMCNTGATTTCTTGGAMGGGCCATACCCATAAANTCMGGGGGGTTTTTCCTG 

AAAAGTAAANGNTTCCAAATNAANNCCAAACANTTTCAACCCCAAAAAAAAAAAAAAA 

Sequence 2195 

AGGTACACTGAACACCAAGGCTCTCACTCTTGAGTCTAGGGCACCATATATATAAAGGGA 

GTTCGGAACTTGTCAAGGCAATCCTATTGTTCGTAGTCAAATCACACTGAAATCCGGAGG 

CCTGGTGTGAGAACCTACNAGCGGACACCGGGCATTTAATATTTTTTGCACACCCACACA 

GCCAGGGCCAAAGTGGTCAAGGCACTCTCCTAACACAGACAGGATCTTCTGAGTTGCAAT 

TCTTTCTGMGGMCATTTTTCACTTGAMTTCCATCAGAAMTTTCCTGAGATCTTGTG 

AGCGCTGAGGTTTCTGGTTTCATCTAGATCCAAAAACATGTCCAAATCACATCCTAACTT 

CCCAAAAGTGTTGACTGAGGAGCCAAAGGGTCTGACTATGCAGTCTGGAAAATACCCCGG 

CGGCCATGTCTTCAATAAGAGAACAGGTGAGATATCGGAGCTTAAGTGTTCTCCTCTGTT 

AGCTGGAACTCCTTCAAGAAGAGTGTTCAAGCTTGATCCGTCTATACTTTCTGCATAACA 

AAGTAATTCAMMGCTGCTTGTTTGAACCGTGGCAACTGANTACTTGACCCGTACCTTG 

CCCCGGGCGGGCCCTNTAAGAACTAGTGGATCCCCCGGC 

Sequence 2196 

CGCCC GGGC AGGTACAGTGGTCCTTTTCAGAGTTGGACTTCTAGACTCACCTGTTCTCAC 

TCCCTGTTTTAATTCAACCCAGCCATGCAATGCCAAATAATAGAATTGCTCCCTACCAGC 

TGAACAGGGAGGAGTCTGTGCAGTCTGACACTTGTTGTTGAACATGGCTAAATACAATGG 

GTATCGCTGAGACTAAGTTGTAGAAATTAACAAATGTGCTGCTTGGTTAAAATGGCTACA 

CTCATCTGACTCATTCTTTATTCTATTTTAGTTGGTTTGTATCTTGCCTAAGGTGCGTAG 

TCCAACTCTTGGTATTACCCTCCTAATAGTCATACTAGTAGTCATACTCCCTGGTGTAGT 

GTATTCTCTAAAAGCTTTAAATGTCTGCATGCAGCCAGCCATCAAATAGTGAATGGTCTC 

TCTTTGGCTGGAATTACCAAAACTCAGAGAAAATGTGTCATNAG'GAGGAACATCATAACC 
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CCATGGANGGAAAAAAGCCCCCCAAATGGGTGGGAACTGGATAATAAGCACTNATGCTTT 
AAGAATTGGGCACACTTCTCACCTAAGGTGAGCGCATTGNGNCCAGGGGGTGCTTAAATG 
CTTACATACCTCCAACTGGAAATGGNTAAGGGAAGAAGATTGATNCCAATTTNAAAAAAA 
AATTT AAAAN CCC ANTTTN AAAAAAAAAA 
Sequence 2197 

CGCCCGGCAGGTACGCGGGGGTGGAGAGAGGCCTCTAGACTTCAGTTTCAGTTTCCTGGC 

TCTGGGCAGCAGCAAGAATTCCTCTGCCCCCCATCCTACCATTCACTGTCTTGCCGGCAG 

CCAGCTGAGAGCAATGGGAAATGGGAGTCCCAGCTGTCCTCGGTGCCTGCTCAGAAGCTG 

GGTTGGTTTATCCAGGAATACCTGAAGCCCTACGAAGAATGTCAGACACTGATCGACGAG 

ATGGTGAACACCATCTGTGACGTCCTGCAGGAACCCGAACAGTTCCCCCTGGTGCAGGGA 

GTGGCCATAGGTGGCTCCTATGGACGGAAAACAGTCTTAAGAGGCAACTCCGATGGTACC 

T 

Sequence 2198 

CCGGGCAGGTACCAAAATTGTAAGAAGAAGCTTGGGAAGCTGCCACCTCAGTATGCCCTG 

GAGCTCCTGACGGTCTATGCTTGGGAGCGAGGGAGCATGAAAACACATTTCAACACAGCC 

CAGGGATTTCGGACGGCTTGGAATTAGTCATAAACTACCAGCAACTCTGCATCTACTGGA 

CMAGTATTATGACTTTAAMACCCCATTATTGAAAAGTACACAGGAGGCAAAGTGTTTC 

ACATTATAGACTTCACTTCCAACTCCTTGGAATGTTCATTTCTTTGGCTTACAGGAGAGA 

CTAGACAGGAAGGCCAGGCAATGC7TAGGCAACTAAAATGAGGTTGGGGGTAATGCTAAC 

GTCACCCTCACAGGGATGGCCACGGGGACTGTTATTCGCMGCTGGTTTTCTAGGCCTGT 

TAGCTGGMGCATGGTGAGCACCAATTTCTGGGACGCTCAAGGCCGTGTCGGGGCTTCAA 

GTCATCTTCACCACACCANGTACCTTNGGNCGNTTCTANNAACTANTGGGATNCCCCCGG 

GCTGTAGGGAMTTTNANTTTNAAGCCTTATTCGATTACCCGTTGANCCTCTGGGGGGGG 

GGNC 

Sequence 2199 

CCGCGGTGGCGGCCCGAGGTACAAGATNGNCATCTCAGTAAAAGGTCTATTATCTAACTN 

GCCAAACTTGTTNACTGAGAGCCCTANGGAACTAAAACTGCCATAATGCCGTGCACAGCT 

TGAAAAGCAATTAGAGTMGCANGATTAGTTTTTCCTCCCTTNCAGNCTNAGNAGGCCTG 

GCTGAAGGCCCANGANGGAAGGAANTNTANNANCCANCANTAAAAATAGCNATATGCAAT 

NNNAAGAATGCCATCCCATGGAGCACACCA 

Sequence 2200 

AGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACMTTTTTAGAAT 

AGTTGATGTCTTTMGATGGATATTTTACAATTTCAGTAAAAAAAATACAACATGAAGCT 

GCTGCTGTCACAGATCACTGTAGTAAAAAGATATAAATGCAATACCATGTCGTAGAACAA 

TATATATATCCTCTGATATTTTACAMCTTTGTACTGTGTCCAACAGCTGMGGAATTTG 

AGGGGAAGACTTTAGTGTCAGTCACCAAAGAAGGCCTGGAACTTCCAGAGGATGAAGAAG 

AGAAAAAGAAGCAGGAAGAGAAAAAAACAAAGTTTGAGAACCTCTGCAAAATCATGAAAG 

ACATATTGGAGAAAAAAGTTGA 

Sequence 2201 

GCTCCACCGTTTrGGCNNCCGAGGCCCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTC 

CAGGATCAGAAGAGAAGCCATTTTGACATCCTCGATAAAACTGGGGATNNGCTGNANTTC 

TGTCTGTTCACGMGTGGTTGAAAAACAATTTCGAGATCCAGAAGTCTTGATGGGTTCAC 

CATCCAGGTGTTCACAAAAAATCAGAGAATCTCTTTCGAGGTGCTGGCCGCCTTCAACGC 

TCTGAGTAAGCATTGCTGGGTGTCAGGAGAGAAAAGCCAAAGAAGCGGGTGCCAGACAGC 

TCTGTGCAACCTCTAGGCCATGAGTGGGATAGATACCACTGCTGCTTTAAAAAATGGGAG 

ACCATAGACCCTCAGGAGAAAAGAATCCCTTCTACCCTGGACTCGCTCTCTTCTCTGGAA 

CTAACTTCTCCCCCATACCCTGATTGTCTTTGGAGAAAATGTTCTGGATTCTAGAATCTA 

AGGCAGAGCCTTTTAAGCCATACTGTACCTGCCCG 

Sequence 2202 

AGGTACAGACAGGGTTTCTTCATGTTGGTCAGGCTGGACTCGAACTCCTGGTCTCAACTG 
CCTCAGCCTCCCAAAGTGCTGGATTATAGGCATGAGATACCGTGCCTGGCCTCCATCACT 



# 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

367/467 



GATTCTATGTATGCACATGACTCCTTTTTTACTTTTACTTAGGAGGCTTAAMCACAAT^ 
ATTCA TTCAATATMTTATAMGCAGAGTCTTTGTTMTTTCTGAACTTGGTAAAAACAT 
TTTTTATATTTTATCTATAGCTCATTMTTTAGGTATCAA 

AGCCCAAGCTGCCMTTGTTTGATATCATCATCCTCTTCTTCTGCCCTCTGGGAAGATGC 

TGCTCGGGATCGAC 

Sequence 2203 

CCGGGCAGGTACTCCAAAGATTCAGGTTTACTCACGTCATCCAGCAGAGAATGGAAAGTC 

AMTTTCCTGMTTGCTATGTGTCTGGGTTTCATCCATCCGACATTGAAGTTGACTTACT 

GAAGAATGGAGAGAGAATTGAAAAAGTGGAGCATTCAGACTTGTCTTTCAGCAAGGACTG 

GTCTTTCTATCTCTTGTACACTGTATGCATCATCAGTCCAAAGTTTTTCAGAGTTCTATG 

TGACTGGTTGAGGAAACACTGCTTGAATTTCCCCATAGCCTCTGCAGCGGCATCACTGTC 

TATGMCTCTTGAAGAAGCTCTTTGTATTCAGGTATAGATATGTCGGAATTGATGGTCTT 

TTCAACCATGTCCTCCAGGAGTTTGCAGCCAGAATCTGCATAGCAGTGCAGGAGGAGGGC 

CGCCAGCATGAGGACCATCAGCAGCTTCATGGCGAGGCGGCTGCTGTCTGTGTTCAGTCG 

TGTGTGGCAGGGATCAAGGAAGTTGCTGTCCCCGCGTACCTN 

Sequence 2204 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAGTGGTTGNCAGGATTCAA 

GCCGAATGGCTTGCCTNANAGGGNCAATGGCGTTCTGAGATGGTGGCCA 

Sequence 2205 

ACACTACTATAGGGTGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCTTGGTTTAT 

CAATGGGACGTTCCAGCAATCCACACAAGAGCTCTTTATCCCCAACATCACTGTGAATAA 

TAGCGGATCCTATATGTGCCAAGCCCATAACTCAGCCACTGGCCTCAATAGGACCACAGT 

CACGATGATCACAGTCTCTGGAAGTGCTCCTGTCCTCTCAGCTGTGGCCACCGTCGGCAT 

CACGATTGGAGTGCTGGCCAGGGTGGCTCTGATATAGCAGCCCTGGTGTATTTTCGATAT 

TTCAGGAAGACTGGCAGATTGGACCAGACCCTGAATTCTTCTAGCTCCTNCAATCCCATT 

TTATCCCATGGMCNCTTAAMCMNGGGTTGGTTTTGGTTCCTGAAGCCC I I I I I I GCT 

GGGGAATGGGGCACCTTAATGGAAAATTTAAAGGGGAAAANCCCTCAGGCCTGNNGGGGG 

NGGGGCCCCTTCACAGCCTTTTNCTTAACTNAGAAANCGGGGAAAATTGNAAACCATTGG 

GGGAGMAATGGACNACTTCNCACTNTTGGACNGGTTTTTTTCCAAAANTGTNAAAACAA 

GAATTCTTTATTMTGGAAANGGGGTTTTACCCCCCTTTTAAATT 

Sequence 2206 

CCACTATAGGTTTATGGGAACTCCCCGCGGNGGCCTCACTTTTNACCTTCTGGAANCCGC 

CCACCACTTTCAAGCTCACTATTGAATCCACGCCGTTCAATGTCACAGAGGGGAAGGAGG 

TGCTTCTACTTGTCCACMTCTGCCCCAGCATCTTTTTGGCTACAGCTGGTACAGCTTTC 

TTCGTCCTCCATGCTAAGAGATGTAAAAGCTTAAGGGTCAAACAATACCAATTGTATAGG 

CTTCAMAACCATCTMGTTAGGGCATTCTNTAGTTTTAGCTAAGATACACCTGGAACAC 

TGACAAGTCATCACTTACATAGMTMTGTGMGTAMTTTTTTGAAAAATAMTTTTAG 

TGGAACAATCCTGAAGGATAACACCAGAAGAATAGCAGGTTACCAGTAAGGGGTCAGCCA 

ATTTGTTCCAGTCACTTTTGMTCCATGTTCTATMTCTAAAMTTATTCTCTTTCCCTA 

AGCTGAGAGCTTCCTATCATGTCAGTATCTATGNTATGAAG 

Sequence 2207 

GGCGCCGAAAGCGAAGAAGGAAGCTTCAGCCCCTCCAAAAGCTGAAGCCAAAGCGAAGGC 

TTTAAAGGCCAAGAAGGCAGNGGNGAAAGGGGNCCACAGCCACAAAAAGAAGAAGANCCG 

CACGNCACCCACCNTCCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnCTNGGGCGGAAAACAANGGCAAAAGCNGGNNCC 

CGGGGGGGAAAANGGGNAACCCGNNNCACAAAANCCACNACAACAAACCGAGCCCGGGAG 

CAAAAAAGNGGGAAAAGCCCCGGGGGGGGGCCCAAAGGAGGGGAGGCCAACCCCANAATA 

AAANGGGGGGGGNGCCCCACNGGCCCCCNTTCCAAACCGGGGNAAAAC 

Sequence 2208 

GGCGAATTGNNGCTCNTTCGCGGTGGCGGCCGAGGTACTGAGAGAGGTGGTGGACATACT 
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TACAGAGCAACTGCGTGACATTCAGCTGGAGAATTACACACCCAAGGAACCCCTCACCCT 

GCAGGCCAGGATGCCTTGTGAGCAGAAAGCTGAAGGACACAGCAGTGGATCTTGGCAGTT 

CAGTTTCGGTGGGCAGATCTTCCTCCTCTTTGACTCAGAGMGAGAATGTGGACAACGGT 

TCATCCTGGAGCCAGAAAGATGAAAGAAAAGTGGGAGAATGACAAGGTTGTGGCCATGTC 

CTTCATT 

Sequence 2209 

AMTCCCGCGGCGGCCGNTCTAGMTTAGGGGANCNCNTTTNACCNCAGGNNTTTTGAAC 

Cnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGNAAAGAGGGNNCCNGGGNGAAAA 

NGGGGAACCGCGCACAAANCCACACAACAAACCGAGCCGGGAGCAAAAAGGGGAAAGCCC 

CGGGGGGCCNAAAGNGGGGNGCNAACCCAAAANAAAANGNGGNGGGGCNNAAAGGGCCCG 

GNTTNCCAGGGCNGGGGAAAAACCCGGNCGGGGCCAACNNGGNAAAAAAAAGGAAAANGG 

GGNCCAACCCCCCCCNAAAAAAAGGGCCCGGNNNNGGGGGAAAAAGGGGGGGNGGCCNNN 

NACCGGGNNANCCCCGGGGNAAAAAAAAANAAANNNNNNGGGGGCCCCGGGGGGGGGNNN 

NGNNNGGGGGGGGGANNGGGGGGGGNNNAANGNNNAAANCCNAAAANGGGGGGGGGGAAA 

AAGGGGGGGNNNCCNCAAAAAAAAANANGGGGGGGAAAACCCCGGGGGNAAAAAAAAAAG 

GGGGGGGAAAAAGGGCCCCCCNAAANGGGGGGGGGAACCCCNAAAAAAGGGGGCNNNNNN 

GGGGGGGGGNNNNAAAAAAGGGGNCNNCCCCCCCGGGGGGGGG 

Sequence 2210 

AANCCOCGNGGGGGCGGCCGNGGNNCCCCCGACGGNNNCAACCCCGNNGTANAAAACCCC 

CAGGAACAGAAGAGAAGGGGNCGGGNCAAGNNCGAAANCGNGGGGGAAAAGCAGAAGACC 

CGACNGNCCACGAAGNGGCGCGAAAAACAAAANCGAGAACCAGAAGGCCCGGGAGGGGAA 

CACCANCCAGGGGNCCACAAAAAAACAGAGAAGCCCNNGCGAGGGGCNGGCCGCCNGCAA 

CGCGCNGAGCNNAAAGGAAAACCCCAGCCCCCGGAACCANCGAGAGCNCAAAAGANCCNG 

GGAAAANACAAAGGCCAGNCCNGGGGAGAANNGCAGNCCGCANCACNGAACCCCAGCAGA 

NAGGGNGNNNGGACAACCGGGCCCGGAAAAANA 

Sequence 221 1 

TACTATAGGGC GMTT GGAGCTCNCCGCG GNGGCGGCCGCCCGGGCAGGGNACTTTTTTT 

I I 1 1 M I I I 1 1 IGGNGG) I I < 1 1 I I I I I I I I 1 1 1 ) I 1 1 M I I I I 1 1 I I » 1 1 1 I I I 1 1 1 I | 

I M I I I I I M I I I I I 1 1 I 1 1 M I I II I I I I 1 1 1 1 NAAAAAAANAGCCAATTTTTNNAAAN 

NNAAANNTNCNNGGNTAMNGGNNNNNGNNAAAAATTTGGGGGGNAAAAGGGNTNNTTTfSl 

ANCCCACTGNGNGGGGCANCCCTTANAANNNGGGGGAANNCCCCNGGNAAAAAANGGGGC 

NGGGGNCCCNNNTNNGNANNNTTCCCNGGGGGGGGGCCCCANNAAAANNNNNNNCCCCGG 

GGNGGGGGGNNCCCCANTTTNNCCNAAAAANCTANGTCCCCGGGNCCNNNAAAAAAAAGG 

GGGC 

Sequence 2212 

CCGGGCAGGTACCCGGGTGGTTAAACTTCGCAAAATGCCTAGATATTATCCTACTGAAGA 

TGTGCCTCGAAAGCTGTTGAGCCACGGCAAAAAACCCTTCAGTCAGCACCGTGATGAAAA 

CTTGCGGAGCCAGCATTTACCCCCGGGACCATTCTGATCATCCTCACTTGGACGCCACAA 

GGGGCMGGAGGGTGGTTTTCCTGAAGCAGCTTGGCTAGTTGGNCTTATTTAACTNTGTN 

GACCTGG ACCCT NTTGGTTCCCTCAATTCNGAGGTTCCCTCTTACCGAAGNAACACACCC 

ANGAAAAATTTTGGTCCATTTGCNCACCTTTTCAACCCAMNMTTCGGATNTTCAAGGC 

AMTTGTTANAMTAMTCCCCCAAAAMCAATCNTTAACNTGGMTGGCCTTTACTTTT 

CA 

Sequence 2213 

CCGC GGTG GCGGCCGCCCGGGCAGGTAC I N II I I II I I ) I I I ) I I } I I I I NACCGGAAA 
TTGTCTTTTATTTTTCTTTGCTGGGGTTCAAGGTCATTTNTTTAATCAAACCTC^ 
GGATATAGAGTCATCAAAAGAAAMNCATTCTTCTTCCTCTCTGTTTCCCTAAAAANAAA 
ACAGTCACAGGAGACTTTCCTTTTTT CTTAAT^ 

GCTGCTGTTATGTGTTTATGTTTCCC 1 I I 1 1 I I I I AAGTTTTGAGGGTAGATCGNGTCAC 
TGGGCGTAAGANACTGAGAAATCAGGTAGCTTCACTGGCTGGTAGGGGACTTTTGCTTTT 
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GAAACGTCACAGGCTGTCTTCCTTTGGATGTAATGGGACGTCCTGATGACCCA 
Sequence 2214 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI I 1 1 I I I 1 1 I I 
I I I I 1 1 1 I M I I I I I 1 1 I NATTAGGGTTGAAAAGGTAAGTCCTATATTANATACATAGGG 
GCTGGTAGTCTTATCCTCTNTATAAATACNCAC I I I 1 1 GNCACTGCCCCTTCCCTGCCAT 
GCAGAGTCCCCATGGCTGAATCTCCCTATGTCTCCAAGNGNGCGTCACTTGCTGTCCTNT 
GGGCTGGGTCCCATGNGCTGNGCCCACAGTCTT , 
Sequence 2215 

CCGGGCAGGTACMTACACTAGAAACCAACATAATGTATTTNTTTAAAACCTGTGTGAAA 

AAATAAATGTTCCACCAGTAGGGATAGGGGAAAAGTAACCNAAAGAGAGAANNGGAGAAA 

GGGMTGCTGGTTTATCTTTTGTANGATTGTAATTCGAATGNGAGAMTTTTGCAGTATT 

TTAGCCACTTATTAAGGGAA I I I! I I I I I I I GTAAAANTGGAAGACCTGANACNTCTGTT 

CAAAATGGCTTTTCANTGGAACCCTTGNGTNTTGGAGACCGGTTAGNGAAAAGGCANACA 

AAAACCGTGGGTAACCCTGGTGGACCTAAAGGGGCNCTGNGTGCCAAGGGNACCTTGGGG 

NAAATTGGTCCATTNGATTANTAGGAATGGGTGGGGGG 1 1 1 I 1 1 I CCCCCCCCTTTTTNA 

GNAMNTGGTNTTGNGGATTATTTTAAAGNTGGANTTTTAAAAAACACCNI I I II < I 1 1 A 

AACTTCCCGGAAA 

Sequence 2216 

CCGGGCAGGTAC I I I II I I I I I I I I I I I I I I I II I I I I I I ATACATTAAATATTTATTAA 

T GTTT TGGGAATGTTGTGTATCAGTAATTATACCTAGTAAAAAGiAAAGATCCACATCCAA 

ATTTTCCAGAAAACCGGGTGGCCAGGTNNGGGNGGAAGAAGAGCATTTTAAAAACCAAAA 

CCAAMCAAGGGAAMGCTAAAAAGGAGTTTNTTCCAAATANTCCTGAATTCATTAGGAG 

GGTTGGAATTNACATTTTTTCAGGNCTNTATTCCCCCCAAAGTTGAAAI I I 1 1 I AAGGGG 

ACCTTGGANAMGNCTTTATTACCCCCATNTGAAAAACAATGCCTGCCAATGGATTTACC 

CTNTTTTTCACGGTTCTTCCAGGGTTCCCAANTNACCGNTTGTCCTGGTNMTCCCTTTT 

TGGGCCCI I I I I I I 1 1 I I I 

Sequence 221 7 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGAGCTGTGTGCATGCT 

GAGCAACACCACAGCCATTGCTGAGGCCTGGGCTCGCCTGGACCACAAGTTTGACCTGAT 

GTATGCCAAGCGTGCCTTTGTTCACTGGTACGGGAAGGATTCCTGCCAGGGTGATTCTGG 

GGGTCCGCTGGTATGTGGAGACCACCTCCGAGGCCTTGTGTCATGGGGTAACATCCCCTG 

TGGATCAAAGGAGAAGCCAGGAGTCTACACCAACGTCTGCAGATACACGAACTGGATCCA 

AAAAACCATTCAGGCCAAGTGACCCTGACATGTGACATCTACCTCCCGACCTACCACCCC 

ACTGGCTGGTTCCAGGAACGTCTCTCACCTAGACCTTGCCTCCCCTCCTCTCCTGCCCAG 

CTCTGACCCTGATGCTTAATAAACGCAGCNACGTGGAGGGTCCTGATTCTCCCTTGGGT 

Sequence 2218 

CCGCGGTGGCGGCCGCCCGGGCAAGGTAC I I 1 1 I I I I I 1 1 < I I I U I I I I GGGATGGAGT 

CTTGCTCTGTTGCCCAGGCTGGAGTGCAGGGGCGCAATCTTGGCTCACTGCAACCTCTTC 

CTTCCAGGTTCACGCCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTACAGGTGCC 

AGCCACCACGCCTGGCTAA I I I I I I GTATTTTTAGTAGAGACGGGATTTCACTGTGTTAG 

CCAGGATGGTCTCAAACTCCAGACCTCNTGATCCGNCCACCTTGGCCTCCAAAAGNGCTG 

GGGATTACANGGCATGAGCCANTGGGCCCCGGCCAAAAMNMANTTTTTTAAAAANGGC 

NCCNNGCCCCTTTTAAAAATTGGGGNGNCCCCNNGGGGGNGGGGGAAAATNNAAAAAAAA 

TTTNTNNCCCCCCCCC 

Sequence 2219 

GGCGGCCGAGGTCGGCCCACTGGGACTGAGATACGGCCCAGACTCCTACGGGAGGCACGC 

ATGTAAGGAATTTTCCACAATGAGCGAAAGCTTGATGGAGCGACACAGCGTGCAGGATGA 

AGTTCTTCGGAATGTAAACTGCTGTTATAAGGGAAAAAAAAAAAAAAAAAAAAAAAAAAN 

GTACCTGCCCG 

Sequence 2220 

CGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTT 
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TTNTNNTTTNTTN I I 1 1 I I I I I I I I CCCGGCCATATTCCATTNTTTNANAACTTGAATTG 
CNCTGCCATCCACAMGGCTNGCNCGGTTTTATCCACATTAAAATCAAACTGTNGGAGCN 
CCAGGACTATTTCATTATTGCTTTTGTNGGGAACMCTGATCTMCTGCATAMTCTTTT 
TCCTTG^CATTCACATGAGTNTTGGNGTTTCANCCCTTTTTGCTT^ 
NAATTTCNG GAAMNANG GGGGANCCTTTTNCCTTTTTACCCCAAAANTCCCNAANGGNA 
AAANCCTTN 1 11 11 I 1 1 1 I AMNTTTTTTTCCCCCNNNTNCCCAAAAAMNNGNTTTTTT 
Tl I n I I I I I I I I NTTGTGNGGGGGGNGGGGNGNGGGNANAAMANANNCNNNTTTTTTN 
NNNNNNTN I I 1 1 I I I I I I ANAAAAAAAAAAAAAAAAAAAAAAAAAAANNNGGGNGNNNTT 
TTTNNNNNANAAAATNNATNNNNN I I 11 I 1 1 1 1 I 1 I I 1 11 1 1 
Sequence 2221 

CACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCTTTGT 

TTTGGCACACTTTTCCTGACAAACAGCCAGTGTTCTCAACACATAAATACTAGTCCACGT 

TAACAACAATAGCATATGAGACCGCTCTCCGTAAAGATGCCAGATTGGATGCAAATGGAC 

TGGAMTACCTTGGAGGGTTTCACAAAAATAAGACAAAGGGCAAAGGAACTTTGCCAAAG 

GAGATGGAGAGCAATTCTTTAAAGTTAGTGGGAGGGAGGAAGCAAAGAGCTCATAAATAC 

AAGCCTNTTAAAATGGGACGCATTTGCCTCGCGCCTACTGGGGTGTCTGCAGCTCAGCTT 

GGTGCCCCACACAGGACACCG 

Sequence 2222 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCCGGAAGTCTCTACTGAGGAAAGC 

TATGAGGATACTCTGTTCGTAAGCTCCCGGTGAATTTTGTTCCACAGACTCGGAAGAAAG 

GTTGGATAAGAGTTCACTGGAGATTGACAAGTACCTGGAGGAGCCCCTCACCCAAGTGCG 

CATCTTAACGGCATTGGTGGAAGCTGGGGTCAGAAAAGAGAAATGACCATTTGGAGGGGC 

GGGGCCTCCTAGMGAACCTTCTTAGACMTGGGGGGAGGATGGGACTTTGTTTTTTCCA 

AGAATAAACTTCAACTCCTGT 

Sequence 2223 

CGAGGTACTTCCCACCACAGTGTTGAAAGGGAGAGCAAAGTCTTATGGATAAACCCTCCT 

TTCTTTTGGGGACACATGGCTCTCACTTGAGAAGCTCACCTGTGCTGAATGTCCACATGG 

TCACTAAACATGTTATCCTTAAACCCCCGATGCCTGAGTTGAAAGGGCTCTCTCTTATTA 

GGTTTTCATGGGAACATGAGGCAGCAAATCTATTGCTAAGACTTTACCAGGCTCAAATCA 

TCTGAGGCTGATAGATATTTGACTTGGTAAGACTTAAGTAAGGCTCTGGCTCCCAGGGGC 

ATMGCAACAGTTTCTTGMTGTGCCATCTGAGMGGGAGACCCAGGTTATGAAGTTTTC 

CTTTGAACACATTGGTCTTTTCTCAMGTTCCTGCCTTGCTAGACTGTTAGCTCTTTGAG 

GACAGGGACTATGTCTTATCMTCACTATTATTTTCCTGTTACCTAGCATGGGACAAGTA 

CCACCTGCTGGGGATCACTCTCTAGGATCACCCCTCCATCCACAATTTCATCCACAAGCC 

AAGAACAAGCCCCTTCATGTTCTCCAGCAGTGCTCGCTTTTCTAC 

Sequence 2224 

TCACTNTAGGGCG AATTGGAGCTCCCCGNGGTGGCGGCCCGCCCGGGCAGGNACTTTNTT 
I M I I I M I I M I I INCI I I I I I I I I I I I I I 1 1 I I I I I I IGGCCI I 1 1 1 1 AATATCTTTA 
TTTGACAAAATAAAAGTCAGCAACCTATCTCG ATTTCCAA I I 1 1 I I GTGGTGTTTGAAAT 
TCCAATTGANACCTAAAGCATAGCTCTGGCCTTGGAGAGATTTCCAGGAGAGTCANAGCC 
CANAAGGGAGCAGGATCCAGGAGGCCCTCATNTCCCAGCACTCCAGCTGAGCCAGCCGGG 
TTATGGAACATCACTGAGCMTTAAMTATTATCMCAGACAAAAAAAGTTTATTGAATA 
CAAAACTCAAAGGCATCAACAGTCCTGGGCCCAAGAGATCCATGGCAGGA 
Sequence 2225 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGGGCTCTTTCAGGTCCCCCT 

TCCTCTGCAGACTCAGCCTCCAGGCTGTTTAGCTCTCCCAGATAGCTGCCCCTGCCACGC 

AGGCCAGGAGTCTTCACACTCAGGCACCAGGCCTGGTCCAGGAGGAGCTGTGGCACAGTC 

GTGGTTCAAGTGTCCACATGCACCTGTTAGTCCCTGAGAGGTGGTGGGAATGGCTGCTTC 

ATTCCTCGAGGATGCCCGGGCCCCACCTGGGCTTGTCTTTCTGTTTAGAGGGAAGTGTAA 

CATATCTGCCATGAGGAACATAAATTCATGTAMGCCATTTTCTCTTAAACAAAACAAAA 

CTTTCTAAGTACMTTTGTTACAAATAACGCAGACTTCAAAAACAAAAAAATNACAACCC 
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AAACAAACCAAAATTTAAATGATCAGAATTGGCAAGCACAAAGAAAACGCCCTNTCCTGA 
CTTNTATTGTGGGCAGTTCTGMCGCCCCCAGAAMTTTGTGCCAAAGAGTTTTAGAAAA 
NTAAATATTCCAATAAAAGTAAACACTATACNCCACCAAAACAGGC 
Sequence 2226 

GGCGAAT TNGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACI M I 1 1 M I I I I I I I I 
ITTTTn IC TMCT TTATATATMTACTTTATTTTATACAATTCAGTATATCATGTTAAC 
ATATTTCACTTTTMTTTTATGATATGTGTGACATAT^ 
TA CTATCATM TTTTTTTAAAGTTTGTATCTTTC 

GAM 1 1 I I I ICCTGTAGCTTNTGAGMCACATTTTTAGATNCCCGGCTTNTAGTTATACC 
TGAAGCTCCACAGTGTANACATGTTTTGGCCAACCTTGTTTTATCGGNGTATGAAATTTG 
NGCTAATAGGAGGGATCCAATTGTCATTACATTAGAAAACTAAATGGGAAA 
Sequence 2227 

CCGGGCAGGTACGCGGGGCCCTTCTAGAGGCAGGCAGAGGGAAGAGAAAGGGTCTGTTGT 

TTTTCTCTCCTGTTTCTCGCTCCCTCTCTGCTGATCACAAAGCTGCTGACCGGGTCAGAA 

AGTCCTGATGGAAATCCACCAGCGCTGGGCAGGCCCCTCCTCCTCCAGGGAGCTTGTCCT 

TGCCTAATTTTTCTTCGTCCTGATGAGAACAAAAAAGAGAGAGAGAAGAAAAGAAAAACC 

ACAAACTTCCTTTGAAAACCAGCTTGTAGTCAGGGCCCGGAGCGCATGCCATAGACTCGG 

CGACTCAGGAATCCTGAAGACTCTCTGAGCGACCTGGAGCACCTTGGGCTGTGTCCCTGC 

CTGCCTTCACCCTCCTCCAGTGCCCCCAGTACTMGGAATCTTTCTGTTTTGGGGTT 

Sequence 2228 

CCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGACACAAGACATCATCTTGAAGGAAGG 

ATGGCTTTGGCCAGACCAAGACCAAGACTTGGAGACCTGATTGAGATTTCTCGCTTTGGC 

TATGCACACTGGGCCATCTACGTGGGAGATGGCTATGTGGTCCATCTGGCTCCGGCAAGT 

GAAATTGCTGGAGCTGGTGCGGCCAGTGTCCTGTCTGCCCTGACCAACAAAGCCATAGTG 

AAGAAGGAACTGCTGTCTGTGGTGGCTGGGGGAGACAACTACAGGGTCAATAACAAGCAC 

GATGACAGATACACACCACTGCCTTCCAACAAAATCGTCAAGCGGGCAGAGGAGTTGGTG 

GGGCAGGGAGTTGCCTTATTC 

Sequence 2229 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACTANAAGATATT 
GATCCTAGTCAATTAGGCATTGTAGACTGTNATGACCACTTAATAAAAAATTNTGGACCT 
GANGCTCACGAGCATCCAG 
Sequence 2230 

NTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNGGTACCTGCAGNNCTNNTACACCT 

ACCT CTCTNTGGGCTTNTATTTCGACCGNGATGATGTGGCTCTGGAAGGCGTGAGCCACT 

TTTTCCGCGAATTGNNCGAGGAGAAGCCGCAGAGGGCTACGAGCAGTNTCCTGAAGATGC 

AAAACCAGCAGTGGC 

Sequence 2231 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGTTTCAGAGTTTC 

ATAAGACACAGTCTCMCAGGAGTTTTTTTAATAAATGCTCTATCTCTAGTTCCAGAAAC 

TGAATTCCAAGAAGCTACACTGAGGATAATTCAGCTCTGATATTGTGATTACTGTGATGT 

TCTTTCATTCATACAGTAAGTATCTGCCCAATACGTAACTACCGAGATCTATTGCTTCCT 

ACATAATTAGACAAGCTCCTTACACATATGGGCCCATGCCCGGCACATAGCAGGCACTTA 

ACAAGG GGTTG CTTMCTACNGGMGGMTMTATMTTNGCCTTTCCNTTTTTAACTGN 

TTTACCCTTTTTCTATACNTTGNTATATTTTGGAAMCNACATGCTTGCAAAACTAAAAA 

TCTMCATGCATTACTAACTTTATAAAAGATCCTTCAGTATTTTTCAAAAAGGGAAAAAA 

TNATTAAAACCAATCCCCCAAAT 

Sequence 2232 

TACTTAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCAGAGCTTAAGAGACCCTTCAG 
CTTCTCGGTGTAGGTCACCAGTCATGTCACGCTCACAACCCTTTCGCCTCCCTCTAGGTC 
AGGCACCGCATCCAAGTTAAGCAACCGACGCCGGCTCCTTGGTGTCCCAACAAGGTCTAG 
TAGAAAAGCAACTTTATAATTCGTTTCACAGATTTGATTAAGAGCCAACTGAGTAGTGGG 
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CCTTGTGCTAGGGCTCCGAGGGCCCAAGGACCCCATGGAACCAGATTTAGTCGCTGACCT 

CTAGGAGCTCACAGGTGAGTGACTCCCCCCGCGTACCTCGGCCGCTCTAGAACTAGGTGG 

GATCCCCCGGGGCTGCAGGAATTCCGATATCAAGCTTATCGAATACCCGTCGACCTTCGA 

GGGGGGGGCCCCGGTACCCAAGCTTTT 

Sequence 2233 

CCGGGCAGGTACCGNTGTGTCCGGGTGGGTGGTCAGAATGCCGNGCTCCAGGTGTTCACA 

GCTGCTTCGTGGAAGACCATGTGCTCCGATGACTGGAAGGGTCACTACGCAAATGTTGCC 

TGTGCCCAACTGNGTTTCCCAAGCTATGTGAGTTCANATAACCTCANAGTGAGCTCGCTG 

GAGGGGCAGTTCCGGGAGGAGTTTGTGTCCATCGATCACCTCTTGCCAGATGACAAGGTG 

ACT 

Sequence 2234 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATTGGCACGTCA 

CGATGTCTTGAGTTTCATTCACTAGGTGGCAGCCTGCATCGTTCCACTGCAAATGACTGA 

AATCCCAAAACACACAATGAGGCTGGCTCAGGTTTGACTCTATCTTGGAAAAAAATAGGA 

AAACTTCATTTATGGAATAGTTTTGAATAACCGTGGATATCACAGGTCCATTGACCTGAG 

CATTTCCATTTTTGGAAACGGGTAG 

Sequence 2235 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGTCATCACACGTGACAGGATGTCA 

ATGACMTCCTCCAGAGTTTACTGCCATGACGTTTTATGGTGAAGTTCCTGAGAACAGGG 

TAGACATCATAGTAGCTAATCTAACTGCGACCGATAAGGATCAACCCCATACACCAGCCT 

GGAACGCAGTGTACCT 

Sequence 2236 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATATGAAGCT 

GTGCTTTGCAGCAGGCCMCTTGTAAAGCTGTTCTCAACCCACTTTGTCAGGTTGATTAT 

CGAGCAAAACTTTGGGCCTGTMTTTTCTGTTTTCAMGAMTCAGTTTCCTCCAGCTTA 

TGGAGGCATATCTGAGGTGMTCMCCTGCCGMTTGATGCCCCAGTTTTCTACAATTGA 

GTACGCGGNGACAGCGGNTTCCTTGATCCTTGCCACCCGCG 

Sequence 2237 

GGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCGGGTACNCNGGGCCCTGTGCTG 

TCTGCACCGAG.GAGAGCGGCCTGNCGGAAGNGGGCCACCATATCTGGAAACTACAGTCTA 

TG NTTN GAAGCGCANAAGGGAATAAACANTTAANGACTCCCCCNGGGACCTGNAGGATGG 

NCTTTTCCATGGGGGCCGGAGCNGCAGCTTACMTGNAAMTNACANACTGGNGCTNTTG 

GAGMAACTATANTTGGCATAATTCCCATTAACCACNATGACTTCAAAATTTTAA 

Sequence 2238 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCCTGA 
AATGGCTTCTGCAGAGATGGACCTATGCCGGGGACAGCACACCTGGACTCAAAAGGCAGG 
G ATTGG ATAGGAAGAGGAATAAAATATAAAAATCAGAGAACTGCTGAAATTCTGTGACCC 
C I I I 1 1 AGTTAAAAAAAAAAAAAAAANGAAATAAAAGTACCT 
Sequence 2239 

GGTACATTTTTAMGAGGTTGTTTTTTGGCCGGGCGCAGTGGCTCATGCCTGTAATCCCA 

GCACTTTGGGAGGCCGAGGTGGGCGGATCACGAGGTCTGGAGTTTGAGACCATCCTGGCT 

AACACAGTGAAATCCCGTCTCTACTAAAAATACAAAAAATTAGCCAGGCGTGGTGGCTGG 

CACCTGTAGTCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATGGCGTGAACCTGGAAGGA 

AGAGGTTGCAGTGAGCCAAGATTGCGCCCCTGCACTCCAGCCTGGGCAACAGAGCAAGAC 

TCCATCCCAAAAAAAAAAANTTNAAAAAAAANNNNGGTACCTTGCCCGGGGCGGCCGCTC 

TANAMCTMGGNGGGGATCCCCCCGGGCTTGNMGGGMTTTCNATATTCAAAGCTTTA 

TTCGAATNCCGTTCGACCTTCGAGGGGGGGGGG 

Sequence 2240 

CGGTGGCGGCCGCCCGGGCAGGTACNTGGGGAAGGCGCNNCGTGCGTCAGCTTGACAGCA 

GAAGCAGGAGGAGAGCTGGCGGGAAGACATGCACCCCTTGAAGACCCAGAGAGAGGCCGT 

CTGTCTACCGCGTAGCAGTTACATCAGACTGAGACACTTCCTGTTTACAGGAGACTATAA 
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AATTCCTGCCCCGTGCTCATTTGGGGCTGACGCCATTTTAGACCTCAGCCCATCTGCACC 

CAGGCGCTCACTGAAACAGTGTGTTGCTCCACACCGCCTTGTTTTGCTTGNTGGCGCGCT 

CTCAGG GTTCC GACCMTCCMGAGCCTTGCAGAMGCATTAACGTGCTTTTNTCTTTGG 

CAAGAGTTTTTCTTTGCTCTGATCTTGGAGACCATCCCTCTGCCTAGGGGGAAAACATAN 

GGGAATACAGA 

Sequence 2241 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACACCATGTTAAGA 

GATAGCAATGCAACCGCAGCCAAGATGTCTTTAGATGTAATGATTGAACTCTACAGAAGG 

AACATCTGGAATGATGCAAAAACTGTCAATGTTATCACAACTGCATGTTTCTCTAAGGTC 

ACCMGATATTAGTTGCCGCTTTGACATTCTTTCTTGGGAAAGATGAAGATGAAAAACAG 

GACAGTGACTCCGAATCTGAGGATGATGGACCAACAGCAAGAGACCTGCTAGTACCCTCT 

CTTCCAGCACCCAGGCCAGTATTGAGATCGATTCTCTCTATGAAGGAATCGACTTCTATA 

CCTCCATTACCCGTGCCCGATTTGAAGAACTGAATGCTGACCTGTTTCCGTGGCACCCTG 

GACCCAGTAGA 

Sequence 2242 

TAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTGGCGTGGGCC 

TGCTTTCCCGCCAGTGCCCACTCAAAGTTTCTTACGGCATAGGCCGATGAGGAGCATGAT 

CAGGAAGGCCGGGTGATTGTGGCTGAATTTGACTCGTTTGTGCTGGTAACAGCATATGTA 

CCCTTGTCCTCTTCTTCAGTGACTTAAAACAATTCCAGGATCAGAAGAGAAGCCAACGTG 

ACATCCTCGATAAAACTGGGGATAAGCTGAAGTTCTGTCTGTTCACCGAAGTGGTTGAAA 

AACAATTTCGAGATCCAGAAGTCCCTTGATGGGTTCACCATCAGGTGTTCACAAAAA 

Sequence 2243 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCAGGTACCTTCACCTG 
CTTTGGAGAAATCCTGAATGTGTCCCCAGAAATGAGACGAGAAGTCTTCTCCTGATCAAA 
GGTCTTTTCTTGATTGAAGGGTTCGTGGTTTCACAGGCTTCAAGGAAAGAAGCCATGGAC 
CTCAGTGGTGAGTGTTACAGNTCCATTAGAGAAACATGCAGACCCCCCCGCGTACCTN 
Sequence 2244 

GCCGCCGGGCAGGTACTTTCCCTTCCCTAGAAAAGCNNGACTNGNCGCTAACGGTGAAGG 

ACCANGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAGA 

GGTGTCCCTGNAATGATAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGNGACCAAGCGC 

CAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAANGGA 

ATTTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCTCG 

GCCG 

Sequence 2245 

AGGTACGCTTCGACCCACGCGTCCGAGAAAAGACTATTAATTATTTGTGCCTTTAAAGGT 

TAATTATTTCAACCTTAAAAGAATCTTATAAGAACCCATTAATAAAAAAGAATGCAAATA 

TATTGCCCAAAAAAATACACAAATTATTGTCTTTCATATAGGGGAGGAAATAAATATGTT 

TCATCTGAATTGTCTGAAGTAAGTCATGAATGGAGTCAGGGTAAAAAAAAAAAAAAAAAA 

AGTGCGGCCACCTGCCCG 

Sequence 2246 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGCGAATACTGCATACTGTTTAAGGTAGTA 

TATMGTTTATGAGAGAAGTGGAGAGCTTTCTTCCTTGAAMGTCGGTATTTGTTGAGAT 

ACCATTTGCCTCACAGAGAGGTGTTCCCCACTCCCATCCCCATTGCCAGATAATAAATAT 

TTTGAGAAAAGTGACCTAAAACAGCTGAAATCTTAGGTGCATCTGTCTGCAGACCTCCTT 

MGCAGGCTGTATCTTACAATTCCCTTACTGCACTGGGTMGTGTTMCTTAGTTTTTGT 

TGTTTGCTCTTTTGCTTTAAATATTCTCCAAATTACACCT 

Sequence 2247 

AGGTGTACAAGCTTCNACCCACGCGTCCGAAAGATTCTTGGNTGAGCATGGTGGCTCATG 
TCTGTCATCTCANCAATTTGGGAGGCAAGGGCAGGAGGATAGTTTGAGCCCATGAGTTTG 
AGACCAGCCTGGACAACATAGTGAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAGTGCGGCCGGCCGCAC I I I I I I I I I I I I I I I I M I GCTCAATANAAGTNTGG 
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MTMTTCCAGGTMTTTAMGCATNTTTTTNAATTGGTGTAAGCTGCTCCATGAAGTCA 

NCTNGCTCCTCTMTTGGGATGGTTNTTNATNACACGGCATGTTNTAAGAGTTTGATGAC 

AGGANCATNAGTGTNNGGNCCCAGNACCCCCTTCTGNGCGGACTTNTNGGCATCCTCCCT 

TCAMNCTCAATNCTTTTNANGGGGCCCTGAGGAAAGTNNTTTATTAAAAGGGNTNTTTG 

CAAA I I I I IG 

Sequence 2248 

AGGTCACI I I I 1 1 1 I I N I I I I 1 1 AAAATAATTCTTTTAATTATTGATGCTTTGAATAAN 

MGTCCATTTTACTAMTTTAGTATAAATTATCGGACGCGTGGGTCGAAGCTTGACCTGC 
CCG 

Sequence 2249 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTCACTGACACGTTTCTCTTGACA 

CTGAGATGGTCGCAAACAAAACACCGTTCTTGCCTGGATGAAGCAAGAGTTCACATAAAA 

GAGCTTTATAAAATGTCTATGAAGGAGAATTGATAATATCAGAAGAGCTCCAGCACTTCA 

ATTGAATATAATCCTCTATTATTCTTTTCTTGATTTAATTTCTGTAGCTCCCGAAAACTT 

ACTT CAATCTTGTTGAGCTCAGAATAAACAGATATCTAAAGAAAAGAGTTTCAGAGTAAC 

TTTTACCATAATGTCTACATAGAAACAAATCTTCCAAGAGTAAGCTATTTAAGGACAAAT 

TTTMGAATAATTGTAATTCAGGTGATACTTGGGTTTTTGTACAAGCTTCGACCCACGCG 

TCCGACCTGCCCG 

Sequence 2250 

CCGGGCAGGTCMGCTTCGACCCACGCGTCCGATMTTTATACTAAATTTAGTAAAATGG 
ACTTCTTATTCAMGCATCMTAATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAAAA 
AAAGTGACCT 
Sequence 2251 

TGAGCTAACTCACATTAATTGCCGTTGCGCTCACNTGCCCGCTTTCCAGACGGGGNAAAC 

CTGTCGTGCCAGCCTGCATTAAATGGAATCGGCTNAACGCGCGGGGGTAGGAGGCGAGTT 

ANGCCGTATTGGGCCGCTCTTCCGNTTTCCTCCGCTCACTGGACTTCGCTNGCCGCTTCG 

GGTCCGTNTCGGGCTGCGGCCGAGGCGGTTATCAGCTTNACTCAAAAGGCCGGGTAATAC 

GGGTTATCCACCAGAAAATCAAGGGGGATAAACGGCAGGGAAAAGAAACATGTGGAGGCA 

AAAAGGCCAGCAAAAAGGCCCAGGGAACCCGTAAAAAAGGCCCCGTTGCCTGGGCGGTTT 

TTTNCA7TAGGCTTCCCGCCCCCCCTGGACGAAGCATTCACAAAAAATCCGACNGCTTCA 

AAGTTNAGAAGGTGGGCGAAAACCCCGAACAAGGGACTAATTAAAGGANNCCCAGGGCGT 

TTTCCCC 

Sequence 2252 

AGGTCTTCGACCCACGCGTCCGATTTTACTCTGTGTATTCGTGGTGGTGGGTTATAGTNT 
ACCAAGTTGTCCAAAATGGATAATTCATGTGAGATAGGCAAGANCAAGNGTTGTATCCCA 
ATACCATTTGGTGAAAAAGTTGAACTTCANAAAGGATAAGTANTTTGCCAAGATCACCTG 
NCAACCTGCATAGTCAAATTTGAGATGGAACTCAGCTACTCCAATATCTGTATTTGCAG 
Sequence 2253 

CCGGGCAGG I ! I I I I I I I I 1 1 1 1 I I I GCAACATTGCCCTTTGATGTCCCCATGAGGGCCA 

GGCCCAGGCAGMCCCATCCCATTTTATCCTTAAACTCANAAGGAAATTTGTCTAAATAT 

TAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAAGCTG 

TCTCTCTTAGAGTTCAATGMCACTTCAGGAGAGTATTTCCAACAATATTTAGATATTGG 

AATATCTAAATATTGTTGATTTAGATMCCACCCTAGATTTCTCACCACCCTAGAACATT 

TAGTGGGGAGACATTCTTTTCTCCTTTTTCTGATMCTTGGTCAGAAGTGATTGACTGNG 

CAAATGGTATTTNTCCGGACGCGTGGGTCGAAGCTTGACCTnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnGAATnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnATTGCGCGCTTTGGCCG 

TAAATCATGGTCATAGCnnnnnnnnnnnnnnnnnnnnnnnnTCCGCTCACAAAATTTNCA 

NCACAACATACCGAGCCCGGGAGCCATAAAAGTGTAAAAG 

Sequence 2254 

AGGTACATGTGCCANNTTTGTTATATCTTTANCTATATNCNNNNCTACTTGCCCCTGATC 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

375/467 



ATCTATGNGTAGACMTMTTAGTMGGTCAMTTTTTCACAAACTCCTCCTTCAGCTGC 

TAGCMGTMTCMGAGCTAGTCTATTTTGATAGATAGCATTTCTCATCAGAGTCTCTTG 

CCAGGCAAGAACAGTCAAGGCTTGAGTGGTNTNNTTANTGNCTGCATTCTAAAACTGCTT 

GGCNCCATATGACANNGCCGANCATGTAGATGGGGGTTCGATATCCCCATGAGACATCTT 

GTGCCCAAGTGGCAGGTCCATAGTATTGTAAG A I I 1 1 1 I CAGGGGGCCATTCATCGTCTT 

TCCAATCACCTATGGCTATGCTTCATTTTTCACAGGAAGCACAGACTGGGMGCCCAGAA 

GTTTACCTGTTTTTATGGNCCCCTAAAAMMAANANGGTTCNATGGGTGCCAAATTTAC 

ACAGNTTACCTGGCCCACTTGATTNAGGGMNNTTTTNANNCATTAAGNNCCGAANCCCN 

NNGGGNATCAAAAAGNTTTGGCCCCTGGCCCGGNNNGGGCCCGGCCCNCC 

Sequence 2255 

ACCTGCCCGGGTGGCCGCTCGAGGCCGCCCGGGCAGGTTTTATTTAACATTCAAACTTCA 

TTMGACATGTGCAATATGGCMTTTTACTGGGGATTAAACCCTACCTAGGATTGCTTGC 

TGGGGCTTAGCMCAGGGTCCAGTTCACACTTAGCACTMTTAAATACTTTATTGAATAA 

ATACAATACCAAACAAAATGCATTCAAATGCCGGACGCGTGGGTCGACTCAAGCTAGGTC 

GGACGCGTACCT 

Sequence 2256 

ACI I I I I I I I I I I I I I I 1 1 I I I I IAAAAATTCAAAAAATTAGTTTATTAGCTTAATATAA 

TTAGGTCMTGGMTCCTGTTTTGATCTCAATACTTCCCATATTGCAATATATAAATGTG 

ACAAATTCAGCTGTTTTGCGGCATAGATAAGTGTCTAAGCTGGGCAGTTAGTCTACCCGT 

TTATAGTTCATGTTCTTCATGGCTTTGCAGCATTTGTCACTTTCTATGATGTGTTCAAAG 

ACCA GAAAAGGCCACACTTGACCTGTCAGCTGGTCCTTGAACAGCTGTAGG I I NIMH 

TTTTGAGACAGAGTCTCCCTNTGTTGNCCAGGCTGGAGTGCANTANCGCAACCTCGACTN 

ACTGNAACCTCTGCCTCCAGGTTCAAGNGATTCTNCTCCTCANACTCCTGAGTAGCTGGG 

ATTACAGGCCCATGCCACCATGCCCGGCTGATT7TTATATTTTTAGTAGAGATGGGG^^^ 

CACCATGTTGGCCAGGCTGGTMTTGAAAACCCTCGGCCGNTTCTANAAACTAGTGGGATC 

CCCCCGGGCTTGCAAGGGAAATTTCCNATTNTTCAAAAGCNTTAATCGGATACCCCGTCC 

AACCNTNNGANGGGGGGGGGGCCCCGGGANCCCCCAGCI I I I IGGTT 

Sequence 2257 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGCAGGACCTCAGATTAAAAATAT 

TG CTG AG GTCAG ACGCCACMTTTTCATG ACTTTCTTC AGMGTAGCACATTTTCGTG AC 

TTCCGCTGTCCTCTGAAAAACMAGTTATTTGGAACATGTTCATGCAAAAGTGATTCTGA 

CCMGTCTAMTCGAGCTTTTCTACTGACATGAMCTGNTGGAAAACTGATCTTATTTTA 

TAAG MAGTGTTTTCCCCTNAGGGGGGGNNGTGTGNTTTCNGAACANCCCNCNNATTATT 

TTTTCCCCCGGNNCCNTATAAAATATTGGGGTCCCCCCCCNNNCGGGGGGGNNGGGGNAT 

TTTAAAAATTNTTTTTTTNNTCCCCCCCCCNCTGGGGGG 

Sequence 2258 

GGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCC 

GAGCAGAACTTGGCAGCAACAGAGGAAGGGCCCCTGGAGCCGGCTGTCGTGGATGCCTTT 

AATCAAGCCTGGCATTTGGTTGCTCACGAATGTCCCAACTACTTCCGCTAGGCCCATCAT 

GGCTCAGGCTGCCCAAGGCTTTTCTGTCACCTNTTTTGTTCTCTCACACTGACCAAGTCT 

TACCTGCCCG 

Sequence 2259 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGCGTCCGTA 
GTAATAGGAATTAACTGACCCCTTTNGGATGGGGGAGAGCATCAGGCTGGGGTCAGGTAA 
GTGTAAATGGCCTTCTGAGCATGCTCTTCTAGGCTGACTCC 
Sequence 2260 

CCCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTAGTTATCAATTCATGGGCTATGGCCA 

CTGGTTTGCTGGATGGTCAGGGACTTGGAAGGAACATGACTGGAAAATTGGTGACAAAAC 

GGTCTGTGGAAGAGGTATATACACAGATCTTTCTGAATGGGTGAAAAATGTGAGGATATT 

TGTGTCTCATATGAATGCCCCCAAAGAATGACTTCAGCAGAAGAGGATTTTAATAACCAA 

GTAGATAGGATGACCTGTTCAGCCCCTTTCTCCAACCATGCAGGTAACTGCCCAATGAGC 
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TCATGAATAAAGTGGCCATGGTGGCAGGGGTTGTTCGGACGCCGTGGTT 
Sequence 2261 

TGGCGAATTGGACTCCCCGCTGTGGCGGGGNGNCGGNCGNCNGAGTAAGACTTCGTCTCA 

AAAAAAACAAAAACAAAAACAAAAACAAAGAAGTTAGCCTGGTTGGGCATGGTGGCTCAT 

ATCTGTAATCCTAGCACTTTGGGAGGCCCACTGGGGATCATTTGAGGCCMGAAATTTTG 

AGACCGAAGCCTGGGCAACGTTAGTTGAACCCTCATCTNTTACCCAAAAAAAAAAAAAAA 

AAAAAAAAAGTTGCCGNGCCCGCTTCTTANTAAACTAAGTTGGATTCCCCCCCGGGGCCT 

GGCAGGGGAAAATTTCGAATATTCAAAGGCTTTATTCTGAATACCCCGTTCNGACCCTTC 

GNAGGGGGGGGG 

Sequence 2262 

AGGTGGACTTGATTCATAACCAACCAACCCAGGCTGTCATGCTAAAAAAAAGGGCAGAAT 
CTTTGCTGCTGMCTACTNGCATAGGGTTGACTACCATTGNTCCTTTAGTNTTAGTTTGG 
GCTAGCAAAAAGGTGNGTCCTTTGCCATGTAAATAAAAGCCNTNTCNNGGTAATNAAAAN 
GGTNTNTN I I I I IAACTT 
Sequence 2263 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTCCCAAAGTGCTGGAATCACAGGAAT 

GAGCCACCACACCCAGCCAMTTGGGCACAAATTTAAAATTTGACTTTTATTAATGATAT 

GGTAAAGAGATCTAGCTTGGTCATGACACCCTTGTGTTATACGGTGACAGGCAAATCATT 

TAAAMTATCTAMCTATAATTTNCTGTAGTTCACATGAATTGGATATTCTTGAAGCGGA 

CGCGTG 

Sequence 2264 

GGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATTTTTTGGGGGGAGAGACACAG 
ATTTTTACACTMTATA TGGAC CTAGCTTGAGGCMTTTTAATCCCCTGCACTAGGCAGG 
TAATMTAMGGTTGAGTTTTCCAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCC 
CGGGCGGCCGGCCGCCCGGGCAGGTNTCGGGCCCGAGAAGACCTCC I I i I IC 
Sequence 2265 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGCGTCCGAGCTAACGAATGCTNGACT 

AACTAGATNCCAAAGCTTGCTCTGTGAAAATTCCCGNATAACCNNTGAAGTGGGCGACAC 

CNTAACCCTGCACACCTTACTCCTGGTNTCAGAGAGCCCAGTIMTGAACATAAACTGNGTA 

GAGGTGTTAGACTCANCCTACCCTAGTAANGCCCAACCTCCGAGACCAACCTTAAACATC 

AGTAGACTGCGAGCTGTATGTGGATAGGAGCAGTTTNGNCAACCCCTGCNAAGTGACTCT 

GAAAAAGAC 

Sequence 2266 

CCGGGCAGGTCTTGAGTCGACCCACGCGTCCGCCTAGCAAAGCTGTTTCCACTGAATGCA 
TCTAAGCANNGATGGANCTATGCCAAAACCACCACAGGNGTTTCACTTNAATGATACCNC 
GAAACAAGG 
Sequence 2267 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATATGCTACAGGGAAGA 

AAACGTTCCAAAAACAAGAAAAAGTTGGAAAAGGCAATGAAAGTGCTCAAGAAACAGAAA 

AAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCAC I I I I I I I I I I I 1 1 I I I I I I ICAATN 

TTATTTATCA AAATM ATTTATTAAAAGTTTTCAAAGACCNCTTTAAAGTGTANCTGGCT 

TNMNACAGATTTTTGGNACTCNTAACGGACACTGCAGTTTTNAACNCCATAGCACTCAT 

TNTATTTCACACATCA I I I I IAAC 

Sequence 2268 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGGTATGTAATCCAGG 
AAGfGACCAGCCTGATGCGTGTTATGACTCACTGNAAGCCTCCCATGATTAAGGAC 
Sequence 2269 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCAC 
GCGTCCGCCGGAGAnnnnnnnnnnnnnnnnnnnnnnnnTGATCTAGATCACGAGCAGCCG 
CAC l I I I I i I I I | | | n I I I I I II I I I H I GGACGAATTCAGGAGTCCTTTATTAGCCGG 
CAGCCGANAGACAGCTAGCGCTCAAAATTNTNTTGGCCCCGAAGAAGGAGCTAGATTTTC 
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TTTTATACTTTGGmAMCAGGGGAGAGGGGGAGTNTAGTTGAMCAAT^ACAGAAG 
TAAAGTAGGCAAAAAGTTAAAAGGATAAACGGTTACAGGAAAGTAAACAGTTCCAGGNGC 
AGAGGCTTTAAGTNTATCCTAAGGNGATGGACCCCGGGCTTTGGGC 
Sequence 2270 

AATTGGAGCTCCCCGCGGTGGCGGNCGAGGTCAAGCTTCGACCCACGCGTCCGGTTTGTT 

TTTTTCTTACGGCAACTCAAAGCAAAGAGCTGGAGGAGCCAGNCATTATAATTGCTTACT 

CTCATCGCTTAGCGCCCCAGGTGGGATGTGTTTCCAAMCACATTTTTGTNTTTATAAGG 

AMTGTAGTTAGGATTAATTTTATTGTCCTMTTAGMCTCACATTTTGGTTAAATCCTC 

A ATTTC ATT AAAAA A AAAA AAA AAAAAA AAN G G N 

Sequence 2271 

CNCTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTC 

GACCCACGCGTCCGGTGATATGTCACAATGCCGTGTAGCCAGAGCCTAGACAAAAGTTAC 

AGCACCTGGGAGATCAGTGCAGAGATATGTCACAATGTCCCCAGTAGGCAGAGACCAGGC 

AACAGTTGGATCACCTCGGGATCAGTGCAGAGACATGTCTCAATCCCCCTGTGGGCACAG 

CCTAGACAAGAGTTAAATCACCTCGGTTAACAGTGCAGAGATATGTCAATATTCCCCTGT 

AGGCCGAGCCTACACAAGTGTTACATCACTAAGGTGATCAACGCATAGATATGTCAAAAT 

ATTGCCGTGAAAGCAGAGTCTAGACAAGAGTTACATCACCTGGGCGATCAGTGCAGAGGT 

ATGTGACAAGGCCCCTTTAAGCAGAGCCTAGACAATAGTTACATCACCTGAGTGATCAGT 

GCAGAGGTCTGTCACAATGCCCCTTTAGGCAAGAGCTTAAACACCTCGGC 

Sequence 2272 

AGGTACTTGACCCCACAGCCATCTGGGATGAGCCGCTTTTCAGCCACCATGTCTTCAAAT 

TCATCAGCATNGAACNNGGTGAAGCCCCACTTNTTTGAGATGNTGGATCTTCTGGCCGGC 

CMGGAAACTTGAACTTGGCCCTGCNGCAGGGCCTCMTCACATTGCTCCTTTGTTTCTT 

GCAAGCTTTTGGGTTGCCGGGATTGGGGACATNGATAAACTTGGGCCAAATTGTGAAACC 

CTGGGCCACAAGTGCCCTGGGGGNGCCTTTCCAAAAAGGGCCACCCTCCGCCATGCNCTG 

TTNTGGGAAGNCCTTGTTCAAGCCCCCCAANCNACAGGGGAACCAAACCAATTCNTTNGT 

TTGGAATGGCCGGGAATTGAACCGTGGNAAAGGGGGGGTTGGGGAAGCCCCGCCACCCCC 

NGGNATAATGGGGAMGCCAATCTTTTGCCMCAMCTTTTTTTMCCAATGTTACCCTT 

GCCCGGGGCCGGGTCCGCTTCTAAGAAACCTAGGTGGGGGATCCCCCCCGGGGGCCTGGC 

GAGGGGAAATTTTCGAATTATTCNAAAGCTTTAATTCTGAATACCCGTCCGGACCCTANG 

AGGGGG 

Sequence 2273 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTAGCTTGAGTCGACCCACGC 
GTCCGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA AAAAA 
AAAAAAAAAAAAAAAAAAAGGGGNGGCCGGCCGCCCGGGCGGGTCCCNTCANANANAAGG 
GGGNGGGNGCTAATCCAGTACCAAACNTTC 
Sequence 2274 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTTCTACTGT 

AGAGGAAAATATATGACAATTATCACTGTCACTGCCTGACATACAAAAATGGAACAGAAC 

AGTGGGGTAATGAAGGGAGGGGAGGGAAAGGGAAGAGCAGGAGAGAGAGGAGTTGGAGGA 

GAGGGGAAACAAAGGGAAAAGGGTCTATTAAACAGAGGCCTAGAGAAGCTAAAATTTGGA 

AATGGCAAATCTGAGAAGAGCCTGAATAAAAAGTGGGGGTGAGGCCATGCACAGTGGCTC 

ACGCCTATAATCTCAGCACTTTGGGAGGCCGAGGCAGGTGGATCACCTGAGATCAGGAGT 

TCGAGACCAGCCTGGCCAACATGGCAAAACACCT 

Sequence 2275 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCG 
TCCGAAAAATGGGAGACAATTTCACATGGACTTTGGAAAAT A I I I I I I I CCTTTGCATTC 
ATCTCTCAAACTTAGTTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCA 
TTCAACCTTACCAAAAAAAAAAAAAAAAAGACCTGCCCG 
Sequence 2276 

CCGCGGTGGCGGCCGCCCGGGCAGGTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGACTAG 




WO 01/070979 



PCTYUS01/09126 



TABLE 1 

378/467 



AGGGATGAACCACCATGCCCAGGTMTTTTTTMTTTTTAGTAAAGGTCGGGTCTCACTA 

TCTTTCCCTGCTCAGCTGGTCTGGAACTCCTGGGTTCAAGTGATCTTCCCACCTCAGCCT 

CCCAAAGTGCTGGGATTAAAGAAGTAAACTACCACACTCAGCCACACATAGGTAATTTAA 

MTATTTCCATAGTCACMTTAAAMCACATATAMTAGGTAAAATTAATAACATTTTAT 

TTMCCCAATATATTAAMTATTTCCACTTTAAAAAGAGACCTCGGCCGCTCTAGAACTA 

GTGGGATCCCCCGGGCTGCAGGGAATTCGATATCAAAGCT 

Sequence 2277 

ATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAG 
TTTTTATCTTTGACCAACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAA 
AAAAAAAAAAAAAAAAAANGT 
Sequence 2278 

GNACC CAGTAATCACATAMGGTNTGCAGGTCATGNTGTTTATTTAGCTTMCTGTTTTT 
TTTTTATTTGTTGMGGGGGTNGGTGTTATTTCAGNCTTT^ 

CTTGGTAAAATCTGTMGAAAGTTCCATAMTTATTGGGGGGGAAGGNATTTCCTCTGAA 

ATTGGGCTAAAATTCCTTGTAGCTGAAAAAAAAAANAACAAAAAAACAATAAAATANGGN 

GGCCGGGCCCGCTTCTAAGAAAACTAGGNGGGGGATCCCCCGGGGGCCTGCAGGGGAATT 

CCGNATATCAAAGCCTTATCGGATTACCCGNCGNACCTCGGAAGGGGGGGGGGGGGCCCC 

GGGGTACCCMGCTTTTTGGTTTCCCTTTTMGTGGMGGGGGTTAAATTTGCCGCCG 

TGGGGCCGTAAAATCAATGGGTCATAAGGCTGGTNTTCCCTGGTGGNNGAAAAATTTGGT 

TAATCCCGCTTCAACAAATTTCCCACCACCAACCANTACCGAAGCCCGGGGAAGCCATAA 

AAAGGTGGTNAAAAGCCCTGGG 

Sequence 2279 

NGGNGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAAAAAAAAAAAAAA 

ACAAAA AAAAACCACTTCAACTAATTCATCTGACAATGCTGTTCATATTCATGACGCCAT 

i I M l iGTTTGTTGTTGTTGTTGTTTTCCTAATAATAAAGAAGGAGACTTAGGGCTGTTG 

GGCTGATATATGTTTGGGGGTCCCACCTCCCCGCCTCATCCGTACCTCGGCCGACCAGTG 

CAAATATCTACCCAGTTAGAAGAGTAAATACCATCTTAGTGTTATTATCAAAATATTNTG 

AACTCATGAACCTCCTCAGACTGTTGCTGGGGACTCCCAGATATCAATACTCTGAGAACC 

ACTGATCTMTGTTTCTTTAGTCAGTTTCTATTTGTTCTCTAGTATAACCAAGCATAAAA 

GTAAT 

Sequence 2280 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCTCAG 

GGCACAAAGTGATCATTTGGGATCCTAAGTTAAAAAGGAAATGCAAGAGTAGGATACTCC 

AATTCCAGAGTCTTTGCAGGAGGCTAATCCCACAAGAAGGGTAGCATCAGAGAAAGTGGC 

ATTGGTCTTAGTGGTGGATCATCAGGTAGACAAGTGATAGTGTGTGTAACCCATCTGAAA 

TTCATTTTACCGTCACCACTCTTACAMGGACAGTTTATTCCCAAGGACAGTGCTGACGG 

GGAGGGGGACAGGCAGGGAGTTAGGAGGGTTTTCGAGGATTTCAAACAGGTGGAACCCAT 

CCAT CCCTA TTCCCAAGGGCCACTTACAACTCTAAGGGGTGGTTACAGGATTAACTACCA 

GTTCATTTTCAAAATGCTGCTTTGAACTCAGAGGGTTGA^ 

TGTAAAACTTTTTACAAAATAGT 

Sequence 2281 

CCGGGCAGGTTACAAGCTTCGACCCACGCGTCCGGTCATTATTACCCTCACTGTCAACCC 

AACACAGGCATGCTCATAAGGAAAGGTTAAAAAAAAAAAAAAAAAAAAAGTGCGGCCTCG 

AGCGGCCGCCCGGGCAGGTACAAAAAAAACCTTACATAAATTAAGAATGAATACATTTAC 

AGGCGTAAATGCAAACCGCTTCCAACTCAAAGCAAGTAACAGCCCACGGTGTTCTGGCCA 

AAGACATCAGCTAAGAAAGGAAACTGGGTCCTACGGCTTGGACTTTCCAACCCTGACAGA 

CCCGCAAGACAAAACAACTGGTTCTTGCCAGCCTCTAGAGAAATCCCAGAACACTCAGCC 

CTGACACGTTAATAC 

Sequence 2282 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCTACCCACAGCGTCCGTTT 
ATGTTCAGCAATAAAGGTTCTATCCGTAAAAAAAAAAAAAAAAAAAGNGCGGCCGGCCGC 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

379/467 



CCGGGCAGGTACTATGACTAGACACATGATGCATGGCTAAAAAGCTCTTCTGGATAACTC 
CTTAGNGAAGGNCTAACNNCCCCACCACCATCAACCTACAGCCCTGCCI 1 1 I I 1 11 11 I I 
TTMNAAGTCTGTCACCACMTNTTGTCTGGNGCTNGATTTCAAATAATACATTTNTAGA 
ACCTGCCCGGGCACACGCCCNATMCGANTGGTTTTNNNTTATATCAATTAACGTTAAA 
Sequence 2283 

CCGCGGTGGCGGCCGAGGTACTCCAGCCTGGGCGACAGAGCAAGGCTCAGTCTCAAAAAA 

AAAAAAAAAAAAAAGGAGAGGAATAGTAAATTTATAGTGGAGAAATCTGCAGTCACTAAC 

TTAACCAAATAACCACATGGATGTATCCCTTATTAAAGGTAATCGAAAGGGCACAGCGTT 

ACTTCTGTGGAATTCTTGCCCAAAATGCATAATCTCAATCAAATCATAAGAAAACATCAA 

AATTGAGAGGCATTCTACAAAACCAATAATTAACCAATATTCATCAAAAGTGTCAAGGTC 

ATAAAAGACAAGATGTrTATAGAAATACATTAATTTTGGTTTCTACTTAAATTTTAT^ 

TTAAMTAATTGTTTTAGAGATGTGGCCTTGTTAGTTGGCCA 

Sequence 2284 

ATA GGGCGAATT GGAGC TCCC CGCGGTGGCGGCCGAGGTGGG AI I I I I 1 1 1 I 1 IGTTTTT 

TTCI I I I I I 1 1 I I GGTTTGTTTTAAATCAGTGCATAMTTTTTCTTTTCTCATTTCAGCA 

GATGGACAAACAGATGGACTCTACAGCTAAGTGGAATATCAAAGGTAGAGGGGTGATTCT 

GTGAGACTGATAGGCCTGACTATTCTCAATTCTCCCCACTGCAGTGTTCACGCGGACGCG 

TGGGTCGAANACCTGCCCG 

Sequence 2285 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCGCGTCGTAACAGTG 

TAACATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTCTAGTCCAAAGGCATA 

AAGAMGTNTGCTTTAAAAAAAAAAAAAGGAATGGTTATCTTCAAAAAAAAAAAAAAAAA 

AAAAAAGT 

Sequence 2286 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACGACTCATATAGGGATCTAGA 

TCACGAGCGGNCGGCCGCCCGGGCAGGTCTCCCATCTTGCGCAAGTTGGTCACGTGGTCA 

CCCMTTCTTTGATGGCTTTCACCTGCTCATTCAGGTAATGTGTCTCAATGAAGTCACCG 

GACGCGTGGGTCGAAGACCT 

Sequence 2287 

GCGAATTGGAGCTCCCCGCGGTGGCGGCCGTTAAAGGAATAATCTGCAGAACATCTTGAT 

TTACAAGGGACAAAATGATGCAAATTATATGCTGTCCAACCTACTGGTGAACTGGATCAG 

AATGGTCCMGGACTGTTAMCAGAGGAAGTATTTACATTTTGAAAACTTGCGGACGCGT 

GGGTCGAAGCTTGTACACCT 

Sequence 2288 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGNCTGGTACATAAAANAATTTN 

TTNGTCTTTAMTNGATACNMTGTCTATCANCTTTAATCAAGTTGTAACTTATATTGAA 

GACANTTNGATACATAATAAAAAATTATGACAATGTCCTGGAAAAAAAAAAAAAAANAGT 

GCGGCCGACCT 

Sequence 2289 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATACTTGTCATTG 

CGTAGACTTCTTATCAAAATTTCACATTNATCTGTAGGAAAATGTAAAGTTGGTAAAAAT 

TGTTTACACAMTCA CACA TTTTCCATCCTTGACAATTGCAGNGI I 11 I 1 1 1 IAAATATT 

GCTGTATTMGACAATTTTAACTGAAGTAGGTTGTAGAGGCTANAAACCTGATTAATAGA 

GCAGTATTAGACAATTCTAACTGAAGTAGGTTGCAGAGGCTAGAAGAAACCTGATTAATA 

CGGACGCGTGACCT 

Sequence 2290 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAAGCTTCGACCCACGCGTGCG 

AGTCAGGGMGGTTTGTTTATGTTACATTTATTTCACCAGMCTATTTTAATATATCAAA 

GGGGTTTACTATGCCAAACAAAATTCTAGGGAAAAATACTGCTAAAAATGGATGCCTCAT 

CAGAACATGCTGTTGAGTCCAATGTGCCATMGACATTTTAGCATGTTAAATAGCACTTT 

TAATAGCAAAAAAAGGCACATCAACTGCGAAGTTATCCTTAGTTTGCAAATGCTTTTTCT 
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AGATTAATGATTTTTCAATCATTAGGGTACCTGCCCG 
Sequence 2291 

CGCTACCTGCCCGGGCGGCCGGCCGCACI I 1 1 I I I I I I I I I I I I I I I I CAAGTTTTATGA 

TTTATTTAACTTGTGGAACAAAAATCGGACGCGTGGGTCGGACGCGTGGGTCGAAGCTAC 

CT 

Sequence 2292 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTACCTCCAAGCAGGATGATGGG 
CTAGACATGGAGCATACATAAACGGGCAAGATTCAGTCCCTGACCGCAAGGCACTTACAG 
TCTAGTTGGGAAGGGAGACACAAATGTACCT 
Sequence 2293 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCAGAGACTCCAGGAAAAATCAAAAATTTGTTT 

TTGCMTTAGCCGAGCACGTAGCCCAGTCTCTAAATGTCACTTCATATTATGTTTGTAAA 

ACAACTGTAATAAGAGATCAATGGCCATAAGAAGCCTGAGAATTAGTGCCTACAGACCCA 

GTTCCTGATGMTTCTCAGCCCAAAAGAATCACCCTGATCATTTCTAGGTTCTAAAAGTT 

TCAATTATTGGACAATATTGCATAGCTAGAAAAAAAAAAAAAAAAAAAGTGCGGCCTCGA 

GCGGCCGCCCGGGCAGGTACTTTGAGCAGGATMTAACATAAATTTCATTTAAAAAGTTG 

TATTTATAGCCCCAGTAACCGGAAAGAATTATAAGTAATTATGGAAGTATTATATTCTGA 

CCATACCAAGAGTTAAAAACAAAGAGTTCCTACTAAAGAGGAATATTTTCAAGATGATCT 

GGTCCATCATGTGCATAGTTAAAGAATGGTTGGTTTAATAMGATTCTTTTGCAAATAAA 

GAAT 

Sequence 2294 

AGGTACAAGCTTCGACCCACGCGTCCGATGAACAACTGGCTGCTCTGTCCCAGGGTCCAA 

TATCCAAGCCCAAGAGGAAAAGAGAGAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGG 

CCGCCCGGGCAGGTACAGATAAATATAGGAAACTACTTCAGATTATTGGACAAATAAGAA 

TTTCAGTGTGTCACTACCTATAATTAAGTAGCAGCACACCACAACCACAGCAAGTAAACA 

CTAATCTCCTCTACTGCCTTTTTGGGGTCTTTTCCAGTCACAGGAGCTAATTTACAGGGA 

TACCACTGGGTTTCMCCAATCTCTGAAAGTTCCTTTATTATGAGTTTTGAMTTTAACT 

AGTGCATCACCTACAATTCTGTTGGCTAG I I I I I I I CTTACTCTTCACTAATTAAGTTTT 

AATACAATATGTAATGGTTTCAAAATTTTT 

Sequence 2295 1 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGAAA 

CTTTAATTTTTAGGAATAACATATTGACTTACTGAACTGAAGCATTCTGAGTTGAAAGGA 

GCCCCAGAGGAAAGGAGTTCTGTGTTGCTCACATGTTAAAAGCTTGCTCACCTTCAGAGC 

AGAGGGAATACCTATCTTCAGATATCCGCCCATT7TCATCTCTTCATTATAGTCAAACAG 

TGTGACTTGAGAGTGTTGCTCTGGTGTCTGTATTCTGGCTTATGMGATTATTTGAAAAA 

GAACTCTTACTACATTGAMTGCAGACTTTTAAAAATTTAAATATTGGATTAGGCAGTCA 

AAAAACCAAACAAGCATAAAAGGTCAGTAAGTTGTAATCTTAAAAGTAAAGGTGGAAAAC 

TCATTATAAATGGAAGAAAAGTTTTGATTTCC I I I I I I GTTTGATGGGCAGTATGCCATA 

TTATATCCAAAGTTGGTTTAAAAAATACTTCCATCAACTATTTTTATTTAAAATAAACAT 

TT 

Sequence 2296 

CCGCGGTGGCGGCCGAGGTATACTTTGCACCTTGAAAATATAAAATAAAATAAATTTAAA 

MTAMGGTMTTTTGTTCTTCCATGTCAGCTGAAAATAAGTGAAGACTGGGTCAGTAAT 

AACATTGCTTTGCTGAATTCAGAGAATTCTAATAAAATA I I I 1 I AGTTGGGAAGCTATCT 

GTATTAAAAAAAATGATCTAAGGCTGGTAACAGTGGCCATACTTATAATCCCAGTGCTTT 

GGGAGGCCAAGGCAGAAAAATCACTTGAGGCCAGGAGTTTGAGATCAGCCTGGGCAACAT 

AGTAAGACCTTATCTCTACCCGGACGCGTGGGTCGACTCAAGACCTGCCCGGGCGGCCGG 

CCGCAC I I I I I I I n 1 I I 1 I I 1 I I I I AAGCTGCTCCTTGAGGATAAGGGCTAACTCACAG 

GCAGTGCACCAAGAGCCACTATAAAAAGATCCTTAATGAGCAAAATATATCC 

Sequence 2297 

CCGCGGTGGCGGCCGCCCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTTTTGCTCTA 
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TCCCATAGGAGTTGGTATGTTGTGTTTCCAATATCATTTACTAAAAGAAATTTTCCTTTT 

TATTTCTTCATTGACCAACTGGTCATTTGCATGTTCTTTMTTACTATGTGTTTGTATAG 

TmCAAAATTTCTCTTATTMTTTCTAGGTTTTCCTGTGGTCAGAGMGATG 

CTACTTTAAI I I II I GMTGTTTTAATACTTGTTTTGTGACCTAACATATG ATCTACCCT 

TGAGAATTATCTATATGCTGAGGAAAATAATGTGTATTACACAGCCATTGGATGAAATAT 

TCTGTAACTATCTCTTAGGTACCT 

Sequence 2298 

GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACGCTTTATTTATTTCTTTTAGGAAT 

TGCAGGTTCCTAACAAAGTAGGGGTGAGGGGGGTGTTACAAACCAGTCACTAGGCAGGAA 

CATTAGACTCCAAAAGCAGAGAMTGCTTMTTTTTCTTCTACCTGTTTCACCACATTCA 

TGTAGAACTGTAGTAAAAAAGATGGTGAATCAGGCTGAATCAATCTAAATAACAACTTAA 

GGCTCCCAAATCACATGAACCTAGGACCACTAAATCCAATGTCAGACGTGTTTAAATGGT 

GCACTGCTCTACATTTTTCTATTATGCAAAGAGCTAGAAAATAATGGTAGTGTCATTATG 

ACATTCCATGAAAATGAANGAAAATCTTTCAGGAAAAATTAAGAAAATAAAAATGTTTAC 

TAAAGAAAGAATGGTCCGGCTAAGTGCTANAGTTTNTTTCNN 1 1 1 I I I I I I A 

Sequence 2299 

AGGTACTATCAAGCTTTATTTTACCTGCAAAAATATTTTAGCTACACTTGGAAAAAAATA 
AACTTGAGAATATAACTTCACATTTCTAAGGCCAGCGGACGCCGTGGTCGAAGCTCGACC 
TGCCCGGGCG 
Sequence 2300 

AGGTACGATTTTCCCTTCGCTTGMTATTATCCCTGTATATTGCATGAATGAGAGATTTC 

CCATATTTCCATCAGAGTMTAMTATACTTGCTTTAATTCTTAAGCATAAGTAAACATG 

ATATAAAAATATATGCTGMTTACTTGTGMGAATGCATTTAAAGCTATTTTAAATGTGT 

TTTTATTTGTAAGACATTACTTATTMGMATTGGTTATTATGCTTACTGTTCTAATCTG 

GNGGTAAAGGTATTCTTMGMTTTGCAGGTACTACAGATTTTCAAAACTGAATGAGAGA 

AAATTTGTATAACCCATCCTGCTGTTCCACCTGCCCCGGGCCGGCCCGCTCTAGAAACTA 

GGTGGGATCCCCCGGGCCTGCAGGGAATTCGATATCAAGCTTAATCGATACCCGTCGACC 

CTCGANGGGGGGGCCCGGGTACCCAGCTTTTT 

Sequence 2301 

GGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAAGCTTCGACCCACGCGTCCGGTGGCGA 

GTGCCTGTAATCCCAGCTACTGGGAGGCTGCGACAGAAGAATCACTTGAACCTGGGAGGC 

AGAGGTTGCAGTTAGCCAAGATCATGCCACTGCACTCCAGCCTANGCAACAGAATGAGAC 

TCCATCTCAAANNAGAAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCGGGCGGCC 

GGCCGCACI I I II I I I I I I I I I I I I I GCCATACAGGTTCATTTTATTGAGTGGAAAGCTT 

ACAAAAGGGCCACTGGCCCCTTCC 

Sequence 2302 

CCGCGGTGGCGGCCGCAC I 1 1 I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I II I 
TTTTAGGGTTGAGGGGGAATGCTGGANATTGNAATGGGTNTGGANACATATNATATAAGT 
AATGCTAGGGNGAGTGGTAGGMGTrnTTCATAGGAGGNGTATGAGTTGGTCGTAGCGG 
AATNGGGGGTATNCTGTTCGAANACCTGCCCGGGCGGNCGGCCNCACI I 1 1 I I I I I I I I I 
TTTTACTTTGGCNGGGGN I I I 1 1 ICTTTTCI I I I I I I I I I CAGCTACNGG AATTTANCCN 
ATTCANAGGAMTCTTCCCCATAATTANGGAACTTTNTTACANANTTTACCAAGTNTGGG 
CNNCCCNATAAGAAAAAGACTGAAATAACAACAACNACTTTAANAAAT 
Sequence 2303 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCNNTNTN I I I I I I 1 1 1 NTTNGCCAG 

NTANMTCTNAGCTTTTTATTTGTAGGAAAAAATAAACAGATTNCCCTCCNNAACANGGC 

GTNACAANAAANGAGGCAATNAAGGGAAAAANGCANATNCTAAACGGACNCNTGGGTTNA 

ANCTTGACCT 

Sequence 2304 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 
CCGAGAAGAGTTTGCAAATGCMCAAAATATTTAATTACCGGTTGTTAAAACTGGTTTAG 
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CACMTTTATATTTTCCCTCTCTTGCCTTTC 
TTTTAMTGACATTTTTGATAAATTATGTTTGTACCTGCCCG 
Sequence 2305 

A GGGCGAATTGGAGCTCCC CGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCCGCA 

CI I I I 1 1 I I I I I I I I I I I I I I GNTTTCTGGGTGMGTTTATTCTGTTTTCACATCTAGGT 

TGTTGGGGAGAGTGATAGACAAAGTTCTGGATTCTGGGCATCGTCGGCGCATGCTTGTAA 

TCCTACTTGGGAGGTTGAGACAGGAGGATCACTTGAGGCTAGGAGTTGGAGGCTGCAGTG 

TACCTGCCCG 

Sequence 2306 

ACTTAGGGCGAAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTCGGCCGCAC I I 1 1 11 M 11 

I I I I I I 1 1 I rTTTTCTTCTTTCTCTTGCCTGGATTCAGTCCCAGAAATGTTAGGACTACC 

TCAGTTTTGCTCCAAACCAAACTCAAACAACAGCAGCCACTGGAAATCAAGGAAACTTCA 

CTMGMTTTMCAGATCATCAAMCACCGCCTCCTTCCCATTTTAGCCGGACGCGTGGG 

TCGAAGCTTGTACACCTGCCCG 

Sequence 2307 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATTAAATA 
TTCTGAGAGGTGAATGTAAAATATAAAAGGTATAGG I I I I I I I I I.I I I AAAGAAAACAAT 
NAACTTTCAAAGAGAAAACCAAAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 
GCACTTN I rTTTTTTTTTTTTTI I AAAANANANATGAGGTTTTGNTATGTTGCCCAGGCT 
G 

Sequence 2308 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC 
CACGCGTCCGTTCTAAATGATCGACAACTCTCAAGCAATAACTTGACTGTTGAATAGAAG 
ATTAAGAAAAGTTGGTTAAAAAAAAAAAAAAAAAAAAGT 
Sequence 2309 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAAGCTCG 
ACCCACGCGTCCG AAATAATAAAGCTAG AAGTAATA I I I I I CCTTTTGTCTATTTTCCAA 
ATTGACTCGATATTGATGGCTACTTTTGTAAGTTTTTATTTAAGTTTAAAGGGAAT^ 
ATTGATCACCTCTATGTGCTCAGTACCT 
Sequence 2310 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAAGCTTCGACCCACGCGTCCGCA 

MGTTCACCMATTCGCTACAGCCTAAGACTAGCCTCACCAGTCCTTTTTCCCATTAATC 

MAACTTTGCAGMGAGACAATGATTTTTACCATTCATTCMCCAGTTTGCACAGAGAGA 

GGCTGMGCCTGACTTGTMGAMCTCTTGTCCTTTTGCCAGTGTGCCAGGTTTCTGGGT 

TCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGGGCCA 

AGCCATGTTACCCAAAAAAAAAAAACNNTNNNNNNNGGTGCGGNCNGNCCCTCGNGATCT 

AAAN CCCCATAGGGGGNGGGATTMCMTTNCCATTCCCNNGGGGAAATTTTTTGGCNCC 

TTTTNTTGGGG AANAAAA I I I I I I I I I I GNAAATTAAAANGGTTTAAAAANCCNCCCCTT 

TTTG N AAA AATTT A NTG G NTTTT A ACCCCCC 

Sequence 231 1 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I I I I 1 1 I I I I I I I 1 1 I AANGCTG 

CTCCTTGAGGATAAGGGCTAACTCACAGGCAGNGCACCAAGAGCCACTATGAAAAGATCC 

TTMTGAGCAAAATATATNCCCTATTATTTTCCTACMGTTGCTTTTTACTTGAGTAGGA 

ACCCTTGATTG 

Sequence 2312 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGGTATGCCAGA 

TGAGMTGACAGGAGCCATCCGCAAGCAGTTGGCGGCTTTCTTAGAAGGCTTCTATGAGA 

TCATTCCAAAGCGCCTCATTTCCATCTTCACTGAGCAGGAGTTAGAGCTGCTTATATCAG 

GACTGCCCACCATTGACATCGATGATCTGAAATCCAACACTGAATACCACAAGTACCTGC 

CCGGGCGGCCGGCCGCACI I I I I I I I 1 1 I I I I I I ACCTGAAAATGCTTATTCTAGCTTCA 

CATTTGATTGTTTGGCTAAGAAGAAAATTATTTATTAGACTTMTTTTCCTCACGAGTTT 
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AAAGATTGCTTCAGATCTTAAACTTCTAATGAGGAAAGCTGAGA 
Sequence 2313 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGCAAGCTTCGACCCACGCGTC 

CGCAAAGTTCACCAMTTCGCTACAGCCTMGACTAGCCTCACCAGTCCTTTTTCCCATT 

MTCAAAACTTTGCAGMGAGACMTGATTTTTACCATTCATTCAACCAGTTTGCACAGA 

GAGAGGCTGAAGCCTGACTTGTMGAAACTCTTGTCCTTTTGCCAGTGTGCCAGGTTTCT 

GGGTTCCCCTTCTCTGAGTGGCTTTGATGACCCTGCTTGCTGTGCCATAGCTATGGGGGG 

GGCCAAGCCATGTTACCCAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCA 

GGTGTCACTTTCM CTTGGTT ATGCCTAAAACAAAGTCTCCCTCATCTCCAAACAATTTC 

TCCCGACTTTCTTTCTTTTTTGAGATGGAGTCTTGCTCTGTCGCCCACGCTGGAGTGCAG 

TGGCATGATCTTGGCTCCCTGC 

Sequence 2314 

CCG CGGTG GCGGCCGCAC I I I 1 1 1 II 1 1 1 < 1 1 I 1 1 1 1 i I 1 U 1 1 1 NAAATGCTCAAAAGA 
ACM TTTTTATGTMGTTTGTGATAGAGGCCTCAGGTAATTCTACAAAAATTAAAACCCA 
TTTTCAATGCAAMTTCCCGAACATAAACAMTGCTTTTAAAAATATGGATGGNGTGGTT 
ACTCTTTTAC^ 

TGTGI I I 1 1 I I I I I I IGNGNGNGI I I I I I I I I I I I I I I I I I ATTATACNCTAAGTTTTAG 

GGTACCTGCC 

Sequence 2315 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACC 
CACGCGTCCGACTTrTTGTCTTAGACCCAGTTAGGGTCACCTTACAGTGCAGGTGGAAAG 
AAAGCAGGACTGCTGAGAGGAGCTCAGGA 
Sequence 2316 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCACGCGT 
CCGGATTAAAGTCCTACGTGATCTGAGTTCAGACCGGAGTAATCCAGGNNGGTTTCTATC 
TACTTCAAATTCCTCCCTGTACCTGCCCG 
Sequence 2317 

. TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCGAGGTGTACAANGCTTCGACCCACGCG 
TCCGCCACACGTMCTGAGATGCTCCTTTAMTAMGCGTTTGTGTTNGANGTTAAAAAA 
AAAAAAAANAAAAAAAAAAGT 
Sequence 2318 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACTTTTNN i ( I < I I I I I I I I I l 

ATTATCTAMTCAGNTTTATTTMGAATTTCCAACANTGACAACTNTTATAAAGGGGCAT 

CCAAGCACAGGACACANAACTGCNACAAACAGCATTCTTACGGACGCGTGGGTCGAAGAC 

CTCGGCCGCT 

Sequence 2319 

CCGCGGTGGCGGCCGAGGTACAGAATGGTAAAAATTCCAATCAGTCAAAAGAGGTCAATG 
AATTMAAGGCTTGCAACTTTTTCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCC 
GGGCAGGTCG AGCGGCAGCAC I I II I I II I 1 I I I M I I I II I I ATGGNTTTTATTTTTCA 
Ann i ATTTTGGTTTTCTTACAAAGGTTGACATTTTCCATAACAGGTGTAAGAGTGTTG 
AAAAAAAMTTCAAATTTTTGGGGGAGCGGGGGAAGGAGTTAATGAAACTGTATTGCACA 
ATGCTCTGATCMTCCTTCTTTTTCCGGACGCGTGGGTCGAANACCTCGGCCGCTCTAGA 
ACTAGTGGGATCCCC 
Sequence 2320 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTCTTGTCTAGT 

ATACTCAAGGCAGCCTAGTAAATTATTATTTATCTATACAATACTGGAAAAACTNGNAGA 

CAAAAACATGACTTGAATTGCTAAAAAAAAAAAAAAAAAAAANGAGGGAGAATGAAAACT 

TCCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 2321 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTNCATTTACATACAACTGA 
TCCAAACAGGAAGTAAAAGCNTTATGAAAAAAGAACATGATGCAAATCATTTCCCNNNGA 
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CAAAA6GAGGGATCTGCGTGAATTACAGCAAATCAAATGATTTCCACATTATAAAAGCAA 
GCCATGGCTTCTACAGATATTTTMTTCCTTGGGAGGAAGTTTCACCTNATCGGACGCGT 
GGGTCGAAGACCTGCCCGGGCGGC 
Sequence 2322 

CTATAGGGCGA ATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACC 

CACGCGTCCGATTTTTGTCTCCTGAGGTCAMCTGTGCCCATCTTTACTGCCCCTGTGTT 

GGTGTCATTCCTTTCAGCGCACTCCATGTTCTGGCAAAACACCCACAGCTCACGTTCTCG 

TTGTAGCCACATTACGCCTTTGCATTACTCTTTTGCCACTGACCTCACTGTCAGTCTTCT 

TGTCACGCTGTTTCCTCCAGGTGACTCTGACATCATTAGAAGCATGCCAGAACAGACTGG 

TGAAMGTAAACCTTTTCACCTACAAAATTTCACCTGCAMCCTTAMCCTGCAAAATTT 

TCCTTTAATAAAATTTGCTTGTTTTAAAAAAAAAAAAAAAAAANAAAGTGCGGCCGCTCT 

AGAACTAGnnnnnnnnnnnnnnnnnnnnnnnnnnCGATATCAAGCTTATCGATACCCGTC 

CGACCTCGAGGGGGG 

Sequence 2323 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCG 

CTGATCTGTGATTAAAGCAGTAATATTTTAAGATGGACTGGGAAAAACATNAACTCCTGA 

AGTTAGAAATAAGAATGGTTTGTAAAATCCACAGCTATATCCTGATGCTGGATGGTATTA 

ATCTTGTGTAGTCTTCAACTGGTTAGTGTGAAATAGTTCTGCCACCTCTGACGCACCACT 

GCCAATGCTGTACGTACCTGCCCG 

Sequence 2324 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCGACCCACGCGTCCGCAT 

TTTCAGGCTTCTGCAGCTGTAGATTCTCACTGTGAATCCCTTGCTTGCTCATGCATAAGT 

GTATTTGCAATACCAAATATACAGGTTTAGTATTTTGCCTGTTAGTGATTGTTTCACATG 

TGTAACGTTTTGGTTGAGATGTTAAATGGTGGACGAGTACCTGCCCG 

Sequence 2325 

CTTAGGGCGMTTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCANGGTGGCCGCACTTTT 
I I I I 1 1 1 I I I I I I I I I ACTATTCGTTAGGCTTTTATTTTTCTCTATGTTCTGCAGTAACT 
AAGGAAAATCATGGTAAATGTCAATCTTCACACAACAGCAGACACAAAGGGTTTCAGAAA 
CCGGACGCGTGGGTCGAAGCTTGTACACCT 
Sequence 2326 , 

GGTGTACAAGCTTCGACCCACGCGTCCGGTTTCTGAAACCCTTTGTGTCTGCTGTTGTGT 

GMGATTGACATTTACCATGATTTTCCTTAGTTACTGCAGAACATANAGAAAAATAAAAG 

CCTAACGAATAGTAAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCGGGCGGCCGGC 

CGCACI I I I I I I I I I I 1 1 I I I I I I I C ACTGTTT AAG G AATTTTT ATT AAAG C AAG AATTT 

TATMTCCAM1TACGTTTCCTTGCTCAGTTATCMTTCTGTTACTTAAAACAGAACTGA 

CATTCTGAGCTATTCCACAGTAAAGAATTACAAAATTAMGAAAGGAATGCTITAAATTT 

TTGTACCTGCCCG 

Sequence 2327 

CCGCGGTGGCGGCCGCAC 1 1 I I I I I I I I I I I I I I I I I CTTTCTACTnTAGGGTTTATTT 
ACTGGTAGTTTCAGGTCTGGAAATACAAAAAGGAACATGACATGGGTTCTGCCCTTGAGG 
TGCTCCCAGTCTGGTGAGGGACACAACACCTTTCAGGGGACAACCGTGGTACCT 
Sequence 2328 

TAGGGCGAATTGGAG CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACI NIMH 

M I I II I I I I II I I I I I I I I I | ANAATTTAGCCACTTNTCAGGCCTTNTTCCNNATAATT 

TGGAACTTrCCTTTGGATTTGATCMGTTGGATAGAGTTGATCAAACCCTGATCAAGTTG 

GATAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACANTAAAAAACAGACAACA 

ACAACAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATNTGATAACTGGGGTNG 

CTTTMTNGGAANGGANAAATTAAAACNNGGTrGGGCGGTrATTTTNACTTTAACCAAAA 

MNAAACCCCCAATTTTMGTTNCTTTNCTTANGGGGCGANGTTNTTGGGGTTTTAAAAA 

NGGGTTTTTTCCCTTTTTTAGNAAGGGNGGGGGGNAAAACCCCANCCCNNCCCTNANAAA 

ANCNCCNCNTTTTTTTCCNTTNNNGGGAGGGGGNGGCCAAAACNCCCNCCNNTTTNGAAA 
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AAAAAGAAAAI I I I I I 1 I I ITTTTTTAAGCCCNTTTTAACCNCCAAAAAAAANNAAGTGT 
TTTCCNN NCGNNNCTTTTNNAAMNNGNGGNACCCCCCNGGNGGGGGGGANNTTTTTTAA 
TTTTTTTNNCCCCCCCCTNGGGGGGGGGGCC 
Sequence 2329 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCACGGTACTGAGGAAATATT 

TTGTAMGTGAGC TTTGG GTATAACTTAGCCCCATCATTATTTAGAGAATAGAGGAGGAA 

GAMGAGGAAGGATTTTAMGGCAGACAATGACAGACCATTCAGGATAGGTAGGGTTTTA 

AAGGGAGCGGACGCGTGGGTCGAAGACCT 

Sequence 2330 

CCCCGCGGTGGCGGCCGAGGTACCCTAAAATTTAAAGTATAATAATAATAAAI II I I 1 1 1 

TAAAAAAAGAGTGTTGTCTTTGTCTTGTATTTTCTGCAGTTTGCATGTGATATTCTTAGG 

TATAGAI I 1 1 1 I I I AGTATTTGTCCTGTATATTGTTATTCGAGCTTCTGGGGATCTGTGT 

TTTGGT GTCTA TCATTAACTTTGGAATATTCTCAGTCATTACTGCTTCAAACATTCATTC 

TGTTGCTTTTTCTCTTCTGGTATTATCATTACACATATATCACACCTTTTGTAATTCTCC 

CACAGTTCATAGATATTCTGTTGTATTTATTTA I I II I I CTCTTTGCCTTTTAGTTTTAG 

AGATTTCTATTGACATCACTTAAAGATGATTGATGAGTTGATGAGAATTGAGAGAATTGA 

TGAGMTTGTTGATGAGMTTAnCATTTGTGTTAGTGTTTTTCATTTCTGCCA 

TTTGATTTTTAGAGCTTCCATCTCTCTGCTTACA 

Sequence 2331 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACCTTGACCGTGACCGTTTG 
CTATATTCCTTTTTCTATGAMTMTGTGAATGATAATAAAACAGCTTTGACTTGAAAAA 
AAAAAAAAAAAAAAAAGT 
Sequence 2332 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTATTTTATAGAAAAGAA 

MCAGAGGCTCAGAGAGGTTMTTTTTCCTGGATGTCCTAGATGTTAAATGTTACAACCT 

TAATTGACCGATTCCAGAATCAGAGCTATTAMCACAAAACTATTTTAATTCTCTCTAAA 

TTCTTAAAGACCCAAGAAAAACAAACTTTATTGAGATAATTAGGAAI I I I I 1 1 IAAAATA 

TCGGACGCGTGGGTCGAAGCTTGACCT 

Sequence 2333 

AGGTCAAGCTTCGACCCACGCGTCCGTGGTGAACACAGAGAAGACAGTCTTGTATATATT 

CCTCTGTATTCTGGGGAGCTTTGACCTTGGAGCTTTGTACCTGCCCG 

Sequence 2334 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGCAACAC 

CTATGTTTATMAAATTTGAAAACATTACATATTGTATTTAAAACTAATTAGNGAAGAGT 

AAGMAAAAACTAGCCAACAGAATTGTAGGTGATGCATTAGTTAAATTTCAAAACTCATA 

ATAMGGAACTTTCAGAGATTGGTTGAAACCCAGTGGTATCCCTGTAAATTAGCTCCTGT 

GACTGGAAAAGACCCCAAAMGGCAGTAGAGGAGATTAGTGTTTACTTGCTGTGGTTGTG 

GTGTGCTGCTACTTAATTATAGGTAGTGACACACTGAAATTCTTATTTGTCCAATAATCT 

GAAGTAGTTTCCTATATTTATCTGTACCTGCCCGGGCGGCCGGCCGCAC I I I I 1 1 f I I I I 

TTTTTTGCNGGGGN I I I I I I CTTTTC I I I I I I I I ICA 

Sequence 2335 

GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCACTNNNTTTTTTNI I I I I I I I I I I CTT 
tAAACACCANTTAGTTTATTTCAGGACMGAATTTACCATNTMCANTCTTTTNACATM 
ATTCTGNCTCCCCCCAC I 111 lil I Ml I GAAN ATAACCATTCC \ ITTTTm 
Sequence 2336 

CNAANNGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATCTNTATCAGAGCTTTT 

GGGTGACCAGGCACACTGTCMTGAGCAGTAATACGGGGAAAGGAATCTTTGGGGTTTTT 

TTTGGTTTGGTTTTTGTTTTGTCGTTATTTTTT^ Mill GTTTT 

TTTAGCAGTAGGCCTCAACAGTGGACTTAAAATACTCAGTAAACCATGCTGTTAACAGAT 

MGCTGTCATCCAGACTTTGTTGTTCCATTTCTAGAGCACAGAGCAGATTTAGCAGAATT 

CTTAAGGCTTTAGGATTTTCAGAATGGTAAATGAGCACTGGCTTCAACTTAGTCACCAGC 
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TGCATTGGCCCCTAACAAGACAGTCAGCCTGGCCTTTGAAGCGTTGCAGCC 
Sequence 2337 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTAAACTGCACATATTTAA 

AACATATAATTTGATACATTTTGACTCACAAAACAATCACCACAATCAAGANGATGAGNN 

TATAGATCACTCCCAMAGTTTCCCTGTAGTCTTTTGCAGTCCTTTCnTCATGGCCTTCT 

TCATCCATCCACCCCATCTCGGTAACCAATGATCTGCTTTCTGTCACCACAAATTAGTGA 

GCACTGTCTAGAATTTTATGTAAACTGMTMTAAAGATTTTACTCTTTCG 

Sequence 2338 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTC 
CGCACI I I I I I I I I I I I I I 1 I I I I I I I 1 1 I 1 1 I I I I I I INGGAATCATCATTAACTTTAT 
TTGNCACTNTTGATAGACATTGGTCCACTCCAACATAAAAAGNAGAATTCACCCACTTCC 
ACTTAATATTCTATAGAATGAAGTTGTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGT 
Sequence 2339 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAAGAI I I I I I IACACAA 
AGAMCTTAATGCTGTATTAATAAATTCAGTGTGTAGCTTCAATTGGGATAGTTCCAAAA 
GTGAAGATTTTGTGAGGAATAAGTGCAAA I I I I I I I I 1 I ATTTTAAAAAATTCTTTG AAA 
CTCTTAAGTCTTTGTGTCTGCAATAMATTGTACCTGCCCGGGCGGCCGGCCGCACTTTT 
I 1 1 I 1 1 I I I ! I I I I I I I M I 1 1 I I NGAMNGTTTGAAGTTMCTCATTTTATTTNTAGGA 
TTNGGATTTCMCATTTTMTTTNTTTGGMTATAAGTCANTTTTTGCAAGCTA 
AGAATCAAA 
Sequence 2340 

TTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGACTTGATTCATAACCAACCA 
ACCCAGGCTGTCATGCTAAAAAAAAGGGCAGAATCTTTGCTGCTGAACTACTGCATAGGG 
TGACTACCATTGCTCCTTTAGTTTAGTTTGGCTAGCAGAAGGTGGTCTTGCCATGTAAAT 
A AAGCCTCTC AGGTAATCAAAATGTTTC I Mill 1 ACTTTTGCTGGTG I Hill 1CTTTT 
CI I I 1 1 I I I I CAGCTACAGGAATTTAGCCAATTCAGAGGAAATCTTCCCCATAATTATGG 
MCTTTCTTACAGATTTTACCAAGTCTGGTCAACCCAATAAGAAAAAGACTGAAATAACA 
ACAACAACTTCAACAAATAAAAAAAAACAGTTAAGCTAAATAAACAGATG 
Sequence 2341 

AGGTACTTCTTACATAGTGATTGATGTCTCATGTCTCCCTAAAATGTATAAAACCAAGCT 

GTGGCCGGACCACCTTGGGCACATGTCATCAGGACTTCTTGAGGCTATGTCACTGGGCAT 

GfCTTCAACCTTGGCMAATAAACTTTCTAAATTAATTGAGACCTGTCTCAMTTTTGGG 

GGTTCACAGGTGAGTGGGCTCAGGCATGTGCACTAGTATGACTAAAGGTCATAGACTATT 

AGACTATTAGTCTATGACCTTCCTCTAGAAACACTCGACTGGTAAGGGAAGAATGCCTCA 

ACTGAGCATGTGCACAACTCCCATAAACACACTTGTGCTTGCGGAGCCTNTCAAGTGCTG 

GCAGGCCACTGCTCAGGTGGATTCTTCCCTCCTACCCGGAGGGAAGAATCAGGGGAGAAG 

GGACACAAGCCCCTGAATGCATGCCAACACGTAAA 

Sequence 2342 

CCGCGGTGGCGGCCGAGGTACCTTCAGGTATTGCCTATTTAATGATCATATATACTTGCA 
TMTATCATCCCTTCCCTTGATTTCTTTCAATCTAAAAATAAATATGAGAAAAACAAAAA 
AAAAAAAAAAAAAGT 
Sequence 2343 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGACTCAAGACC 

AGATGGCCAACTAGAAGCAGCCAGGAAGAACATCTCTCATGGAGAGACCAGGAATTTGGG 

AAGACTGGCACACTCTGAGCAGATCT7TTGAAGGAAAACATTGAGGGTGGATGAAGGGAG 

GATGCAGAGCATGGGCTGAGGGGGCAGGAATCTCAGAAGCCTGCACAGGGCTTCCAAGCA 

CTAGCATTCCTTACTAGCCCCCATTAACTCCTGGGGAAGGGGTGAGTTGAATAGGTGGNG 

GAGTGGCCCGCTCTTACCATGAAACTCCAGAATCCTAGCAGCAAGAGACCCCATGACCCC 

TGTGGACACGAGCTGTCCGGACGCGTGGGTCGAAGACCTGCCCG 
Sequence 2344 

CCGCGGTGGCGGCCGCTNNCTGCCCGGGCGGCCGGCCGCACI I I I I I I 1 1 I IC1 I I I I GG 
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AAMATNTNNNGACTCTGGGGATAAMNTNCNAATTAAATNNATNCNANNTTTTAAAGGC 

TATTMNNANAANAATATTNGCTAAATTNNCCTTNTGCATAACAAACTGTGGNTNTACCA 

TGTAANGTTTAAAAAAATGTNTAACCNCAATTTTACGCTCCTCTGTNACANGACAAGGAC 

TCCATTCANTGNCATTTAAGAACTNAATGGGTTGAN 

Sequence 2345 

AGGTCAAGCTTCGACCCACGCGTCCGATAAGCCAAAAAATGGGAGACAATTTCACATGGA 
CTTTGGAAAATATTTTTTTCCTTTGCATTCATCTCT^ 

AACCGAACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAGTG 
CGGCCGGCCGCACTTTGTTTCTACTGGGTTTAGACCGTCGTGAGACAGGTTAGTTTTACC 
CTACTGATGATGTGTTGTTGCCATGGTAATCCTGCTCAGTACCT 
Sequence 2346 

CCGGGCAGGTACCTGCTCCATTTCTCCTGCAACATGTGGATACAGTAATATGATCATACC 

CTCCCTTGTTCCCCTCTAGGCCACTTTCCCCTTTAAATATTAACACCATCATAATCATCT 

TTGGAGAAAGACACCTGGATCTGTCCTGAACCTTGGCAAAAAATAAATAAATAAATAAAA 

ATAAAAMCTCTTCTCAATTGATTAATACCTGTCACAGATACATTTTGGTTTACAAATCA 

ATGAACAATGGAGGGAACTCTGTCCTTAATCTTGGTACCT 

Sequence 2347 

CCGCGGTGGCGGCCGAGGTCTCTTGTTTGNTATTACACTTCTACGTAGATTATATAATAT 

TGCTTGTGGACATAATTTGATCAATAATATATAATGTCAGTCACTACAGTGATCCAGAAT 

CTTATTCTGGCTATGGAGGAAGCTTMTTATTAAGCAACATCTTCCTAAAAAGCTTNTGA 

ATTTCTGATTCATAAGAAAAACAAAACAAATGAAAAGAGTATCTNTAACTGAAATAACAC 

TGAAGTTCGAGCTTGGGCCCCCTCCTTTGTGTTCAACATAATTAACTNTTCAAGATGAAA 

CCGGACGCGTG GGTCG AAGCTTGACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTGTTTN 

TTCT GGGATATCTTTTTCTTCTGGGCMCCTCCTCTTCTGGTTTAGGAACAATCTGTTCC 

TTTTCCGNAAGGATCATCTCAATGTGGCAGGGAGAGCTCAT 

Sequence 2348 

ACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACCCACGC 

GTCCGGGCAATTATCAAAMCACTTGGAAAMGATTTTTATTCCTACTTTTAAACATACA 

TCAAAATCTAAAATAAAACTAGGCACCTTCAGCTGGGCCTGGTGGCTCATGCCTGTAATC 

CCAGCACTTTGGGAGGCTGAAGTGGGCAGATCACTGGATGTCAGAAGTTCGAGACCAGCC 

TGGCCTACATGGCGAAACCCCCTNTCTACTAAAAATACAAAAATTAGCCGGGCTTGGTGG 

TGGGGACCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAATCACTTGAACCTGGG 

AGGNGGAGGTTGCAGTTAGCTGAGATCACACCACTGCACTNCAGCCTGGGCCACAGAGCA 

GGACTCCATCTTAGAACAAAAAACAAACAAACAAACAAACCTCATGCACCTTCAAGAAAA 

TCAAACAAGTTTTTATCTMTTAAAGAAAGAATTTT 

Sequence 2349 

AACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGC 

GTNCGCTTTMCACACACTAGGCTCTTTGTGTATTATGATNCAGTGCTATTTGTAACTGT 

GTCCCAGNGACCAAATTGCACTCGACTCGATCAGCTGTTCATCCATTTCGTG I I M I I CC 

TGTCAMCATTMTCCAGCAAATATATGAGGTATTTACCMTTTATTTTCTTAGTATTAC 

AAAATAATTCATTAGCATAAAGTACCTGCCCG 

Sequence 2350 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAAATCATAGAGC 

TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG 

AAAMTCT AATT TMGTTGTTAATACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 

ACAMT TCTTT TATACAAATGTCGAAAATGCTTTCAACAAACCTAAGTGTCCTAATTACA 

TGCCACTTTTAAGCATCACTTTAAGGTMACAAAMTGAAAACCATMTTTTAAATTAAA 

ATTTGCGGACGCGGGGGGTCGACTCAAGACCTCGGCCGCTCTAGAACTAG 

Sequence 2351 

CCGCGGTGGCGGCCGCCCGGGCAGGTGCTTCGACCCACGCGTCCGGATGGCTTGGGTCAT 
CAGGACGTCCATTACATCCAAAGGAAGACAGCCTGTGACGTTTCAAAAGCAAAAGTCCCC 
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TACCAGCCAGTGAAGCTACCTGATTTCTCAGTATCTTACGCCCAGNGACACGATCTACCC 
TCAAAACTTAAAAAAAAAAAAAAGGGAAACATAAACACATAACAGCAGACCTN 
Sequence 2352 

GGGCCGNGGNCACAACATTCCCCCTTCCCCAAACAGTAATATGGACACTGATTTAACANG 

ACTTATAAAAAAATMGGCNCATTTATTTTGATNTGGTAATTTTAAAATAGAAACCCCTT 

C 

Sequence 2353 

GCGNGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTNC 

GAGCTTGAGTCGACCCACGCGTCCGTGAAAATGTTGTCTCCTTTCTAAATTCTCTGCCGA 

TTTGGGAAAMGCAMCTTGACTTTACCCCGAGGMTTGGTGAAAMTTACT7TTACGCC 

TTGCAGCTGTGGAACTTGGTCTTACAGCCTCTGCTCTTCTGCCCAAACGGGCCATGCAGT 

TTGGATCAAGAATTGCAAAAATGGAAAAAATTAATGAAAAGGCATCTGATAAATGTGGAC 

GGCTCCAMTCATGTCCTTAGAAAATCTTTCTATTGAAAAGGAGACTAAATTGTAATGTG 

ATTCACAATGTAACAATATAAAMTMGTTTTTATATAATTATTAAAAGGNAGATACTCT 

GGTGCTTTACTATTGGATAAMTMGTAAACCTGCCCGGGCGGGCCGGCCGACTTTTTTT 

TTTTT 

Sequence 2354 

TCCCCGCGGTGGCGGCCGAGGTTTCAAAGACCNGCCTGGCCAACATGGTGAAACCCCATC 

TCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACT 

CAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAGAGGTTGCAGTGAGTCGAGGT 

TGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAACC 

TACAGCTGTTCAAGGACCAGCTGCAGGGNCAAGNGGGGGCCTTTTTTGGTCTTTGAACAC 

ATCATAGAAAGNGGNCAAATGCTGCAAAGCCATGAAGAACATGAACCTTTAACCGGGTAG 

ACTAACTGCC CCACTTANACNC I I I I 11 I I GCCCCCAAAAACAAGNTTGATTTTGCCCCT 

nTTTTm rTNTCCATTGGGGGGAAGTTTTGGAAAAANAAAANGGGGCCNCCCCCCCCT 

IIMIIII I NNNNCNAAAAANCC I I I I I I I I GGTTTTTTTAAAAAAAAAAA 

Sequence 2355 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGGCCAGAGACTTC 

AAGTC TATC TGAAAAGTCTCCAGAGGTCTAACCCCAGATAAATAGCCAACAGGGTGTAGA 

GTACATTTTACACCCCAAAGAGTGTGCCCCATGGTGATGAAAATAAAGTGAACATGTTGC 

AAAATGAAAAAAAAAACCT 

Sequence 2356 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCCAACACAAACTATTCAATAAAGTAATCTGCT 

TTAAAAATAAAACACACTGAAAGGCCGAGGCAGGTGGATCACCTGACATCATTAGTTCAA 

GACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGACTAAAAATACAAACATTAGCTGGG 

CGTGGTGGCAGGCGCCTCTAATTCCAGCTACTCAGGAGGATGAGGCAGGAGAATCACTTG 

AAGCAAGGAGGTGGAAGTTGCAGTGAGCTGAGATCGTGCCATTGCACTGCAGCCTGGGCA 

ACAGAGTGAGACTCCGTCTCAAAAACACCACCACCAACAAAATAAACACAACAGAATTAT 

TCTGCAAATACAGATATTGGAGTAGCTGAGTTNCATCTCAAATTTGACTATGCAGGTTGC 

AGGGTGATCTTGGCCAACTACTTATTCTTTTNTGAAGTTCAA CI I I 111 

Sequence 2357 

CCGCGGTGGCGGCCCGAGCAAGTGGGCCTGTAGCCCGACTCTTAATCCAGGTTGGTGCTA 

TTCMAGAGATCATCTTTCACCCGAGGGATTTCTGGGCATCTATTTTGCGGATCAGAAAG 

TAGAGAAAGAAGGTAACTTTGCTGAAAGCTAGTCTGGGGAGTTAGTAGCTGATACAGATC 

AGCATTTCCTMCTATGAGATTTCATAATATTCTCTCTTGTCTCGATTCTGAGTCACTGG 

TGCCTGCTGTGGTGGCATTGTTCATGAACATGTACCTGCCCG 

Sequence 2358 

AGGTCATGTGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCTGGTGCGC 
TGCACCCACTAACTCGTCATCTAGCATTAGGTATATCTCCCAATGCTATCCCTCCCGCCT 
CCTCCCACCCCACAACAGTCCCCAGAGGTGATATTCCCCTTCCTGTGTCCATGTGTTCTC 
ATTGTTCAATTCCCACCTATGAGTGAGAATATGCAGTGTTTGG I MM I GTTCTTGCGAT 
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AGTTTACTGAGAATGATGATTTCCAATTTCATCCACGTCCCTACAAAGGACATGAACTTG 

AGAATTCTTAATGCAGTGCTTTAATACAGTAAAAA I I I I I AGTCTTTGTTTTCTACAAAA 

TGCATTTGAAAAGTGCACTCTTGATCTTGNATTTTCTTTCCTTTCCTTTAGAATCATATT 

AGCAGTTGGACGTATATATAAAATATTAAGTGTACCTGCCCGGGCCGGCCGGTCTAGAAC 

TAGGTGGGATC 

Sequence 2359 

AGGTGTACAAGCTTCGACCCACGCGTCCGAGCAGAACTTGGCAGCAACAGAGGAAGGGCC 

CCTGGAGCCGGCTGTCGTGGATGCCTTTAATCAAGCCTGGCATTTGGTTGCTCACGAATG 

TCCCAACTCTTCCGCTAGGCCCATCATGCTCAGGCTGCCCAAGGCTTTTCTGTCACCTCT 

TTTGTTCTCTCACACTGACCAGTCTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTAC 

CMGTGAATTTAAATAATTGGTGTGGATTGGCCAGTAGCTAAGTGGGCTTTTAAAGAGTA 

TTGAAGATTGAAAGGGTT1TTCTTTCTTTTTTAAAAAAGA 

GATAATGAAAAGCTAGGGTTfGCCCTCTTCATGTCTACTCTCCTTCCAAATAGTTATATC 
CAAAACTG 1 1 I I I I CCTNTCCCCTACCTTGTCC 
Sequence 2360 

CGAGGTACACAGCTATGCACTTTCCGTTTCTGACTTTTGCCACCCTGTCAGCCATGGGGA 

GCCCACTGTGGGACTGAAACCCTGAGCTGAATGCGGCCTCATGTCTCAGAGAAACACTGG 

CAAGTTGGTCAGAGCCGCGTCTGCATCGAGGCGTAGCTGANCGGCAGGATGGGGGGCTGC 

CTGCCCAGGGTCTCTCACCGTGGTGTAAGCAGAGCCATGGCTNGCCTAGGACCCTATAGA 

TACCATCACTCTTTCTCAGCTCGACTGGAGTTTGCACCTTTGCAGGGGCAAAGTAACTCC 

CTGCACCCTGAACCACCCCCCATTCCTGTTCATTTCAGCAGATAATGATGGAGGGGGGGG 

Sequence 2361 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCG 

TCCGCTATCATATTTCAAATCCTAATGGGAGACAAAACAAAAGTGATGGTCAGTATTTCT 

TTGAAATTCTACAAGGAACTTCAGGCACATGGTGCTGATGAGTTATTAAAGAGGGTGTAC 

GGGAGTTTTTGTAAATCCAGAATCAGGATACAATGTCTCTTTGCTATATGACCTTGAAAA 

TCTTCCGGCATCCMGGATTCCATTGTGCATCMGCTGGCATGTTGAAGCGAAATTGTTT 

TGCCTCTGTCTTTGAAAAATACTTCCAATTCCAAGAAGAGGGCAAGGAAGGAGAGAACAG 

GGCAGTTATCCATTATAGGGATGATGAGACCATGTATGTTGAGTCTAAAAAGGACAGAGT 

CACAGTAGTCTTCAGCACAGTGTTTAAGGATGACGACGATGTGGTCATTGGAAAGGTGTT 

CATGCAGGAGTTCAAAGAAGGACGCAGAGCCAGCCACACAGCCCCACANGTCCTNTTTAG 

CCACACCTGCCCCGGGGGGCCGNTTGAGGGCCNCCCGGCAGGGNNNGTNNTAACANNGTA 

ACATGTATTATGGNAACCTTTAATCTTGGGGGNCTTANACAGGCTNNTNCCAANGGNTTA 

ANNAAAGNTTGNTTTNAAAAAAAAAAAMGGGATGGGTNTTTTTNAAAAAAAAAAAAAAA 

Sequence 2362 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGCCTCACATACACAGATGCAGGTGAAGTCACC 

AAAGCTGATCTCTCATTCGTTCTGGGGACAGTTAGCAGCGTAGTGGTCCCACTGCAGCAA 

MGTTTGAAATTCATTTTCTTCAGGAAAATACCCAGCCAGTCTCTCAGTGGAAACCCTGG 

TTATGTCGTGGGGCTCCCATTAGCTGCTGGATTCCAGCCTCATAAGGGTGGAGCTCTCCC 

GTGTCAGC TCGT AGCACAGAAGGTGAAGAGCCTGCTGTGGGGCCAGTGCTTCCAGATTAC 

GTGGCCCCTTTTGGAAATTCCCAGGCCCAGGACATGCTGGACTGGGTGCCCATCCACTTC 

ATCACCCAGTCATTCAACAGGAAGGATTCCTGCCAGCTCCCAGGGGCTTTGGTTATAGAA 

GTGAAGTGGACTAAATACGGATCCCTGCTGAATCCACAGGCCAAAATAGTCAATGTAACT 

GCAAATCTMTTTCATCCTTCTTTNCTGAGGCCAACTCAGGAAATGAAAGGGCGATTCTT 

ATTTNCACTGGNGGTACNTTTGNGGATGTGTCTGCACCTGCAGANGCAGGCTTTAAAGCT 

TCANCAGNCATTAATGGCANGCTGGCCNTTAACTTT 

Sequence 2363 

TAGGGCGAATTGGA GCTCCCCGCGGTGGCGGCCCGCACTTTN I I I I I I 1 I I » I I I I I I I I 
M M M I I M I M I U I H M I 1 1 II I I I 1 1 ) I II 1 I I I I I I I I I I I I I I I ) I I I II I M 
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I N I! I I I I I I I I I I I I I I I I II I ! I I I I I I I I I 1 1 I NNAAAAAAAANTTTNAANGGGGG 

AAAAAMNNGGGNAAAAANTTNNGGGGNNGGGNTTNAAAAAAAAAAAANCCCCNNNNCNN 

AAAAAAAAANNGGNCCCCCCGGGGGGGGNNAANNTNNAAAAAANTTTNTNCCCCCCCCCC 

CCCCGGGGGGGGGNCCCCCCCCCNNNTTTTTTNNNNNTTNANNGGGGGGAAAANNNNCCC 

NGGGGGAAAAAAAAANNAAAAAATT1TTTCCGGGGGAAAAAANNTTTCCCCCCCAAAAAA 

TCCCCAAAAAAAANNGGNNGGGGGNAAAAAAAAANAAAACNGGGGGGNCCCAAAGGGGGG 

CCA NCCN AMNTTTGGGGGGGGNNNNNACGCCCCTTTTTAAANNNANAAAACCNTNNGCC 

CCNTTTTAAAAAAAAACCCCCCCCCCCCGGGGGNGGGGGGTTTCCNAAAGGGCCCCCTTC 

CCCTTTCAAAAAAAAAAMAMGGGGGAGNGNNNNGGGGGGGGGGGGGGGGTTTTTTTCC 

AANANAAGGGGGGGNTNTANTNTTACCAAAAAAAANGGGGGMCCCNCAAAAAAAATTTT 

TNNAAAAAGGCCCNAANGGCCCAANCCNAAAAAAGGGCNGGGGGNGGGGGTTTAANAAAA 

GNCCCCCCCCCCGNGANAAAAAAAA 

Sequence 2364 

CCGCGGTGGCGGCCCGAGGTCTTCGACCCACGCGTCCGCACTGAGTGTTCACCATAATTT 

GAGATTCTTGGCATGTTMCTTTTCATTATGGMTATTGAATMTTTCAATATTATTCAT 

ACATTTCTTTATGTTCAAACATACACAAAAATAGAATAATGAACTCTACCCATCACCCAG 

CTG CMCA AATATCAATACTTTACCGTTCTTMTACATCTMCCCCTTACTTTTTGTTTG 

TTTCTTTTGGTG AAGTATTTAATTGTAA I 1 1 I I I 1 1 I AAG AGACAGGATCTCACTCTGTC 

ACCCAGGCCAGAGTGCAGTGGTACCTGCCCG , 

Sequence 2365 

CGCACI I I I I I I I I I I I I I I I I I I AATAACAAACACTTATCCAACACTTAGTATGTGGCA 
GGCACTGTTTCAAGCACTTTACACATACAAACTCATCCCGGACGCNTGGGTNNAAGCTNG 
TNCACCNA 
Sequence 2366 

CCGCGGTGGCGGCCCGAGGTCMGCTTCGACCCACGCGTCCGGATTATTTCAGCTCTTGA 

CCTGTCCCCTCTGGCTGCCTCTGAGTCTGAATCTCCCAAAGAGAGAAACCAATTTGTAAG 

AGGACTGGATTGCAGAAGACTCGGGGACMCATTTGTCCAAGATCTTAAATGTTATATTG 

ATAACCATGCTCAGCAATGAGCTATTAGATTCATTTTGGGAMTCTCCATMTTTCAATT 

TGTA MCTT TGTTAAGACCTGTCTACATTGTTATATGTGTGTGACTTGAGTAATGTTATC 

AACGTTTTTGTAAATATTTACTATGTTTTTCT^ 

TGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCTAATAAAGGCAGCAAAATGCATTAATC 

CACTATGAATGGAGTTTTACATTTTAATTTATGCCTMTATTTATAMAGMTTTCATTC 

ATAGGCTACTCACAGTTGTTATCTGACGCTTACAGAAGTGGTAAACAACCAATTGCTAGT 

TCAAGTAGTTTCCTCATGACATCTAATGGTAAGCAAAAATTAGTATGCATATTTCAACAT 

CCCAGTNACCAATCTTTTTAAAATGGA 

Sequence 2367 

CGAGGTGTCAAGCTTCGACCCACGCGTCCCGACTTTTTGTCTTAGACCCAGTTAGGGTCA 

CCTTACAGTGCAGGTGGAAAGAAAGCAGGACTGCTGAGAGGAGCTCAGGACCCATTTTCC 

AGGACTATTGCTTCTCAAAACTTTGGAGAGCAGGAAAATAGATTCCCAAGTGAAAGAGGT 

GGCAGAANTAAAAAAAAAAAAAAAAAAAAAGT 

Sequence 2368 

CGCCCGGCAGGTACACAGTTCTGACTGCMTACCTTTTTCAGACTGCAAAGGGAGCTCAG 

GATCCAGAAGTCATTAAAAGAACGGACGCGTGGGTCGAAGCTTGTACCT 

Sequence 2369 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTCAAGCTTCGACCCACGCGT 

CCGCAGAAATACTGAAAGACTTTTGCCTAAAGTGGCATTATTGACTGCTGGTGTGATGCT 

ACTGTAATGTGATAAATTATTAAATTGTTGCAAAGTGCAAAAAAAAAAAAAAAAAAAAGA 

CCTGCCGCGGC 

Sequence 2370 

CGCACTTTT I 1 I I I M I I II I I I I I I II I II t I I I I II I AAAAAAAGCCTCATTATCCTG 
TAGTCCATTTTGGAAAGTAAAGCCCAAGAAAGCAAAAGATGAAGGTTCTAAAGCTAGTTT 



• 
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GACTGACCTCAGAGTCCTCGGCCGCTCTAG 
Sequence 2371 

TTCGTGATAACTTCTCCTAAGTGCCAGGCATTGTATTACATGCTGNGAGCACANAGATGA 

ATAATANCAATAGGTTCACANAAAAGATGAATTGATTGAGAGAAAAAGA 

Sequence 2372 

CGAGGTACCCAAAAGATATCACTGTGAAGGTTTGGATACACTGACTGAGGAAAAAAGAAG 

GTCCTGAAAGCGTCTAGACAAAAAAAGACTACTTGTAAGTTGCAAGAATCAGAATGGCAT 

TGGACTTCTCAGCTTTCCTCATTAGMGTTAGATCTGAAGCAATCTTTAAACTCGTGAGG 

AAAAT TAAG TCTAATAMTAATTTTCTTCTTAGCCAAACAATCAAATGTGAAGCTAGAAT 

AAGCATTTTCAGGTAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCCGCCCGGGCAGGTACA 

TAATATACAGAGGTATAATCTGTAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGA 

AAAGGAGTAGMTGCTTGTATGTGACTAAAATTATGTTGGTATCAGTTTAAAATATATTA 

TTATAACTTTAGAATGCTATACCCATTCCCACAGTAATTCCCATAGTAACCAAAAAGAAA 

ATATCTGTAGGATCACACAAAAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAA 

AAAAAAA 

Sequence 2373 

CGAGGTGTACAAGCTTCGACCCACGCGTCCGAAGAAGGCTCTCCTCTGTTCCAGGAGAAG 

GAAGGGACAGATGAGAAGTCACTTCAAGTTCCCAGAATACTCAGGAGCTGAACTTGTCAA 

GGTTTAGATGTGGCAAAGCAGGCCAGGCATGGTQACTCATGCATGTAATCCCAGCATTTT 

GGGAGGCCAAGGCAGGAGGATCACTTGAGCCCAGGAGTTTGAAAGCAGCCCGGGCAACAT 

AGTGAAACCTCATCTCTAGAAAAAATACAAAAAAATTAGCCAGGCGTGTTGGTGTACCTG 

CCCG 

Sequence 2374 , 

CGCGGTGGCGGCCCGAGGTACATCTGCAAGCTTTAAAGCAGTAGGTTCAGACTTCCTGGA 

AGAACTGACACTTGAAGCTGACTAGGGNCTACTTGANCNCATACTCACTTTGGCTAAGCC 

ACAGTATGAGGGAAGGGTGTCAGGAATMCNCTTCCATTTTTATNTTGTTCATTCCCGAA 

MTCCAACAGGAGATTCTTTTCACTCCCTAAAATNAACTGNTCTGTGTATAAAGCATATC 

TGGATATCTTGATCTTAAATGGAAATGGTATNTGAAAANNGCNCCNACTTTTCTAAAACT 

TTAAAATTGGCCCTTTTTTATTTTTAGCCCTGGGGGAGGAGGGAGGGAATGATTCCCAAA 

AAACTGACTGGTTTCCTTGNTGCAGTGTATTAATTACTGGTGATT AI Mil I GGGGGGGG 

NAATTTAGCNGGAAAAAA I I I I I I I I I CAGGGTTAAAACGCGCCTNNTAAAAANTTGGGA 

AATGGNCGNAAAAGGGGGCCC I I 1 1 I I I CCCCCGCCGNTTTGGGGGNAACCCCCGCCCGG 

TTTGGAANGGTNMCNCCCTTTNAAAAAANAAAAAACTTGNTTTCAAGG I I 1 1 I I I I ICC 

MMCCGAAAMTTTGGGMGGTCTTTTTATATAAANCTTGGAAACCCGCCCCCGGGGTC 

AAAACCCCCCGNNNGGCCGTTTTAAAAAAATGGTACCCCCGCCNTCCGGNTTTCANAATT 

AAACTTTTTTNCCCCCNNCNCTGGGGGGGGCC 

Sequence 2375 

CTACTATAGGGCGAATTGGGAGCTCCCCGCGGTGGGCGGCCCGAGGTCTTCGACCCACGC 

GTCCCGAGAATAGCTACTGAAGTCCTAAAGAGCAAGCCTAACTCAAGCCATTGGCACACA 

GGCATTAGACAGAAAGCTGGAAGTTGAAATGGTGGAGTCCAACTTGCCTGGCCAGCTTAA 

TGGTTCTGTCTGGTMCGTTTTTATCCATGGATGACTTGCTTGGGTAAGGACATGAAGAC 

A GTTC CTGTCATACCTTTTAAAGGTATGGAGAGTCGGCTTGACTACACTGTGTGGAGCAA 

GTTTTAAAGAAGCAAAGGACTCAGAATTCATGATTGAAGAAATGCAGGCAGACCTGTTAT 

CCTAAACTAGGGTTTTTACCTGCCCGGGCGGCCGGCCGCCCGGGCAGGTACCCAACACAA 

ACTATTCAATAAAGTAATCTGCTTTAAAAATAAAACACACTGAAAGGCCCAGGCAGGTGG 

ATCACCTGACATCATTAGTTCAAGACCAGTGTGGCCAAACTGGTGAAAATTAGTCTCGAC 

TAAAAATACAAACATTAGCTGGGCGTGGTGGCAGGCGCCTNTAATTCACTACTCAGGAGG 

ATGAGGCAGGAGAATCACTTGAAGCCAGGAGGTGGAAATTGCAGTGAGCTNAAATCGTCC 

ATTGCACTTGAACCTGGCAACAAAATGGGGACTCCGTTNAAAAACCCCCCCCCA 

Sequence 2376 

CCGCGGTGGCGGCCGAGGTATCACGAGCGGCCGGCCGCCCGGGCAGGTCAATCATAGAGC 
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TGCCCCAACATCTAGACAGTCTCTCCTACTGATTATAAATGAGTGAAAACTATCAGTTAG 

AAAMTCTMTTTMGTTGTTAATACATGTTTCTTTGGTGAGCACCTGGATATATTTATC 

ACAAATTCTTTTATACAAATGTCGAAAATGGCTTTCAACAAACCTAAGTGTCCTAATTAC 

ATGCCACTTTTMGCATCNNTTTAAGGGTAACCAAAMTGGAAACCCTTATTTTNAATTA 

AAATTTNGGGNCCCGGGGTTTTANTTAMNNACT^ 

CCCCGNGNGGGGGNAAATTTTTTTATATAAA I I I I 1 I I I I NCCCCCCCCCCCCCTGGGGG 
GGGNGCCCCCCCCCCCCNI I I I I I I I 1 1 I I I I I I I I I NNGGGGGGANNAAACTCNNCNCG 
GGGGATAMNANANGGATMTNTTTTTTTNNTTNGGAAATN 1 1 I I 1 1 1 I I I N 
Sequence 2377 

GCGAATTGGAGCTCNCCGCGGTGGCGGCCGCAC I I I I I I I » I I I 1 1 I I I I I I 1 1 1 I AAAG 
CACTGGTGCTCTTCTTAAGTGAAATGTTAACACAGGGCTCAATACATAAAAGAGAAAGTG 
MGTTGTTCCATTTGGGGGGTCCCATMGGGGCCTCATGTTTCCCTAGGTGTTACCCCTT 
CAGACACAGCACATGCCTACAAAGNGGACNCGTGGGGTCGAAGCACCTGCCCG 
Sequence 2378 

CTATAGGGCGAATTGGAGCTCCCCGCGGCGGCGGCCGAGGTCAAGCTTCGACCCACGCGT 

CCGGTCTATTTTGATTNTGGGGGGTNATCAGCATTATTCTTCAGMGGGGACCTGTTTTC 

TTCAAGGGAAGAAACACTCTTATTCCCAAACTACAGAATAATGTGTNAAACATGCTAAAT 

A GTTC TATCAGGAAAACAAANCACTGTNTTTATCTCCGNAGGCTATTTGNTCAGAGAGGC 

CTTTTGNTTAMTATAMTGTTTAAATATAMTGTTTGTCTGGATTGGCTATAACATGTC 

TTTCAGCATTAGGCTTTTMGAAACACAGGGTNTTTGTATTCTTTACTAAAGATATCAGA 

GCTNTTAATGTTGNTTANATGAGGGNGANTGTNAAGTACCTGCCCGGGCGGCCGGCCGCC 

CGGNCAGGTCTTNGACCCACGCGTNCGGGCNATTATCAAAAACACTTGGAAMAGATTTT 

TATTCCTACTTTTAAACATACATCAAAATCTAAAATAAAACTA 

Sequence 2379 

CCGGGCAGGTCTTCGACCCACGCGTCCGATTGAAGCCTCTCTGAAGTTAAACCCAACTAT 

GTTTATTAAAATGTGTGAAACTGAAAGTGGGCTAGGTTCTACCAAGGCTGTGGAACTCTC 

CTACGAGTTCTGCTGATCAGGAAATTTAAGAATTTATCTTAAAAATGCAAGGAAAAAAGA 

CTGCCTTGGCAATTGTGAATGCTGCTTTCAATCTCCTAGCACCGAGCCTGGCACTTAGGC 

AGCTTTCAGTAAGTGGGTGAATGAATGACTGAATGAATGAATGAATGGCTCAGCTGAGGA 

ATGTAACTTTGGTCAAGACCT 

Sequence 2380 

CCCCGGCGGGTTGGGCCGGGCCCCGCCCCCGGGGCCCGGGTTACCATTAATTATTACCAG 
GAGGGTTNTTAATTCTGGTTAACCATTCAATTAATTGNTNAAAAGGTGGGGGGNAANGGG 
CCAAAGGGTGGGGAA 
Sequence 2381 

TGGGAGCTCCACCGCGGGTGGGGNCCCGGGGGGNCCCCNCCCGGGNNNNAAANAAANGGG 
GGGNNTTTTTTNNNNNAAAMCCCCCTTTTTTNNNNNNCCCNN I I I I M I I NGGGAGTTG 
GGGCCCCCTTGGAAAAGTTTGGGAACCCCCCCCCAAN I 1 1 I I CCCTTTAATTGGGAAAAT 
TTTTGGGTTTTTAAMTTTTTAAAAAGGGGGTTNGGGCCCCCCMAA 
MTTTTTAAAATTTTAAAATTTTAAAAAAAGGGGCC I I I I IGGGGGNI 1 II I M I I I 1 1 1 
TTTGGGGAAAAAAAA 
Sequence 2382 

AAACTTTTTATTAAMTGCTTANGANACAGATTG 

CTTTTCCTGAAGNAGGACATATATGCACTCTGATAAAACTGCATTACAGCCTGCAGGACA 
CCTT GGGCC AGCTTGGTTTTACTCTAGATTTCACTGGCGTCCCACCCCACTTCTTCCACC 
CCACi I I I l CCTTCACCAACATGCAAGTTCTTTCCTTCCCTGCCAGCCAGATAGATAGAC 
ACGGACGCGTGGGTCNAAGCTTGTACCTGCCCGGGCGGNCGCTCTAGAACTAGAGGATCC 
C 

Sequence 2383 

AGGTGCGGCCGGCCGCCCGGGCAGGTACAAGCTTCGACCCACGCGTCCGCACAAACATTT 
TTTCAATGTAGCAAAATCAAACTTAAAAAAAAAAAAAGAAGAAAAGAAGAAGATGCCGAC 
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AGTGCGGAGTCTAGCCTTTTGTAAACTTCATATTGCACACTAGGACTATAAGCCATTGCT 
AGCTCATTTTGAATTTTAACGTGTAA I 1 1 1 1 GTTTTATTTTCTTTCTGTGGGGAAACAAT 
GCTTGATCCACCAATGCTCTTTTTAATGTTTTATAACTATGTATGTGTATATATATAAAT 
ATCAAATAATATGTATGCACATATGTGTGTGTATATATCTATATGTATATACATA 
Sequence 2384 

TGGAGCTCCACCGCGGTGGCCCCGGGCAGGTCGAGCGACCGCAC 1 I I I I I I I I I I I I I 1 1 

TTTACCTGAAMTGCTTATTCTAGC7TCACATTTGATTGTTTGGCTMGAAGAAAATTAT 

TTATTAGACTTMTTTTCCTCACGAGTTTAAAGATTGCTTCAGATCTTAAACTTCTAATG 

AGGAAAGCTGAGAAGTCCAATGC CATTCT GATTCTTGCAACTTACAAGTAGTC I I 1 1 I I I 

GTCTAGACGCTTTCAGGACCTTCTTTTTTCCTCAGTCAGTGTATCCAAACCTTCACAGTG 

ATATCTTTTGGGTACCTCGGCCGCTCTAGAACTAGTGGGATCCCCCGGGCTGCAGG 

Sequence 2385 

AGGTACTCATAGCATTTTTCCCCACATAGTCTTTCAAMTCTGCATTTATTTCAAAATCT 

GAC CTTC ATMCTCAACTATACATGMTTGCTGGTATTGTCTTTTAACTTGGCCAAAGAA 

CAGTTTTCTGAGTTAGCTATTATTTCCACCATAAAATTGGGGGTMGATTTGGCAAAAAA 

AAAAAAAAAAAAAGTGCGGCCGCTCTAGAACTAGTG 

Sequence 2386 

AGGTACTACCCATTTTAATTACACAGTAAACAGAAGCACGGGTAAGTGACATACTCATAC 
TTTAAGCAATAAGAATTAGAAGAAACCATAGAAGCTTGGGGCCTTCTCTCTAGCTCTAAC 
CCAAAGAAA ATGAATTTTA I I I 1 M I I I I I I AAAAG AAAACAGCATCAATCACTTAAG AT 
TTTCTTCCTC 1 1 I I I I I I I I I I I I I I I ACACTTGCTTATTAGTATAGNATCTCGTTCCAA 
AGCCCGGACGCGTGGGTCGAAGCTTGTACACCTGCCCG 
Sequence 2387 

AATTGGA GCCTCCA CCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCCACGCGT 

CCGTCTTATTTTTTACTCTTAGGCAATGCGGAATATCAATCCCAGCACAGCAAATTCTCC 

AAAATGTCAGGTAGGCTCTTATCTGATGTTTTAGCACTGGAAAAAAAAAAAAAAAAAAAA 

AAGTGCGGGCCGACCT 

Sequence 2388 

CGGCCGAGGTCTTCGACCCACGCGTCCGAAATGACTGGTTATTTAGAAAAGAAGGATGTT 

TAGAATAAAACAGGAAGTCCAAACATGTCATAAGTGGTTTGTGTATGTCATAATAAGGGA 

TTATAMAAGAGGATTTATGTGAAAAAAATTTTATGTGATCAAGTTGTCTACAATTACAA 

GGAMTTATTTATAATAGACGTTCTAGAGATCTATATTAAAAAAAAAAAAAAAAAAAAAA 

CCTGCCCGGG 

Sequence 2389 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 

ACGCGT CCGTC TGAACCTACACCCCGTCTCTTCACGGTTTAGACTTACTAAAATAAATAC 

MGGTGATTTTCATCTTCAGGTAGAGTGAAGCCTTTTAATTAAGGCGTCACAGGTGCAGC 

TATTCTACCTTAATGAMTGGGTAGTGATTTTCCCACCATTATTTATTTCGGTGATAATA 

TGCTGCATATTCMGTCTCTTGTAGTTATTTTCACCCAMGTAGTTGACAATTTGATGCT 

TCTGGTGATGTTTATGGCTTCATTTTATGTAATTTTTTAAGTAAGTTCCACTAGAAACAG 

TTCATCTTATACCTTCAAAA 

Sequence 2390 

AGGTACTATAAGMCACATTAATTCMTGGAMTACACTTTGCTMTATTTTAATGGTAT 

AGATCTGCTAATGMTTCTCTTAAAAACATACTGTATTCTGTTGCTGTGTGTTTCATTTT 

AAATTGAGCATTAAGGGAATGCAGCATTTAAATCAGAACTCTGCCMTGCTTTTATCTAG 

A GGCGTGTTTGCCA TTTTTGTCTTATATGAAATTTCTGNCCCAAGAAAGGCAGGATTACA 

TCI I II I I I I I I I I I AGCAGTTTGAGTTGGNGTAGGGGTATTCTTGGGTTATCAGAATAC 

TCATATAGCTTTGGGA I I I I IGA 

Sequence 2391 

GTTCGTA TTTCTTATTCTACAACAAGGGTCAGCCTACAGGCAAAACACATCCCATTGTCA 
TTTTTTTGTAMTAAAGTTGTATTGGAACATGGCCACTCTCATTTGNTTTCTATTATTT^ 
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TGGCTGCTTTCACTTACAACCTGAGTGGNTGCCACAGAAACTGTATGGGCCTGCAAAGTC 
TAAMTATTTACTATGTAGCTTTTTCTTTTC I I 1 1 I GGAGACAGTNTGCCACTCTATTGC 
CCAGGCTGGGAGTGCGGTGGTGTGATCATGGGCTCATTGCAGCCTCAAAACTCCTGGGCT 
NAAGCAATCCTCCCGCCTCGGTCTCCCAAGTAGTTGGGACTACAGGCATGA 
Sequence 2392 

CCGGGCAGGTACCCAGTMTCACATAMTTCTGCMTCATCTGTTTATTTAGCTTAACTG 
I I I I I I I I I ATTTGNTG AAGNTGTTCGTTGTTATTTCAG NCTTTTTCTTATTGGG 
Sequence 2393 

ACCGCGGTGGCGGNCGCCCGGNCAGGTCTAGCTTAGTCGACCCACGCGTCCGGGCTTAAC 
TAATATTTGNNTGNGTGCTACTAACAGGATTATAATAAATTTGTCATCAGTGAAAAAAAA 
AAAAAAAAAAAAAAGTGCGGNCCGNTCTANNAACTAGTGGGAT 
Sequence 2394 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGATTTCAGGTTGACTTTTCTCACCTTTAA 
CCTCTTTATATAGCACAGTGCAATCTGGCCCTACTGCCACTTCATCTGGGTTATCTGTAG 
CTTGAGTTGTAAAAAAAAAAAAAAAAAGT 
Sequence 2395 

AGGTACTGTTGTCTCATGTAATGCTAAAACTGMATGGTCCGTGTTTGCATTGTTAAAAA 

TGATGTGTGAAATAGAATGAGTGCTATGGTGTTGAAAACTGCAGTGTCCGTTATGAGTGC 

CAAAMTCTGTCTTGAAGGCAGCTACACTTTGMGTGGTCTTTGMTACTTTTAATAAAT 

TTATTTTGATAAATAATATTGAAAAAAAAAAAAAAAAAAGTGCGGCCTCGAGCGGCCGCC 

CGGGCAGGTACAGGCACCTATAGAATTTAAAGGGGAGATTTCTTTATTTTGTATTCAATG 

TATTMTAAGATTTTTAAAACATATTTTGGAGAAATTGCTAAT 

Sequence 2396 

AGGTCCTAGCTTGAGTCGACCCACGCGTCCGATTTTTGCCTCCAGACTACAGATCAGAAA 
ACTGAGACTCAGMTGTTTCAATTCCTTGTTTAAGATCACAAAACTAGTTTGAGGTATAA 
TGGAAACTGAAAAAAAAAANAAAAAAANGT 
Sequence 2397 

AGGTCAAGCTTCGACCCACGCGTNCGGAAGTNTTCATTCTCCCT Cl rTTTTTTTTTTTn 

TTAGCMTTCMGNCATGTTTTTGNCTACAAGTTTTTCCAGTATTGTATAGATAAATAAT 

MTTTACNAGGCTGCCTTTGAGTATACTTAGACAAGAGACCTGCCCGGGCGGCCGNTCTA 

GNACTNGGTGGANCCCCCGGGCTGCCAGGNATTTCAATATNAAGNCTTATTNGTTACCGN 

GCGACCTACGAGGGGGGGG 

Sequence 2398 

CGGCCGCCCGGGCAGGTATCAAGTGCTTGGATTCTGAACTGNCAAAAGAAAACTGCACTT 
GCCCTCTGAAGTAAAAACCGAAATGAGNTTTCTTAGGCAAATGTATTCATCAGCCCAGAT 
AAAAAAAAAAACCANNTAATGNGAGCCNTTAGTCACTGCT 
Sequence 2399 

AGGTACAAGCTTCGACCCACGCGTCCGATACGACTCACTATAGGGATCTACCTGCTTGAG 

TCGACCCACGCGTCCGAACACATACAAAAGAATTAAACCCACAAGCTGCCTCTGACAGCA 

GCCTGTGAGGGAGTGCAGAACACCTGGCCGGGTCACCCTGTGACCCTCTCACTTTGGTTG 

GMCTTTAGGGGGTGGGAGGGGGCGTTGGATTTAAAAATGCCAAAACTTACCTATAAATT 

AAGAAGAG I I M I ATTACA 

Sequence 2400 

AGG TGGCCG CAC I II M I I I I I I I I I 1 I I I II I AAAGTTTGGGGTCTGTCAGGAGACAGA 
GGCTTTTTTGAATTCACTGTGAAGAGAAGAACCCGAACCTTAAGACGGCAGATCCCTGAG 
AGTCTTTCTGGCTGGTTTGAGCGGACGCGTGGGTCGACACCTGCCCG 
Sequence 2401 

AGGTACTTCAAAGTTATTTGCACATACACTTGTTTACTTTGNATGTTTTGCAGGATTAA^ 
CTTTGTATAATCTTTTTGCAAAA I I I I I I I I I CAGTATGCAANGCTTGCAAGATGAAAAT 
TAAAACC 
Sequence 2402 
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ACCGCGGTGGCGGCCGAGGTACAAGCTTCNNNCNACTCGTCCGAGCTTGAGTCGACCCAC 
GCGTCCGCGTTNATGATTTTTAAMCACACTTGAAATAAAMTGATTGAACTAAATTTTG 
GTCCGGNGACATCATTNTGCACTGCATAGCCCA 
Sequence 2403 

CCGGGCAGGTACATCTCTATCAGAGCTTTTGGGTGACCAGGCACACTGTCAATGAGCAGT 
AATA CTTTGAAAGGAATCTTTGGGG I I 1 1 1 1 I I GGTTTGGT I 1 1 I GTTTTGTTGTTATTT 
TTrTGTTNGTTTGATTTTTGTTTTTGNTTTTT^ 

AATACTCAGTAAACCATGCTGTTAACAGATAAGCTGNCATCCAGACTTTGNTGTTCCATT 
TCTAGAGCACAGAGCAGATTTAGCACMTTCTTAANGCTTTATGATTTTTCANAATGGTA 
AANTGAGCACTGGCTTCAA 
Sequence 2404 

NTTTTTTGGGGGGGGGAAMCCCCNNACCCCCNCCAN^ 

TNACANNNNAGGGGGGGGCANCCCCCCAAAAAAACNNNCNANGCANGAAGNANANNNAAA 
CAMACANGNANGNAAGNNNNCACGNGCTTTTTTTAMANATTTTTTTNNNNGGGGGGGG 
GGCCCCCCCAAAAAAAANGAGGACGNNGGAGCCCCCCC 
Sequence 2405 

CTNCTATAG GGCCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGG I I I I I 1 1 I I 
1 I I I I I I I ICTTTCTGTTCCTTGGACTAGATAATCTGAMTCAACTGTCTTCAGTTTTGC 
AGACTC TTGTG CCAGCTAAAATGTTCTGTTGAGCCCCAGMGCTAATTTTCTTTTCAGTT 
ATTATGATTTTCAGCTTTAGAATTTATTTTTTTMTATAANNNCTACCTC I 11 1 1 1 ATAT 
TCTCCATTGGGNGAGACATTCACATACTTTCTTCCAG I I I I I 1 I I AGACGTAGTTTCCTT 
GAGTNCTTNGAGCATATTTAAAATAGTAGATTTAAAGGATTTGGCTAGTTACTCCACCNT 
CTGAGNTTCCTCAGGGAAA 
Sequence 2406 

CCGGGCAGGTCAAGTCGACCCACGCGTCCGCTTGTTTTGCTCTATCCCATAGGAGTTGGT 
ATAGTTGTGTTTCCAATATCATTTACTAAAAGMNANTNTCCTTTTTATTTCTTCATTGA 
CCAACT 
Sequence 2407 

AC I I II I i I I I I I I I I I I I I 1 1 AGTCTTTCTACTTTN AGGGTTTATTNACTGGCAGTTTT 

C AGGT CTGNAAATACAAAAAGGAACATAGACATNGGGTNTCTGCCNTTGAGGNGCTCCNA 

GTTTTGGNGAGGGACACAACACNTTTCAGGGGACAACCGTGGTACCTNGGCCGGTNTAAA 

ACTAGTnnnnnnnnnnnnnnnnnnnnnnnnTCNATATCAAGCTTATCCGATACCGTTCTA 

CCTNTGAGGGGGGGCCCGGCACCCA 

Sequence 2408 

AGGTCMGCTTCGACCCACGCGTCCGGCCTGGGACAGATCCTTAGTCTTTCCTTGACTTT 

TATAGACCCAGAGGTGAAAGGCCAAATG7TTTGTAGAATGTCCTTAAACTTGGGTTTATC 

TGAGGTTTCCTTGTGATTGMTTCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTG 

ATGCTGTGTTCTMTTGCATCTTATCAGGTGACTTATGATTTCTGTTTGNCCCATTATTG 

ATGCTGNTACTTTAGATCACTTGATTAAGGNGGTGTCTCGCCTGGCTTCTCCAGTGTGAA 

ATTTCCI I I I TCTTCT 

Sequence 2409 

AGGTAATTTTTCTGCTTGCCCAAGAAACAAAGCTTCTGTGGAACCATGGAAGAAGATGAA 

AA TGAG ACTGGCAAAGAACAAATGCTGAATCTGAAGAAGATTTGGGCAAATAATCTGCAT 

ACTTTTMTTGGGAATAAGATGGAAAATATGMTGCTAAATCAAATTTTTTAAAAAAATA 

CACCCGGACGCGTGACCTGCCCGGGCGGCCGGCCGCAC I 1 I I I I 1 I I I I I I I I M ICAAT 

ATTATTTATCAAMTAAATTTATTAAAAGTATTCAAAGACCACTTTCAAAGTGTAGCCTG 

CCTTCAAGACAGATTTTTGGCACTC 

Sequence 2410 

ACI I I I I I I I | I I I I I I I I I I I I I I I I I I I I I NGGNGGGGAGGAAGGCAAGGCACTGGGA 

AAGACTNACAACANATATTGACCCTAGTCGTAAAGAAATCCATAATTGCCAGTAACACGA 

CNTATTTAAGMCAGGMAAGACNGACAAGGAAAAGAGGGACTTTTTTTAAAAAACATTA 
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CTAAAGAATTNGGACATAANAAGAGTGAAATTGACAAGGAAAGGAGGAGGGGGA 
Sequence 2411 

GAGCTCCACCGCGGTGGCTGGCCGCCCGGGCAGGTACATAATATACAGAGGTATAATCTG 

TAACATCAATAATGTAAAGTGGGGAAGGGCAAGGTGGAAAAGGAGTAGAATGCTTGTATG 

TGACTAAAATTATGTTGGTATCAGTTTAAAATATATTATTATAACTTTAGAATGCTATAC 

CCATTCCCACAGTAATTCCCATAGNAACCAAAAAGAAAATATCTTGTNGNATACACACAA 

AAGAAAATCAGAAGTAGATGCAAACTTGTCACTACAGGAAAAAAAAAGCTATCAAAATAG 

AAAACAATNATGGNGAAAAATAAAGACA 

Sequence 2412 

CCGGGC AGGTG CCGCAC I I M I I I I I I 1 1 I I I I I I I I I I I I I M 1 1 ATGGAAAAAATATT 
GTGATTATTTTTMTAGATTTACGGTATAAMGAAAAACTTTTATGATNCMTTTCGACA 
GACTACTTTCATAAAAAATTTTACTNTACATACMTGTATTGCAMTTTTTNGGCAAC 
Sequence 2413 

NGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTCGACCCACGCGTCCGATTTT 

GTGAGMTGATTGTTCTTTCACTTGGGCTGTTTGAGAGCATAATTATGGTAATCATGAGA 

TTAATGT TTCA TGATTTCTACCTCCAAAGTGTGAAGACAAGTNAAACAATGNTTCTAAAT 

TGTCTTATTTTGTTGGCGGAGAAGATTACAATGGGCTATTAGTGCTACATTTGGTCAAAT 

GTAATCACTTAAATAGCTTCTTGTCACCTTAACTAAAGCAGAATAAAAAAACCCTGCCCC 

GGGGCGGCCGGNCCCGCCCCGGGCAGGTACCATTCCCGACGTTTGCAATGGTGGGAGTTG 

GCAGGTGTG 

Sequence 2414 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGCTTCGACCCACGCG 

TCCGCTTCTCTTAGAATTTTGGGGAAATTGATAGTCCAGTGACTCCTACCCACTTTTGGG 

TGMGGACGATTTGGAATTTTGAAGTGTGGGGAGACAGGCCTGTGAAGTCCGAANGACTC 

ACTTGGGGT 

Sequence 2415 

CCGGGCAGGTCAAGCTTCGACCCACGCGTCCGAAAAAAAAAAAAAAAAGTTTGTTGGAGA 

CCGACTGGGGGTGAGGGGCTGGCAGCAGGAGACAGATAACANGTTCNCTCAGAATGCAGA 
GT 

Sequence 2416 

ANGI I I I I I I I I I I I I I I ICANTGCTTCCCAAAGCTGCGGACATAAGGGTAGCTGANCTG 

GACTCTGNCCTTGCTGAAGACTTGGAGATGTCTGAAGTCATAACTGGGGNGACCTCCTTG 

GTCCAAAATGTGCTGCCCCCTATGAATTCCACATCAGGAGTTGTAGGAGATGAGGTTAGA 

GGGATGCTTGGGCTCATCTGGCTTCCTTCAGCATAGCCCGGACGCGTGGGTCGAAGCTTG 

ACCTGCCCGGGCGGCCGGCCGCAC I I I I I I I I I I I I I < I I I I I i I I 

Sequence 2417 

CCGGGCAGGTTTTCTTATGAGTGGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGT 

GGGAGGTGACTGATCGTGGAGGTGGATTTCTTATGAGTGGGAGGTGACTGATCGTGGAGG 

TGGATTTCTTATGATTGGCTTATCACCATCCCTCCTTGGTGCTGTTTTTGCAACAGTGAG 

TGATTTCTTGTGAGATCCGGTTGTTTAAATCCAGAGGCACCTNCCCCTACCCTCTAGCTC 

CCATTCCTGCCATGTMGACACCTGCTCCCCCTTTTTCTTACCCCATGATTGGAAGCTTT 

CTGAGGCCTCCCCAGAAGCTGATGCCAGCCCTATGCTTNCT 

Sequence 2418 

CCGGGCAGGTCTTCGACCCACGCGTCCGCACATTTTGATGGTCAGTCAATAACTTAAGCA 

GNTACCAAAATACTAGGTATCCAAGGAGCGAGAGGTGGGCCGAGCATAAGAAACACATTT 

CTNATGGCACAGCTCTGCCAAAGCCCTGCAGAATCATTTACACATAGGTCTTTGGTTAGT 

AGCCCCTGGCACAGAANTCTGATCTTAAACAAATATTGTCTATAATCAAGTAGAGCAATG 

CAATTAAAAAMAAAAAAGCACAGGNTTTTGGGGCCATNGCTGAAATCCCAGCCTTGCTA 

TTTTGCTTGGCTGNGTGACCGGGGTTTCCT 

Sequence 2419 

AGGTGTACAAGCTTCGACCCACGCGTCCGGGATGAGTTTGTATGTGTAAAGTGCTTGAAA 
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CAGTGCCTGCCACATACTMGTGTTGGATAAGTGTTTGTTATTAAAAAAAAAAAAAAAAA 

AAAGTTGCGGCNCGGCCGCCCGGNCAGGTACTGGCGTGGATTCTGCATANTGGNGATCAC 

ACGTTCCACCTCATCCTCAGTGAGTTCTCCCGCCCTCTTGGAGAGGTCAATGTATGCTTT 

CNCTCAACACCACATGAGCATATCTTCGGCCCACACCCTTTAATGGGCAGTGATGGCAAA 

GGCTATTTTCCGCCG 

Sequence 2420 

AGGTATTCAACAAGGGCCCTGAGAGAGGGACAGGCAGCCCCTGTGAATCTTGCTGTTCAG 

CANAGACAGGANTCAGCACGTGTGAGGGCAGCAGGGAAGTCTTCCTGGAGGAGTGAGACC 

TGGCGATGAGGAGGCACGGCAGGGAGGTGGAACAGGCAGGAGAGACTCTTCANGAATTGA 

GGAGATAGAATAGAGGACACTAAAGCCTTAGAGAGGCCAGGGGTGGTGGCTTGGCAGGAN 

CATCGCTTGAGGCTAGGAGTTTAAAAGCAAGCCTGGGCAACATATCCGAGACCCCCATCT 

CTAAACACAAAAAATAAAAAA 

Sequence 2421 

CCGGGCA GGTACCCAGTMTCACATAAATTCTGCAATCATCTGTTTATTTAGCTTAACTG 
I I I I IN I ATTNGTTGAAGNTGTTGTTGTNATTTCAGTC I I I I I I CTTATTGGGGTTGAC 
CAGACTTGGGTAAAATCTGTAAGAAAAGTNCCATAATTATGGGGGGAAGATTTCCTCTTG 
AATTGGCTTAAATNCCTGTTAGCTGAAAAAAAAAAAAAAAAAA , 
Sequence 2422 

CCGGGCAGGTCTTNGACCCACGCGTCCGAGCAAAATTCAACTAAAAATACAATCTGGATT 

CCATAGCCAAGGGTTTTATTTACAATNTCCTAGTAGGAAGTCTTTATTTTAGCTTTCAAT 

GTGTTGAACTTATAAGGAMTTTAACGTATACATGAGTATTATATTTATGGAATGTGAAG 

ATATACAGMTGGAMTGGAAAATMTGTTMTTCGTATTGACTTTGAGGAATCTTANAA 

TCATGTAGCCCTGTTGCAACAAGAAATAGGGAACTTCTGAA 

Sequence 2423 

AGGTCAAGCTTCGACCCACGCGTCCGGTTTGTTTTTTTCTTACGGCAACTCAAAGCAAAG 

AGCTGGAGGAGCCAGCCATTATAATTGCTTACTCTCATCGCTTAGCGCCCCAGGTGGGAT 

GTGTTTCCAAAACACATTTTTGTATTTATMGGAAATGTAGTTAGGATTAATT^ 

CCTMTTAGMCTCACATTTTGGTTAMTCCTCMTTTCATTAAAAAAAAAAAAAAAAAA 

AAGTGCGGCCGGCCGCTCGACCTCGGC 

Sequence 2424 

AGGTGCTTCGACCCACGCGTCCGACTTAATTGAGAAGGTGGAATCCTCCTATCCCTGAAC 

TCGGGGGAATGGAATCTCGCTGATCTTCCAGGACTAGCTCCCTGATCATTCCAGCCCCTC 

TGAACAACAGGGCCCCTGGAGATAGAAGTAGTCCTATTTCACCCCAACTACAACATTAAT 

GGGAAAAAAGAAGCAGGMTTCCTGMTTTTATGACTATGACGTTGCCCTGATCAAGCTC 

AAGAATAAGCTGAAATATGGCCAGACTATCAGGCCCATTTGTCTCCCCTGCACCGAGGGA 

ACAACTCGAGCTTTGAGGCTTCCTCCAACTACCACTTGCCAGCA 

Sequence 2425 

AGGTACAACTTCTCTTTTTTGGAGTTTTACT 

TTTCCAT GACGTT GGAGTTTGGCTCACTTCCAACAGGGAAAAGGAGTG I 1 1 1 I I I I I GTT 

TTGTTTTGTTTTTCCTGCTTCTTGAATGGTAGAGAGCAGTCTATAGCCAGAGACTCGTCC 

CTAGGTAAGTAACTGMTTGGGGTTTGTCTTGGTTAAAGTTAAGATTAACGACCAACTGG 

TCTTMTTTCTCCTTACCATTAGAGCACTCAGTTATCATATAMTTGCGCCATTGTTTGT 

TTTGCCTAA 

Sequence 2426 

AGGTCAACCGCCAGGGTCAAACGGAACACAACCCACTCTCAGGAAGACATCCCTAACACA 

AATCCAGGGACTTTGTTTCTTAACCTTAAAATTGGAAACACTTCTTGCTACCAGGGATGG 

GGGGTGGGGCTCAGCAGTTTGGGGAAACGGAGTGGGAGTCTTTTGCTGAACCGGACGCGT 

GGGTCGAAGCTGGACCTGCCCGGGCGGCCGGCCGCAC l I I M I I I I I I I I I i I I I I I GAG 

AGTGGCTATTTCATTAANATTTAATAG I I I I I I I I GGACTAAGTAGTGG AAAAACTTTTA 

TACTTAACTGAGACATTTTGTCAAGGCTAAAAAAAAGT 

Sequence 2427 
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■AGGTACATCCCACTATTTCTTTTCTTTTA 

CCAMTAAMTGATTTTATTTAGAAAATGCCGGGCACTAAAAAAAAAAAAAAAAAAAAAA 

AAGTAC 

Sequence 2428 

CCGCGGTGGCGGCCGAGGTACCTATTTTGTATATGTGAGATGTTTAAATAAATTGTGAAA 
AAMTGAMTAAAGCATGTTTGGTTTTCCAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGC 
CGCTACCTGCCCGGGCT 
Sequence 2429 

AGGACAATGCTGTAGATAATGCAGCCCATGCAATACACCCAAGAACACTAGAGTCCTACA 

CCCAAGTACAATATGATAAAGCAGCCCTCTGCAAGTGGTGCTGGATACCACTAAGAAGTC 

TACTGCAGCCATGTTGGTTATGATTTTCCATGCAGMGGGTACAGTTAGTTCATATTTAT 

GTATTGCACATAATCATGCTATTCAGCATTGATGCTATATTGTATTATGTAAATAATAAA 

AGCCATGTACAGAGGGAAAAAAAAAAAAAAAAAAAAA 

Sequence 2430 

CCGCGGTGGCGGCCGAGGTATTCGACCCACGCGTCCGTAGTTTTTATCTTTGACCAACCG 
AACATGACCAAAAACCAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAA 
AGTGCGGCCGNTCTAGAACTAGTTGGATCCCCCGGGCTGGAGGAATTC 
Sequence 2431 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCTTGAGTCGACCCACGCG 
TCCGAGTCAAGAGAACAGCACATTAGTTCCAGAAGAAAGATGGAAATTCTGAAAACTGAA 
TGTCAAGAAAAGGAGTCAAGAACAATTCACAGTATGAGAAGAAAAATGGAAAAAAAAACT 
TTATTTAAAAAAGAAAAAAGTCCAGATTGTAGTTATACTrrrGCTTGTTTTTCAGTTrC 

Sequence 2432 

CTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCTGTTCGTATTTCTTATTCT 

ACAACAAGGGTCAGCCTACAGGCAAMCACATCCCATTGTCATTTTTTTGTAAATAAAGT 

TGTATTGGAACATGGCCACTCTCATTTGTTTTCTATTATTTATGGCTGCTTTCACTTACA 

A CCTGAGTGGTTGCCG CAGAAACTGTATGGCCTGCAAAGTCTAAAATATTTACTATGTAG 

CTTTTTCT7TTCTTTTTGGAGACAGTGTGCCACTCTATTGCCCAGGCTGGAGTGCGGTGG 

TGTGATCATGGCTCATTGCAGCCTCAAACTCCTGGGCTCAAGCAATNCTCCCGCCTTGGT 

CTNCCAAGTAGTTNGGGACTACAGGGCATGAGCCCNCCATACCCCGGNTAATTTT 

Sequence 2433 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGGAAAATGTTrTCTTTTTTT 
AATTTMGGTTTAAMTTCCTTTGCCAAATCAAAAAAAAAAAAAAAAAAAAAAAAACGNT 
NGCNTGCCNGGCCGGCCGGCCGNNCTTAC 1 1 I I 1 1 I I I I I 1 1 I 
Sequence 2434 

CTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCTTGAGTCGACCCA 

CGCGTCCGATAAMCACCGCCTTCAGTGTAAAMTTTTACATGGTATCTGAACAACATTT 

ATCCA GAGGTG TATGTGCCAGACCTTAATCCTGTTATATCTGGATACGTGAGTATTTTCC 

TGTTCCTrTTTTMTTMTATACTCCTTGCCACCAACCTTTATGTGGTTCTAAGAAMTT 

GCTGAA ATACT TTCTTTATTGCTTTTGAGATTTTACATTATMTCATTMCAT^ 

TAMTTATTTTAAAAATATATAAATCACATGGATTAAAAAACTTTTCATAACTTGAAAAT 

TTTCCTTTAGATTAAAAGCGTTGGTCAGCCTCTATGTCTGGGATGTTGGAGAAAACAATC 

AAGGAGGCAAACCATTAATTATGTATACATGTCATGGACTTGGGGGAAACCA 

Sequence 2435 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCI I I I I I I I I I II 1 1 I 1 1 I I I AATG 

GTTAAGCTACTCAAATTGTTTTCAGAGCCAGAGTAAAAACCAAAAAAAATACTCATCAAT 

ATCAATTGCCAAATTCAGTCTGAAAACATTTCACACACAGCTTACCCAAGTATAAAGCTG 

CTGGGG GGACTTC TGAAAANTTGGCMCATTCATTNGGGGCTTNGAAATGCTTTACAAGG 

GGMGGNTTTTTTAANGCAGGGCTTACNTGGGTTTTCCCCCAAGGCCCCTGGNNANGTTT 

NCCNAAAATNAAAGGGGGGGGCCCCCCCTNGNNGNGGGTTTTTTCCCCNNNCCNCCNNAN 
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'CATNANGGGGNGNGCCCCNCNCNNTTIT^ 
TNTGA AAAMGGGTTTTTTTAAMCCCCGGGGGGGTTTNTNT^ 

TTTTTAAAAAGGGGGGGGGGNNGNTGGGNNTTTTTNTGGNGGGAAAANNTNTNACACCCC 
NNTAAAAANGGGGTTTTTTNCCNNA 1 I I I I I 1 1 I I NCNNCCCCAAAANANGGGGGGGGGG 
GGGGGGG 
Sequence 2436 

CTNCTATAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTCATTTTATTTTTAC 
GTTGTTACGATATGGGGAGTAGTGTGATTGAGGTGGAGTAGATTAGGCGTAGGTAGAAGT 
AGAGGTTAAGGAGGGT 
Sequence 2437 

TNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGC 

GTCCG GCCTG GGACAGATCCTTAGTCTTTCCTTGACTTTTATGACCCAGAGGTGAAAGGC 

CAAATGTTTTGTAGMTGTCCTTAMCTTGGGTTTATCTGAGGTTTCCTTGTGATTGAAT 

TCAGGTTAGACATCTTTGATGGGACTGTCATAGAACTGATGCTGTGTTCTAATTGCATCT 

TATCAGGTGACTTATGATTTCTGTTTGTCCCATTATTGATGCTGTTACTTAGATCACTTG 

ATTAAGGTGGTGTCTGCCTGGCTTCTCCAGTGTGAAATTTCTTTTCTTCTTTGTAATATT 

TTGTGGGGGMGTMCTTTMGACTATGTMCTGm 

Sequence 2438 

TACTATAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTACAAGCTTCGACCCAC 

GCGTCCGAAAGATTCTTGGCTGAGCATGGTGGCTCATGTCTGTCATCTCAGCAATTTGGG 

AGGCMGGGCAGGAGGATAGTTTGAGCCCATGAGTTTGAGACCAGCCTGGACAACATAGT 

GAGACCCCATCTCAGCAAAAAAAAAAAAAAAAAAAAAAAAAAAAANGTGCGGCCGGCCGC 

CCGGGCAGGTACATGACTATATCAGGATTTCAAATTGAGGAAACCATTGACCGCGAGACT 

TNTGGCAATTTANAGCAACT 

Sequence 2439 

CCGCGGTGGCGGCCGAGGTACCTATTAAGCTCATGAACCATAGAGGTATCTCGGTGGCCC 

CTCATTACCATCTGCTGTTCTTTCAGCTGTTTAGCTACATCTTTGGCTGAGGAACCAGAC 

ACTTCAATCCATGTCTTAGAGAAGAATGCACATGACCCCAACATGAAGATGATATAAACA 

ACGACATGGACAG 

Sequence 2440 

TNCTATAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGGCCGCA C I 1 I < I I 1 1 I I 
I I I I I I 1 1 I GAAMTAGTTTATTATTTTATTGTTTTTTGTTTATACATGTTAAGTTTCM 
CTTTCMTAATAAAATTCAATAAATTTGATTCCTTMTCATAAAAACTTGCTTTACA 
Sequence 2441 

CTATAGGGCGNTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTCAAGCTTCGACCC 
ACGCGTCCGAT MTTTA TACTAAATTTAGTAAAATGGACTTCTTATTCAAAGCATCAATA 
ATTAAAAGAATTATTTTAAAAAAAAAAAAAAAAAGTGACCT 
Sequence 2442 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGACAAGCTTCGACCCACGCGTCCGCTT 

MTAGCAMGGATAATTGAAATCCCAAACTTACAAGGTTTTCAACAAAAGTGAAGTTTGC 

TTAAAGTTAACAGTGTAACATGTATTATGGTAACTTCTAATCTTGTGGCCTTAGACAGTC 

TAGTCCAAAGGCATAMGMAGTTTGCTTTAAAAAAAAAAAAAAGGAATGGTTATCTTCA 

AAAAAAAAAAAAAGTGGGGGGAGACAGAATTTATGTAAAAAGAGTGTTATATGGTAAATT 

CTTGTCCTGAAATAAACTAACTGGTGTTTAAAGAAAAAAAAAAAAAAAAAANGT 

Sequence 2443 

CTNCTA TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I I I I I 1 1 I I I I 

TTTTTTACCTGAAMTGCTTATTCTAGCTTCACATTTGATTGTTTGGCTAAGAAGAAM^ 

TATTTATTAGACTTMTTTTCCTCACGAGTTTAMGATTGCTTCAGATCTTAAACTTCTA 

ATGAGGGAAAGCTGAGMGTCCAATGCCATTCTGATTTTGGCAACTTACAAGTAGTCTTT 

TTTTNNNTAGACCCTTTTCAGGACC I I I I I I I I I I I CCT AAGTCAGGGGGTTTNCAAAAC 

CTTTCNAAAGGGGNTNTTTTTTNGGGNNNCCTGGGCCGGTTTTTAAAAANTAGGGGGGAN 
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CCCCCCCGGGGNTGGGGGGGMTTTGATNTNAAAGTTTTNTNGATCCCCGCCCCCCNTGG 

GGGGG 

Sequence 2444 

TACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I I I I 11 1 1 I I 1 I I 1 1 1 
TTTTTTTTGAAATGTTTG 

TAATTTCTTTGGMTATMGTCACTTTTTGCAAGCTAAAAAATAGAATCAAACTAAGGTG 

ATCTAAGTCCTCTAGGCATCCAGGCTGATCCTTGGAATCATGAGCAGAATGATGACATAC 

TACANGGGGCTAGCAATACCGGNTNTAMCTNTTAAATAATANCCCTNCATGGTTTTATT 

AGGGAACCAGCCAAAAGTCCCGNCCCTTTTAAAACTNGGGGGANCCCCCGGGCTTNNNGG 

GANTTCGATATTNAACTTTTTTNGAAACCCGCCCNCCNCNNGGGGGGGGGGCCCCGGGCC 

CCNACNTTTTTTGTTNCTTTTTAANGGGGGGGNNNNAAATTCGCCCCNCTGGGGGGGAA 

Sequence 2445 

ACTTAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I 1 M I I I I M 1 11 I 1 I 1 1 1 

I I I I I I I I I I I ATTGGTATTTAGTTTTTATTTCATAATCATAAACTTAACTCTGCAATCC 

AGCTAGGCATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCA 

CAAAAGATTCTAGGTTNNTGCGAGCAMTGGGGTNGGNGGGGNGCTTTNNTTGGNTAAAN 

MGGGATGNNNTTGGGGGTTTAAAAAMACCCNNGGCANNCNTTTTTNGGGTNNNCCCCN 

ANCCCCAANNNGGGGGGNNTNTTTTGTGGGGGGGGTNGNr4TNTTTTGGGGGGACNCAAAA 

ANGNNNNTNTTGGNCCCCCNAAAAAAAANTTTAANGGCN I I 1 I I I I I I 1 1 I 1 I 1 NTAAAA 

AAAANNNAAAGTTNNNTTGCCCNANNNCNCCCCTN 1 1 11 11 I 1 GNGGGGGGNGGNGAAAA 

AAAAAAANNNGNNGNCCCCNNNTTTTTNNNGGGGGGGGCCCNMNTTTTTNTrCNNTNAAA 

AAAAAAATCTCCCCCC 

Sequence 2446 

CCGCGG TGGCGGC CGGCCG GGCAGGACACAGGAGGCCTTATTACTTTTAAATTATACAAC 
ATTTTTTGCTTAAMTTTT^ 

TTTAAACATGTTTTAACTTTTTGACTTATTACAAACATTCTT^ 

AATTTATTTCAGGACAAGAATTTATCATATMCTCTTrTTATATAMTTCTGCCTCTCCC 

CTTTAl III I GAAG ATAACCATTG I Mill I AAAGCAAACTTTCTTT ATGTGTTTTG ACT 

AGACTGTCTAAG GCCA CAAGATTAGMGTTACCATAATACATGTTACACTGTTAACTTTT 

AGCAAACTTCACTTTTGTTGMMCCTTATAAGGTTTGGGGATTTCAATTATCCTTTGCT 

ATTAATAAGACCTTGTTCAGTCTAAATTAACTTAAA 

Sequence 2447 

NCACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCAC 

GCGTCCGGGAGAATTACCCGAAAAACAACCACACTGCTTCAATCCTGGACAGGATGCAGG 

CAGATTTTAAGTGCTGTGGGGCTGCTAACTACACAGATTGGGAGAAAATCCCTTCCATGT 

CGAAGAACCGAGTCCCCGACTCCTGCTGCATTAATGTTACTGTGGGCTGTGGGATTAATT 

TCAACGAGAAGGCGATCCATAAGGAGGGCTGTGTGGAGAANATTGGGGGCTTGGCTGAGG 

AAMATGTGCTGGTTGGTAGCTGCMCANCCCTTGGAATTGCTTTTGTCGAGGTTTTGGG 

AAATTGNCTTTGCCCTGCTTGCCTCGTGAAGAGTTTTAANAAGTNNCTTCCNNAGGNTNA 

NNTMGGGTNTCTTNGGTCTTTNTNANNNCCTTCNTTANTTTGGGGGGNG 

Sequence 2448 

ACTTAGGGCGATTGGAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGTGCCGCACTTTTTT 

I I I I I I I I I I I I I I I I I I I I AGMTTTAGCCACTTCTCAGGCCTTNTTCCCCATAATTTG 

GMCTTTCCTTTGGATTTGATCAAGTTGGATAGAGTTGATCAAACCCTGATCAAGTTGGA 

TAGAGTTGATCAAACCCAATGGGAAAAAGACCAAAACAACAATAAAAAACAGACAACAAC 

MCAAAAAACAGTTAGGCAAAACAAACAATGGCACAATTTATATGATAACTGAGTGCTCT 

MTGGTAAGGAGAAATTAAGACCAGTTGGTCGTTAATNTTAACTTTAACCAAGACAAACC ' 

CCAA 

Sequence 2449 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTTACAAGCTTCGACCCACGC 
GTCCGCGAAAGCGAAGAAGGAAGCTCCTGCCCCTCCTAAAGCTGAAGCCAAAGCGAAGGC 
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TTTAAAGGCCAAGAAGGCAGTGTTGAAAGGTGTCCACAGCCACAAAAAGAAGAAGATCCG 

CACGTCACCCACCTTCC 

Sequence 2450 

CCGGGCAGGTACTTTTTCTMTATACTTTTCNATTACACATGAAAGCCATGACAGGAACT 
GAGATAAGATTrCTrTGTTTTTrGAAACATCTTATCTACTAANAA^ 
TTNACTTNAMGCTATTAGTAGTT1TATACTCNCTTMTAAGTATTAATAAATTTACATA 
CTNGACTTAGTMNCTMGCAATTTGGNTAACGTNTTTNTT^ 

CANTTGGATATTTTTNCTACCTTACTATTACNTTATAAATATATTTCCCCAAATATATCN 

TTCCTCTTTAAAAANTATGTTTTGNCAACNAACCTTNAAA 

Sequence 2451 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAAAGTTATTGCAC 
ATACACTTGTTTACTTTGTATGTTTTGCAGGATTAMCTTTGT^ 

I I 1 1 1 1 I I I CAGTATGCAAGCTTGCAAGATGAAAATAAAACCTGTTTGCCTGATAAAAAA 

AAAAAAAAAAAAAAAAAAAAAAATGT 

Sequence 2452 

CTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAGCTGATTACACCAACTT 

GAATGAAACGACTTCTCTTGTGAACTATCAAGGGGCCGCCAGAATCACCTCTGCAAGTAT 

TGGGGTCAGCATAGGGACTCACTCCTCCAGTACAAAGGAACCGAGGGGTGACCACCTCTG 

AGATGTCCTTGACTTTGTCATAGCCTGGGGCATATTGAGCATCTCTCTCACAGCTGCCTT 

TCTTATCCCCATTCTTGATGTAGACCGGCCGCCCGGGCAGGTGCACATACACCAAATGTC 

TGAACCTGCGGTTCCTCTCGTACTGAGCAGGATTACCATGGCAACAACACATCATCAGTA 

GGGTAAAACTAACCTGTCTCACGACGGTCTAAACCCAGCTCACGTT 

Sequence 2453 

CCGCGGTGGCGGCCGCAC I I I I I I 1 1 1 I 1 1 ! I I I I I I I AAGAAATAGGGTCTCACTCTGT 

CCCCCAGGCTGGAGCCATTATAGCTCACTACAGCTTCTGACTCCTAGGCTCAAGGGATCC 

TGCCACCTCAGCCTCCCTGGTAGCTGGGACTATAGGCAGGAGATCGCTTGAACCGGGAGG 

CGGAGGTTGCTGTAAGCTGAGATCGCGCCATTGCTTTCCAGCCTGGGTGCCAGAGCAAAA 

CTTTGTCTNAAAAAAAAAATTN I I I I I I I I AATTAAATAAAAAGGGNCAGGGGA I Mill 

GGGAAAAGGTCNAAAANAAAATTGNNCTTTTTGAAAAAANCCTTGGNAAAAANCCAAAAA 

MAMTTNGGNGGGAAAAAAATNNTTTNNTTNNGGGANAAAAAANAAAANAAAAAAANNT 

ANGGGGGGGGI I I I I I I I INGGGGNGNGGGGGGGGGGNAAACNCAANCCCCCCCCNNTTT 

ANAAAAAAAAAGGGNGCCCCCCCCCCCNNGNGGNAAANGNGNTNTTNNTAAI I I I I I I I I 

TTTNCCCCCCCCCCCCCCNGGGGGGGGGGGN 

Sequence 2454 

GAGACACAGTCTCACTCTTGCCCAGGTTGGTCTAAAACTCCTGGGCTCAAGCAATCCTCC 

CGCTTTCAGCCTCCCAAAGTGCTGGGGTTACAGCCGTGTGCCACTGTGTCTGGCCCTTTT 

CTTTTTCATAGGAGAAGGGTTGTTGACTCCCAGGAAACGTCACCTGGAACCAAGAATGTG 

MCTCAAGGACCCCCGCCTGTTGGCAGCTGCATTTACTTGACTCCTGTTCACTGTTTCTT 

AGCCTTGTCCTTTCTCTCCTGCCAGTTCTAGGGGACACTGCTTCTCCTGGTTGACCTCAT 

CAATGCC 

Sequence 2455 

CCGCGGTGGCGGCCGAGGTTTCAAGGACCAGCCTGGCCAACATGGTGAAACCCCATCTCT 

ACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATGTGCCTGTAATCCCAGCTACTCAG 

GAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAGAGGTTGCAGTGAGTCGAGGTTGC 

GCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTCTGTCTCAAAAAAAAAAAAACCTA 

CAGCTGTTCAAGGACCAGCTGACAGGTCMGTGTGGCCTTTTCTGGTCTTTGAACACATC 

ATAGAAAGTGACAAATGCTGCAAAGCCATGAAGAACATGAACTATAAACGGGTAGACTAA 

CTGCCCAGCTTAGACACTTATCTATGCCACAAAACAGCTGAATTTGTCACATTTATATAT 

TGCAATAT 

Sequence 2456 

AGGTCTTCGACCCACGCGTCCGGTGGCTTATGCCTGTAATCCCAGCACTTTGGGAGGCCG 
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AGGCAGGCGGATCACAAGGTCAAGAGATGGAGACCATCCTGGCCAACGTGGTGAAACCCC 

ATCTCTACTAAAAATACAAAAAGTAGCTGGGCGTGGTGGCACACGCCTGTAGTCCCGGCT 

ACTCGGGAGGCTAAGGCAGGAGAATCGCTTGAACCTGGGAGGCGGAGGTTGCAGTGAGCG 

GAGACCACGTCGCTGCACTCCAGCCTGGTTGACAGACCGAGACTTCTTTTTCAAAAAAAA 

AAAAAAAAAAAGTGCGACCTGCCTNGGGGGNCGGTTTAAAAAATNGTGGATTNCCCCGGG 

CTGNAGGAATTTTCGATNTTCAAAGCTTTATTNNATTACCGTNCGACCTTTNGGGGGGGG 

GCCCCGGTACCCCAACI I I 1 1 I 

Sequence 2457 

CTATACACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTCAAGCTTCGA 

CCCACGCGTGCGCMTTTTTAGGCCCACAAGGAGTCAAGCACCTCAAGGAGATCTTCAGT 

TTGAACTTGGTGTAGACACAGGGATACTGATGAATCAATATTCAAATTAGCTGTTACCTA 

CTTAAGAAAGAGAGGAGACCTTGGGGATTTCGAGGAAG 

Sequence 2458 

GCTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTAGAGTCGGGAT 

GCACAACTTCAACCACCGACTTATCAATGCAGCCGCCTGTGTATTGCAATTGGCCGTTAC 

CTTMGCACTGAGCCACCCGGGTTTAGTTCAGCCATTTCAAGAAGTATATTTAACGTCGG 

TAGTTCTGCTTTATTAAAATGCAGCAGAGGTACCTGCCCGGGCGGCC 

I I I 1 1 I I I I I I I I I I AATGAAATTGAGGATTTAACCAAAATGTGAGTTCTAATTAGGACA 

ATAMATTAATCCTAACTACATTTCCTTATAAATACAMAATGTGTTTTGGAAACACATC 

CCACCTGGGGCGCTAAGCGATGAGAGTAAGCAATTATAATGGCTGACTCCTCCAGCTCTT 

TGCTTTG AGTTG CCGTA AG AAAAAA 

Sequence 2459 

CCGCGGTGGCGGCCGCTACCTGCCCGGCGGCCGGCCGCCCGGGCAGGTCTTCGACCCACG 

CGTCCGAACTAATTGGCTTTTTAGAMCACCCCACAAAAGCTCAGAAATTGGCTTTAAAA 

AAACAACCACCAAAAAAAATCAATTGGCTAAAAAAAAAAAAAAAAAAAAAAGTGGGCCGC 

TCACCT 

Sequence 2460 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTCTTCGACC 

CACGCGTCCGGACAGCTCGTGTCCACAGGGGTCATGGGGTCTCTTGCTGCTAGGATTCTG 

GAGTTTCATGGTAAGAGCGGGCCACTCCCCACCTATTCAACTCACCCCTTCCCCAGGAGT 

TAATGGGGGCTAGTAAGGAATGCTAGTGCTTGGAAGCCCTGTGCAGGCTTCTGAGATTCC 

TGCCCCCTCAGCCCATGCTCTGCATCCTCCCTTCATCCACCCTCAATGTTTTCCTTCAAA 

AGATCTGCTCAGAGTGTGCCAGTCTTCCCAAATTCCTGGTCTCTCCATGAGAGATGTTCT 

TCCTGGCTGCTTCTAGTTGGCCATCTGGTCTTGAGTCCTNTGTACCTCGGCC 

Sequence 2461 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCACI I 1 1 II I I I I I ! I I I I I I I I I I I I 
I I I I I I GGGTTTCTCTTTGAAAGNTTATTGNTTTCTTTAAAAAAAAAAAAACCTATACCT 
TTTATATTTTACATTCACCTNTCANAATATTTAATGGNACCTGCCCG 
Sequence 2462 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCAGGAT 
TACCATGGCAACAACACATCATCAGTAGGGTAAAACTAACCTGTCTCACGACGGNCTAAA 
CCCAGTAGAAACAAAGT 
Sequence 2463 

ACGACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTAAAGTTA 
TTTTAGTCATGAAATTTTATATGCAGAGAGAAAAAGTTACCGAGACAGAAAACAAATCTA 
AGTCGA 
Sequence 2464 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGNGGCGGCCGAGGTTTCAAGACCAGCCTGGC 
CAACATGGTGAAACCCCATCTCTACTAAAAATATAAAAATCAGCCGGGCATGGTGGCATG 
TGCCTGTAATCCCAGCTACTCAGGAGTCTGAGGAGGAGAATCACTTGAACCTGGAGGCAG 
AGGTTGCAGTGAGTCGAGGTTGCGCTACTGCACTCCAGCCTGGACAACAGAGGGAGACTC 
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TGTCTCAAAAAAAAAAAAACCTACAGCTGTTCAAGGACCAGCTGACAGGTCAAGTGTGGC 
CTTTTCTGGTCTTTGAACACATCATAGAAAGTGNCAAATGCTGCAAAGCCATNAAGAACA 
TGAACTATAAACGGGT 
Sequence 2465 

CCGCGGTGGCGGCCGAGGTACAAGATAGGTATGGATTCCAGNCTGGAGAAACCCNTAAAC 
CACTACACCCTGCCTCANAGNAGGGNAGAATTNTCAGTATGTATGTGGAGACAGGCTCGG 
ACGCGTGGGTCGAAGACCTGCCCGNN 
Sequence 2466 

CCGCGGTGGCGGCCGAGGTCTGATTACAAAAATTTAAAAAATATGCAACTTCAGGCATAA 
AGGGTTAAGGGGCAGATCTTCACAAACTCCACACAAGCCAGCAAGCACAGCACCTGCCCG 

Sequence 2467 

TACT TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I 1 I 1 I 11 I I I 1 1 1 I I 1 I 

TTTTMCCTTTCCTTATGAGCATGCCTGTGTTGGGTTGACAGNGAGGGTAATAATGGCTT 

GTTGGTTGATTGTAGCGGACGCGTGGGTNGAAGCTTGACCTNGGCCGCTCTAGAACTAGT 

Sequence 2468 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGCTTGTACAGTATGTGAAGAAGTTGCAC 

CTCATGAGCTGATCACTGATCAAGTTAAAAACACCAGACATACATTATACGTCTAATAAA 

TTTCCTTCAGGTCATGACCAGCATGAAAAAATNAGACACAGACACTCAGCAACCATTCTG 

TAAATGTGTACCT 

Sequence 2469 

CTACTTAGGGCGATTGGNNCCTCNCNGNGGGGGCGGCCGAGGGCCTGATCTAACTGCTGC 

CAAGACATGCCAAGGTCACCAGCTTCCCGTCCTGACCAACAGCCACCTCTCAGAAGGCTG 

GCTTAACTACTCANCCCCAGTCTTNTTAGCAGATGGCAAGGAGCTCTGGCCMGATTTTA 

GTCTAAACAGAACCCTAGGCTGCTGATGCAACATCAAGCAC 

Sequence 2470 

GGGATGTGGTTTAAGTGGATTACTGCCTANCTGAGCCAAAATGTTTGCTTGTACCTNnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnACTTNTTTCCTGGNTTAAMTGTTATTTCGGTTTAN 

MTTTTTCCCCANNCATACCMNNCCCGGGGGGNTTNAAAATTTNTNAAAANCCGGGGGG 

GGCCTTNATGGGGGNGGNGCNANNCNCCCNNATTAATTANNGNNNGNGGNTCNCNNTNGC 

CCCI I 1 1 I I I IAAAAMNGGGAAAAAACNNTTTTCCNCGNCCNANTTTGGNNTNTATTTN 

AAATTNGGCCCAACCCCCCCGGGGGGNGGGGGGCCNTTITGGNNTTTTTGGCCCCNNNTT 

CCCCNTTTTNTTTTTTTNTANAAANNANGGGGGGGGGGNGGG 

Sequence 2471 

TACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTGCGGCCGGCCGCCCGGN 

CAGGGTCACGCGTCCGGCCTGCTGCTTTGGACATGATGCTGACTGGTAGAAGCATTCGTG 

CAGACAGGGCAAAGAAAATGGGACTGGTTGACCAACTGGTGGAACCCCTGGGACCAGGAC 

TAAMCCTNCAGAGGAACGGACAATAGMTACCTAGAAGMGTTGCAATTACTTTTGCCA 

MGGACTAGCTGATMGAAGATCTTTTCCAAAGAGAAGACAAGGGTTTGGTGGGAAAAAT 

TGNANAGGGGTTNTTGCNNTNACTTTTTTCMTTTGTTTAGGGCMCAAGGTTTTCCNA^ 

AAAAAAAAAAAAAA 

Sequence 2472 

CTACTTAGGGCGAATTGGANCTCNCCNCCGGGCGGGCGGCCGAGNACTTACACNCNNGCN 

ATCGNTTTCNTGNCNGCAGNNGGATNCACTAGTTNTAGCTTCGGNCGCCACCGNANCCNN 

CNNGAGCATGCATGGMCACATATACCAAACATCTTTCTGATAACATTAMCATTTTTAA 

MGATGTTAAATGTTCTTTTCATTGTGGTGCTTCAGATTCCTGATTCTAGAACTTGTGTG 

TGTGGAACCTGTGTGCTMCTATTCTGTTGGAATTTACCAGCAAAGAATTATCTAAGAAT 

TTTCAAACTAAATGATGGGGGGAAGGAACTA 
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Sequence 2473 

CCGCGGTGGCGGCCGCAC 1 1 I I I I I I I I I I I I I I I I I I 1 1 1 I I I I I I GGGGTTTCTCTTT 
GMAGNTTATTGNTTTCTTTAAAAAAAAAAAAAACCTATACCTTTTATATTTTACATTCA 
CCTCTCANAATATTTAATGGTACCTGCCCG 
Sequence 2474 

CCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACGCGTCCGATCCTCTTCTGCTGTTCTG 

GAAAGAAACAGACCAGTGGTATGGAGTATAAGAAAACTGATGCACCTCAACCGGATGTGA 

AGGAAGAGGAAGAAGAGAAGGAAGAGGAAAAGGACAAGGGAGATGAGGAGGAGGAAGGAG 

MGAGAAACTTGMGAGAATCATGCTGTTGCATTTAGAACTTTCTGCTTTGCACAGGAAA 

G AGTC ACACMTTAATCAACATGTATATTTTCTCTATACATAGAGCTCTATTTCTCTACG 

GTTTTATAAAAGCCTTGGGTTCCAACCAGGCAGTAGATGTGCTTCTGAACCGCANGGAGC 

AAACACTGAAATAAAATAGTTTAT 

Sequence 2475 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCAC I I I I M I I 1 1 1 1 I ! I I I I NNGTATTTTN 

AGNANAGATGAGGTTTCACCATGTTGACCAGGCTGGTCTCGAACTCCTGACCTCAGGTGA 

T CCACC CACC TCAG CCTCCCAAAATGCTGGGATTACAGCGTGAGCCACCANGCCCGGCCA 

Ami l GTAACTTTTACAAAGATATTTMTTTAAAATTGATTTTAATAAAAGGTAGACAT 

CCAAAAACACAGGATGATGAATGCACTTCAATGTTAGGGGAATATC 

Sequence 2476 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTGTTACTTTAAACAAAAAA 

TGTTTACTTNTAAGGATATACGCACAAAGGGAACATAATATACAGATAAATGAGAAGTTT 

CGATTCTGCATCAAGCATTATTCAATCGGACGCGTGGGTCGAAGCTTGTAAACCTGCCCG 

GGCGGCCGCTCGAGGCCGCAC I \ I I I I 1 1 I I I I I 1 I I I 1 1 GAGACGGAGTCTCGCTCTGT 

CACCCAGGCTGGAGTGCAGNGGCGCAATCTNGGCTCACTGCAACCTCCACCTCCCAGGTT 

CAAGTGATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAG 

Sequence 2477 

CTAACTATAGGGCGANTGGANCTCNACCGCGNGGNGGCGGCCGNGGNCAAGGCTTCGACC 

CACGCGTCCGNAAAAATTANCCAGGTGTGGTGGCACACTCCTGTAATCCCAGCTACTCAG 

GAGGCTGAGGCAGGAGAATCGCTTGAATCCAGGAGGNGGAGGTTGCAGTAAGCCGAGAAC 

CTACTGCACTCTGGCCTGGGCGACAGAGCAAGACTGTCTTGGGAAAAAAAAAAAAAAAAA 

AGTGCGGCCGNCCGNCCGGGCAGGTCCTAGCTTGAGTCGACCCACGCGGTCC 

Sequence 2478 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAATTGCTAAAACAGCTCCAGG 

GNAAGNNATCTTATTTAGCATTAGCTCCCTCACMCTNTTTTCATTCATACTTNTATTGG 

CCAGNCTCATACCTGAAGTATTTTAAATGAGTTNACAATTATTNCACTTACCNTCAGAAA 

AAAAAAGGAGCAAAAACTCTTAATGACTGGTNACATGCACATTTGGTGTAGGAAATTATT 

ATGNGGTAAMTTTATATATTTCTATTTATTTTTATTMTTTATTNTTNACAC^ 

CAC 

Sequence 2479 

ACTATAGGGC G AATNG GAGCTCCCCGCGGTGGCGGCCGCAC I 1 1 111 11 111 1 1 1 11 1 11 
l I I I M 1 1 ITT GGTTTTCTCTTTGAAAGTTTATTGTTTTCTTTAAAAAAAAAAA 
TATACCTTTTATATTTTACATTCACCTNTNAAAATATTTAATGGTACCTGCCCG 
Sequence 2480 

ACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTCAAGCTTCGACCCACG 
CGTCCGTGGTGAACACAGAGAAGACAGGTCTTGTATATATTCCTCTGTATTCTGGGGAGC 
TTTGACCTTGGAGCTTTGTACCTGCCCG 
Sequence 2481 

CCGGGCAGGTCTACTCAAGTAGTCTTTACCCCCTACTCAAGTAGGGGGTAAAGTGTAGAA 
CAAGGAGTTTGATCTGTGTTCAACTGATTGTGAACCATCAATTGAGATAACTCACTACCT 
TCAAGGCCAGCCAGNTACATACTTTTTGNAAMGCCAAGAGTGGMNCAGGGTTGGTTTT 
TAATCCAA I I I I IGGGC 
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Sequence 2482 

AGGNCGACNTTNTCACAGGCAGNNAAACCGGCCAGNTNAAAACACTATGCTANCTCGCGG 

GGCCANTNTTAGGATGGGTGAGGCAGATGAANCCATTCTCCNANTGGCCAAGGCCGAGGG 

CATCAGCCTCAAAGMCNTTTGACNGGAGAGAATCACANACGTGNNNTATTCGTCATAAA 

NAAANAATGAAAAACCNACC 

Sequence 2483 

AGGTATTCGACCCACGCGTCCGTAGT1TTTATCTTTGACCAACCGAACATGACCAAAAAC 

CAAAAGTGCATTCAACCTTACCAAAAAAAAAAAAAAAAAAAAAAAAAAGTGCGGCCGGCC 

GCCCGGGCAGGTACCAGTAGCAACATATGAGTATTTCTCTAGATAAC 1 I 1 1 I I 1 I GACAA 

GGTCTCACTCTGTTGCCCAGGCTGGAGTGCAATGGTGCAATCTTGGCTCACTGCAGCCTT 

GACCTTCCCTAGCTCAGCTGAACCTCCCATCTCAGGACACCATTGCCTCCACTGCCCATC 

CTGCATCTGCCTGCCTACCCCAAAAGT 

Sequence 2484 

AGGTACATTTTCACTCCTGCTCTAAAACTTGCCTCAGTCTCTCACTGTGCCTTATGCCCC 

TCAGCTGAATTCTTTCTTCTGAGCAGGCAGGAATTGAGGTTGCTGCAGACGTGTATGGAT 

TTGCCACCAGTAACATACTTGGTGCCACATGACTAGGATATGTTCTCTAGTGCTAACATG 

TTCGTTTACAGTTCTTAGGACTCCCTGATAGAAAAAAACACAAAAAAAAACACAAAAAAA 

CCCAACCA ACCA AACAAACAAACAAAAAAAAACACAGGAGTTTCTCCCAAAAAGAAGTCT 

GCAGTGTCTTTTCTGTTTTCTCTGAAGGTATCCCAGGGTGTTAGAT 

Sequence 2485 

AATGTTGCAGGCTACTCTCTGCCGACCAGGCCGCCGCCCGTCCCTCGTGACTACAGCAGG 

TACCTGCCCGGANGNNNNNNANGNAACGNATGTTTCCACCTNCTTCTCCAACCTCTACCC 

CACCATTAGT NGTA TNTTNACTNTNAAAACAGTGGAACCACAGCCCTAAAGACCTGCTNA 

TNNAMGTNCTTTTGTCTTAATTGTATTTAAAAAAAAAAA 

Sequence 2486 

GGGGAAAACCCAGCTCCACCGCGGNGGCGGCCGCCCGGGCTT7TCAAGCNTCGACCNACC 
CGTCCGATA ATTTA TACTAAATTTAGTAAMNGGACTTNTTATTCAAAGCATCAATAATT 
AAAAGAATTATTTTAAAAAAAAAAAAAAAAAAAGGGACCT 
Sequence 2487 

AGGT AC CCTCACCTTGGTCATCTATC CTGAAATAAGGCTTAGTTAGTATTGGCCTGAATG 
TTTTGTGI I I I I I I I I IG I I I I I I I I I I I I I I I I I ACTGTTACTTTGAAAAATATGTATG 
TATACCTTATCATATCTGCCTATATCAC7TACTTTGGGGAGATACTCAGAGCTTTGTGGT 
TATCAGTATACTAAAAAAAAAAAAAAAAAAAGTGCGGCCACCTGCCCG 
Sequence 2488 

AGGTACGCGGGGGAGCCTGTCCAGCTGGCCCGGGCCCTGGCCTGGTTCTCAAGTGTTTCC 

TAGAC AGAGAG GCACCTGGGTCAGTATTAGTCTATTTATCAGAGGTGTAAATAATCTATG 

TATAAG ! ill! CTCCTTTTAGA7TATTTTGTATTTGTTTAAAAGMGTTTTGTCAAAATA 

CAAA MTATA AAGAMTGACTGAAAGTTGTTGACAGGGTTTTTMGAMTMNTTATTCT 

MTTGTTTTTGTTTGGTTTGTTTTTGCCTTGTAAACTAGCGCCAAGGAAC 

Sequence 2489 

CGGGCAGGTACGATGGGAGGACAGCTTTGTAGAAAGGACATTATNCAGCTNATAGCAAAC 

TTTGTGGATCCCAATCCGAGATTTNCCTGCTGAAAGACAAGAAGTNTCTNAMTAAAAGN 

GCTGTANCAGNATTTGTATACTCCAGAATAAGNTTCTGTGATTCTTANCTGCCAATGTGT 

TCAAGGCGTGATGACTNGGTNTCTGTTTCTNTGAACATNAATACTAGGGTCTGTATAAAT 

TTCMTGCATGCCACCAGCTNATCMCCCTTTTGGCTTTGATTTTTGNATGNNGNATTNT 

TATTCCCTANGANTTCCGGCCAAGTACCTTTGGNCGCCACCCGTGGTGGGAGCTCCAATT 

TCGCCCTTATMGTGAAGTCCGNMATTACGCGCCGCCTCANTTGGNNCGGTNAGTTTTT 

ACAACGCCNGANGACCTGGGGAAAAACCCTTGNCCGTTACCCCAACTT 

Sequence 2490 

GNCGGGCAGGACGCGGGACCAGAATGCAGTTCCAGCTTAGGAAGCCACAAACAAGCCACC 
CAGGAGGAACMAACACCGNAGCGTGGATTTTTCAAATTTCCCCCGGGAAAGTAAGTCTC 
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GCTTCCTGCCAAANAA 
Sequence 2491 

CGGGCAGGTNCCTTGCTAAGTTCAGACCTTCTCTTCCTTTNCTTTCCTTTCCTCTCCTGC 

CCATTTTCCTGTTCTTCTGTCCTTCAATACTTCTGNAGCTTCCCATTCATGTTCTCTTCT 

CCCACGCAGGCCNCATTGTGTGCAAAAACTGNGGNGGGGGCTGTGCTGNACTCNTNCCTG 

CCTCCTGCCTCCTGCGGCTGTTGGATTTGGGAATGACCTTTGGTGAGAGNCTCACTGCTC 

C AGGGT CTATTTTTTGGTCCAAAGGCTAGMCCTATANAAGTNGGGMTCACCTNTTTTT 

TCTTTTTCNNGGGTGAAMTAAMTGGGTTTTTTTCMNTTCNMCAAAAANAMAAA 

Sequence 2492 

AGGTACAGAGAACTGAATTTACACAATAAAGTGTTACCCTATACCAGTGATTCTAAAATT 

TTGGTCTGGGGMCCTTTCATGGGGTTCATGAGGTCAMACTATTTTCATGATAATGTTA 

TTTTGCCTGTTATATTCTCATTCTCGAGTATACAGTAGAGTTTTCCAGAGGCCACATGAT 

CCATGACATCACAACAAAATGAATAGAGACAGATGAAAATCTAGCTGGCTTCTGCTACAT 

GAATCAGACATTAAGGGGTCCAGAGACCATAAAGTGTGACAACCACCACTCTTCACAACC 

TATATATAATATCTCAAMTMTGTTAATTCTCTATCCTCAAAGTTTATTTCCTTATATC 

TACATTTTCTATGATCAACACACTCACTACAAAA 

Sequence 2493 

CCGGGCAGGTACCACAAAAACAGTTACATGGTAGAGTTCCAATCACACAGAAAGGAATCC 

ATTCAGCAAATTTCCMAGTTGCAGACTGTTTGGGCCCACCCAGAATTCCACCAGGACTG 

GACGGCCCCCACCAGGCTATACTGGATATGTGGACACAGAGCTTATGCTAAGCTGCCTGA 

TCAGGACAGGTAGCTGTATMTTGGCACCATTAAGCCATCTTTCTTCTTACTGCCCATAA 

AAACAGGTGAACTTCTAGGCTTCCCAGTCTATGCTTCCTGCAAAAAATGAAGCATAGCCA 

TAAGTGATTAGAAAAATAATGAATGGCCCTCTGAAAGAATCATACAATACTATAGACCCG 

CCACTTAGGCACAAGAAGGCTCATGAGGATATC 

Sequence 2494 

AGGTACCTTGGGATTGCAGGTGCCCACCCTTGCGCCTGGCTAAI \ I I I GT A I I I I lAGTA 

NAGACGGGGTTTTGTCATGTTGTCCAGGCTGGTCTCGAAGTCGAGCTCAGGTGATCCGCC 

CATCTCAGCCTTCCAAAATGCTGGGATTACAGGTGTGTGCCACCATGCCCAGCCAACACA 

CACATTTATTTMTGCMGTTTTACCTAGCACAGAGAAGCAGTGAGAGTCAGTTACTTAT 

ATATTGAATTGGACCAAGTAACTTGTGAAGAAGCTAGTAAATATATGGAGGCTAAAAAGC 

TGAGTGGTTCTGTTCTAACAAGGTCTGCACAGTAATCTCTTGGCCTCGACTTCTCATCCT 

TAAAAATAAGGAGATCGTCCTATGTTTC 

Sequence 2495 

NNGGGGGGGGGGGGCCCCCCCCCNGGGNNNAAAAAAAAAANAI TTTTU T TNTTNTTTTT 
TTTTTNAAAAAAAAAMAMMNNNGGGGGGGGGGGGGGGNGGGGGNGGGTCTTTTTTTN 
AAAAAAAAAAAAAAAAAAAAAACCCCCCCCCCTNNN i I I I I I I I I I I I I I I NTNTATNAA 
AANNGGGGGGGGGNNNNNNNNNNNNNNNANATNTNTTTTTNNNNNGGNGGGGGGGGGGGG 

Sequence 2496 

TGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGGGGGCGGAGGTCANGGG 

ACAAGATGGTGCCACCGGTGCAGGTCTCTCCGCTCATCAAGCT 

Sequence 2497 

GGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACAGCGTGGAAGGGTTTANGGCAG 

CAGTGT TCTG ATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAG 

GAGGCCTTTTGAGGCCGCGTAGTCGGTGTTTTTGAACTGACTCTACAGCTTCTGGCAGGC 

CGTGCGGCGCCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTAC 

GCCATCTTTAAGGTTCTAMTGAGAAGAAACTTCAAGAGGTTGATAGTTTATGGAAAGAA 

TTTGAAACTCCAGAGAAAGCAAAAAAAAAAAAAAAAAAANANAGTACCTGCCCG 

Sequence 2498 

AGCTCNCCGCGGTGGCGGCCGNGGACNAGGGNCTGANTGTCTGNGTNTCAGAATGGGATN 
AGTGNCCTTATAAATGAGGGAGCTNGNTTGTCCCTNCCACNACATGAGGTTACAGCAAAA 
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AGATGGCTGTCTATNAACCAGCAAGTTNGCCTTTGNCANACCCCAGATCTGCANACTACC 

TTGATNTTGGACTTTNCATCCTGCACAMTCTAAGANANAMTTACTGNTGTTTATCAAC 

CACTCNGTTNATGGTISTITNTTCGTTATAGCAGCCTGMCTMGACMCAGGTNGATCTTA 

AGGCATNGCTACNATNAAGTCTTTCCNTGCTCAGAATCTCC 

Sequence 2499 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGGAANTGGATGACTT 

TTCTTGTCCATATCACCATGGAAATCTGTGTGCTGGGCATGGAGAGTGNGAAGCANGCAG 

ATGCCAATGCTTCAGTGGCTGGGAAGGTGATCGATGCCAGTGCCCTTCAGCAGCNGCCCA 

GNACTGTGTCAATNCAAAGGGCCAAGTGTGCAGNGGAAGAGGCACTTGTGTGTGTGGAAG 

GTGTGAGTGCACCGATNCCAGGAGCAT 

Sequence 2500 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTATATGTCTGAATGTCTGTGTCTCAGAATG 

GGATTAGTGACCTTATAAATGAGGGAGCTTGTTTGTCCCTTCCACTACATGAGGTTACAG 

CAAAAAGATGGCTGTCTATGAACCAGCAAGTAGGCCTTTGCCAGACCCCAAATCTGCAGA 

CTACCTTGATCTTGGACTTTCCATCCTGCACAMTCTAAGAMTAMTTACTGTTGTTTA 

TCAACCACTCAGTTTATGGTATTTTTGTTATAGCAGCCTGAACTAAGACAACAGGTAGAT 

CTTAAGGCATAGCTACAATTAAGTCTTTCCATGCTCAGAATCTCCATCTGCTGGCCAAGC 

ATAGTGGTTCGCACTTGTAATCCTGGCACTTTGGGAGGCCAAGGGCGGGTGGGTCACCTG 

AGGTCAGGAGTTTGAGACCAGCTTGGCTAACATGGCAAAACTCTGTCTCTACTAAAAATA 

CAAAAAATTAGCCGGGTGTGGCGGNGGGTGCCTGTAATCCCAGC 

Sequence 2501 

NTGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGCCGGGCCATTNTTCTTTCTTT 
1 1 I 1 1 1 J I NGCGGATGGGGACTTGTGAATTNTTCTAAAGGCGCTATTTAACATGGGAG 
Sequence 2502 

AGGG GCGAAATTGGGAGCCTCCACCGCGGTGGGCCGGGCCCGAGGTACTTTTC I I I I I I I 
TTTTGNTGGATGGGGGACTTGGNGMTTTTTTCTAAAAGGGGGCTATTTNAACATGGGGA 
AGGANAAGCGTTGTGCCGGTTTCCA 
Sequence 2503 

CCGGGCAATCTAAGAAGACATGATCACTAAATGTGATGTGGGATCCCAGATGGGATCCTG 

GACCAGGTAAAMCTAAAGTMTGTTTCAACTTCAGTAAATAATAATGTATCAATATTGG 

TCCATTAATTGTGGCAAATGTGCCACACTAATGCAAAGCGTTAGTAACAGGGAAAACTGG 

GAGCAGGGTATATGAGMCTTTTTGMCTGTTTTCACAATTCTCCTGTAAATCTAAAACT 

CTTCTGAAMTAGAGTTTATTCTTTAAAAGTGTCTGGAGGATGTGCACAAGGGTGTGGCA 

GCAGAGGGGCTACAGGTAAAAAATCATGACATCTGGAATATTTCCTTCAA1 Mil GCTCC 

ACACGGTGACTATCTTACCCTGCTCCC 

Sequence 2504 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGGTACCCGGGGGGTCTGAGG 

ACCCGAGGTCGTAGGTGGATCTTTTTCACGGAGCAAAGAGCAGGAGGACAGGGGATTGAT 

CTCCCAAGGGAGGTCCCCCCGATCCGAGTCACAGCACCAAATTTCATGCGCATCTGTGTA 

MGAGACCAACMGCAGGCTTTGTGTGAGCAGCMGGCTGTTTATTTCACCTGGGTGCAG 

GCGGGCTGAGTCTGAAAAGAGAGTCAGTGAAGGAAGATGGGGTGGGGCCGTTTTGTAAGA 

TTTGGGTAGGTAAAGGAAAATTACAGTCAAAGGGGGGTTGTTCTCTGGCAGGAGTGGGGG 

TCACMGGTGCTCAGTAGGGGAGCTTTTGAGCCAGGATGAGCCAGGAGAAGGAATTTCAC 

MGATMTGTCATCAGTTMGGCAAGMCAGGCCATTTTCATTT 

Sequence 2505 

CCGCGGTGGCGGCCGCCCGGGCAGGTACTAACAAATGCAGTAGCCAACAAGATTACCATG 

CMTCATTAAGGAGAACCAAAGTAAGAGAGCCACTCAAACCAGATTTTGAACGCTACTAA 

AATTAAAGTAGTTCTTTGATGMTATGAATGAGTAGGGAAAGGATTCTTTGTAATAGTGA 

TACCTCTGTGGTMGAGAAGGGTGGTATGTGAGTTTTAGTCTACAGATTATGGCAAATTC 

AGTGACAACAATCAAATGGTCTAAGATTGACAGTAGCACAGTTTTACTCTGTGAAGGTAA 

TGTTCAGGACAAATTTCAAGAAAACTAGAAAACCATTCTTTACAGCTGAAATCTTTCCCT 
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MCCATTGTTATTTCCACTTTTAAGTCCTCAAGAGATGAGAAAAGGGAGGTAAGGCTTCC 

TTATACATTTCCTGCACAATGAAACATTTTTCCTCCTCCAGGCAAAAGATTCAAGCAGAA 

CTGGCAAATATCTTATCTTGCTCTTCTTAATAATATAATGGTGGTAGGATATAAAGGTCT 

ATACAATTAACCTANAT 

Sequence 2506 

ACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGNCNGGCCAGGTACACNTTC 

TTAGACCTCAACCTCGAACTNTCAAAATTNAGGATGTCTCANCCCTACTGAGGCCGGGAG 

TCACCTNNACACTGANGGCCCTNGGTGNGAAGATGAACCTTNCACCGTCTNTANTGCATT 

CTGGAGTGCAAAAATAAAATCCACTNAAGAGTCACAAGGCCCGCTGTGCATAATNGGNTT 

CACTTTTACC I I I I I I I 

Sequence 2507 

CCGCGGTGGCGGCCGGCCGGGCAGGTACATGTAATGCTCCTGAACTGTATGCTTCGCACG 

GCTGACATGCTAAGNTTTGTTCTGTGTATTTTATGACTATTTTTTAAAAAGTAAA 

AAGAATTAGCTGGAAATACCAGCACAGGCAAACCCCTGGAGACAGAAAGCAGGTGAGTGG 

NTGCTGGGGCTTGAGCAGGAGGAAGGGCGAGGGACTGCANAATGGCCATGGGCTTTGCCT 

TCTAGCATGATGAGAATGTTCTGGAATTAGACAGTGGTAACGCTTGTTCAACACTGCCAG 

TGTA GTTM TGTCACTGAATTATACACTTTAAATGGCTAACATGACCAATTTTATGTTAT 

ATATATTTTACTACCACAAAAAAACTANCTGGCACCTAAAAACATTCCATTGAACAGGCC 

CCTTCAGATCTGTGTCTTTCCTGCATGCAAATTACNCCACAGAGCAAGCACCTATGGCAN 

CGTGGATCACAGGCTCTGTTTTANGATAGANAAAGGACACAAGGNGTCCCCC 

Sequence 2508 

ACTTAGGG CGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTTCTTTACTTTT 

I I I I I I I I IGAGACGGAGNCTCACTCTGTCGCCAGGCTGGCGCGATCTTGGCTCACTGCA 

ACCTCTGCCTCACGGGTTCAAGCGATTTTCCTGCTTCAGCCTCCTGAGTAGCTGGGACTA 

CAGGCACACGCCACCATGCCCAGCTCATTTTTGTATTTTTAGTGGAGATGGGGTTTCACC 

ATGTTGGCCAGGATGGCCTCCATCTNTTGACCTTGTGATCCGCCCGACTCGGCCTNCCAA 

AATGCTGGGATTACAGGCGTGAGCCATCAAGTCTGGCGAGAGAGATTGTTTCTAGATGAG 

GGNGGGGGCCGGNTGTCCTTANCCCAAAGCTTGTGCCAGTCTCTATCAGAAATAAATGCC 

CCCAAAACCAAAAAAAAAAAAAAATAAGGTACCT 

Sequence 2509 

GCCG GGCAG GTACTACNTCAGCAATTTCTCCANNGNGCNGNNCGACAGCATATGGCACCA 

GCCCATTTTCAATTTGCTGAGCATCGGCCAAAGCCTGTATGCGAAAGCCAAGGAGCTGGA 

CAGAGTGAAGGAAATTCAGGAGCAGCTCTTCCATATCAAGAAGCTGTTGAAGACCTGTAG 

GTTTGCTAACAGNGCATTAAAGGAGTTCGAGCAGGTGCCGGGACACTTGACTGATGAGCT 

CCACCTGTTCTCCCTTGAGGACCTGGTCAGGATCAAGAAAGGGCTGCTGGCACCCTTACT 

CAAGGACATTCTGAAAGCTTCCCTTGCACATGTGGCTGGCTGTGAGCTGTGTCAAGGAAA 

GGGCTTTATTTGTG 

Sequence 2510 

CCGGGCAGGTACAATTGNTTTAGAAGATANTTTC 

ACTAAAGACAAATCATGGTAGGATTGGNTTGTTTATTATACTTGGCCTAACTATTTGTAT 
ACAATGCAGCAAGAATGATTATTTTTTACTTAGGCTTTTAAGTAGGCTCTGATGGAACTT 
TGTTCCATAGCAGGAATCTCAGATAAGACTTTGTAAAACCCGTAAAACTCANCCGAGCCA 
TGGATTTATGCCATTAMTACCCATGAGTTGGGTGAMTTTCCTNTCCTTTNGAGGGCCC 

AAGATAAACCTGGGGCGTCTGCACTTGNCAAAAAGTGATATTCTTTACTTACACAG 
Sequence 251 1 

AGGTACACNGNTNNAATCTTACTTCAACTTTNAANGGGCCCACNNAACCCTCTATATCCC 

CTTGTAAATTTAACTGCTAGTCCACAAGAGGAACAGCTCTTTNTACACTAGGAAAAAACC 

TNGTNGAGAGAGTANAAAATTTAACACCCATANTNNGCCTAAAAGCAGCCACCAATTAAG 

AAAGCGTTCANGCTNAACACCCACTACCTAAAANATCCCAAAAAAAANAAAANAAAGNAC 

GCGNGGANGTGTNMATTTTNANAGAAGAATTTNTNTTTAGTTCTTTGCACGAAGGNANA 

GATAAAGACACTTTTTCAAAA 
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Sequence 2512 

CCGGGCAGGTACGCGGGGACTGAAAATNGGACTGTTCAACTCACCTGGCAGCCACTCCCA 

GAGCTCCTGGAACTCTGGCCCAAGGTTCTCTGACTGACTCCTTCTTGGCTTACTGGCTGA 

AGACTGACGCTGCCTGATCGCCTCAGAAGCCCCGCAGACCATCATGGACGCCGAGCTTTA 

GGTAACTCACAGTGGAGGCCCGCCTGCACCCAGGTGAAATAAACAGCCTTGTTGCTCACA 

CAAAGCCTGTTTGGTGGTCTCTTCACACAGACGCGCATGAAAGGGAAGACATACAAAAAC 

AAGGCCTCTGAGGTAGGTACCT 

Sequence 251 3 

CCGCGGTGGCGGCCGAGGTACTTN I I I I I I 1 1 1 I I 1 1 I I I GGGTTTCCGGGGTCGTNTTG 
GGGGTCTCAATA I I I 1 1 GGCTCCTCTCCTTTACAGACACCTTGTATTTTCAAAG I ITTT C 
TTGGAGTCNAATTCCTGATCAGAAGTTTGAATGGTTGTTACTGCTGTGTTTTTCATGTCA 
ACAATTTCTT1TNTTTGTTNCCTG 

GCGCTTTCTTTCCTTTTCTCCTGTA I I I I I I GGTATCTTNTTGAACACAAN AATGCTCTG 

ACACAAGCTTGAATCAAAATMGCTTGTCMTAGCTTCCTTTCGCATTAGATTTAACTGC 

TCCACGNGATMTACTTAAGGMCACTTTTGTTTTTCCAAGAGCCCAGTTATCGAGACCA 

GCTTTTTCCAAAATGGTGGCACAAGTGTCAGGGCTCATGCGGG 

Sequence 2514 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC l I I 1 1 1 1 I I I 1 1 1 I I I 

TTTTTACTTTTTAAACTTCTTGTTAAAMTGMGACACAAACAAACACATTAGCCTAGGC 

TTACACAGGGTCAGGATTATCMGATGTCACTTAGGCGATTGAAATTTTTCAGCTCCATT 

ACCATCTTATGGGACCACCATCCTATAAGCAGTCTGTCATTGACCTAAACATCATTATTC 

AGCACA AGCGTATTTCAAATTTAGAGTTTTACTTTGATGTTCTTC I 1 1 I I I ICTTTTTCT 

TTTTTTGAGACGGAGTCTCACTNTGTCGCCCAGGCTGCAGNGCAGTGGTGCAATCTCGGC 

TCACTGCAACCTCCAGTTTGGGCCACAGAGCGAGACAGCGAGACTCGGTCTCAAAAAAAA 

AAAGAAAAGAAAAGTACCTGCC 

Sequence 251 5 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGAAATAAAC 

TCAAAGGATAGGTTCAACACCATAATAGAGAGGACAGACAAAAAGAATCTGTGAACTTGA 

AAATTGAACAATAGGAATGACCCAATTTGAATAACAGAGATAAATGAACAGAGCCTCAGG 

GAATTGTGGGAGTATAACAAAAGATCTAACACTGATGTCGTCAAAGTCCCAAAGGAGAGA 

GAGGGGATGGGGCTGAAAAAGCACCTGAAGAAGTAATGGCTGGAAGCTCCCCAAATTTGG 

CAAAACACATAAGCCTACAGATTCAAGAAGCTGGGTGAATCCCAAATGAGAGAAATCCCT 

TCAAATTCACACAAAGACACATTATAGTCAAATTTNTGAAAACTNNCANATAAAAAGAAN 

TNGNCAAATNNGGTACCTGCCCG 

Sequence 251 6 

CCGCGGTGGCGGCCGAGGTAC I I I I I MCI I I I ! I I I I I I I I I I I I I I I I I I GAGACGGA 

GTCTCACTCTTGTCACCCAGGCTGGAGTGCAGCGGTGTGATCTTGGCTCACTGCAAGCTC 

CACCTCCCAGGTTCAAGCCGTTCTCCTGCCTCAGCCTCCCAAGTAGCTGGGACTATAGGC 

GCCTGCCACCACGCACAGCTAA I I I I I I I GTATTTTTAGTAGANACAGGTTTTCACCACG 

TTGGCCAGACTGGTCTCAAAACTTNTAACCTCAGGNGATCCACCCGCCTCAAAGNGCTGG 

GATTACAGGCGTGAGCCACCGCACCCGGCCCAGACTCCTTAAAATGTGANAAGTAGCACT 

GAGGMTGTGATCAGATTATGGCCTTGATTGGCACATGGGGTCGCTTTCCACGGTTGGCC 

TTCTTGTTCTCCACGGCATCTTGTGCATMGCCATTTGCCATTTCAGGAGCTCAGCATGC 

ACATCCCGGGATTTCTGNGCTTGGGT 

Sequence 2517 

CCGCGGTGGCGGCCGAGGTACCTGTGACATCATAATTGCACCCTCCGACATGATATCTCT 

TCCAAATGCTTGATGAAAAAGGTGGGAGGATCACTTGAGCCAGGGAGTTGAAGGCTGCAG 

TGAGCCCTGTTTTTGCCACCACACTCCAGCTTGGGATTGATTCCTAAAGACTCATGTTAC 

GTGAGGAAGCAGCTCAGAAGAGGAAAGGAAAGGAGCCAGGCATGGCTCTTCCTCAGGGAC 

GCCTGACTTTCCGGGATGTGGCTATAGAATTCTCATTGGCAGAGTGGAAATGCCTGAACC 

CTTCACAGAGGGCTTTGTACCTGCCCG 
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Sequence 2518 

CCGCGGTGGCGGCCGAGGTACT 1 1 1 1 I I I I I I I I I I I I I I I I I I I I GGGGCTGGAAGGGG 

GGCAGAGTCTCACTCTGTTGTCTAGGCTGGAGNGCAGTGACAGAATCACCGCTCACTGCA 

ACCTCTGCCTTCCAGGTTCAAGTGATTCTNGCGCCTTAGCCTCCGGAGTAGCTGGGATTA 

CMGCTAATCCCANCTAGGCGTGCGCCACCACACTCGGCTAATTTTNGTTATTTTTATTA 

TAGTANAGACGGGGTTTCACCATGTTGGCCAGGCTTGTCTCAAACTCCAGACCTCAAGTG 

ATCCACCCGCCTTGGCCTCCCAMGTGCTGGGATTACAGGTGCGAGTATATGCTTTTAAA 

GGGTATCCAATCAAGCTAACTATGGTGATGGAATGTCTCCAGTTCCTCTGTAATACACGT 

ATCGTCCAGCCCGCGTACCTGCCCG 

Sequence 2519 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGAGA 

TTANGGGAGTGGTGATGACTCTTAACAAGCATGCTGCCTTCAAGCATTTGTTTAACAAAG 

CACACCTTGCACAGCCCTTAAGCCATTTAACCCTGAGTTGACACAGCACATGTCTCAGGG 

AGCACAGGGTTGGGGGTAGGGTTACAGATTAACAGCATCTTMGGCAGAAGMTTTTTCT 

TACAGAACAAA ATGGA GTCTCCTATGTCTACTTCTTTCTACACAGACACAGTAACAATCT 

GATCTCTCTTGCTTTTCCCCACAACCTCAGCCTCTCAGAGTGCTGGGATTACAGGCATGA 

GCCACCGCGCCCAGCCTCCCTTTTAMGCACTTTCTGAAGTCAAGCCTGATTCAGGATTG 

CAAGCCTGCAG AGAAC TATGGTGGTAAAGCCTAAAAAGATAGAATCCTTCCCACACCTGA 

GMGGCAGGTATTTTTAGAAGGAAACACCAGAATCACACTTAAGTCACTGCAAAGGCATT 

CATGTTTATACATTTCTGMCTGTCTTACTTGGAACTTTATGNGG 

Sequence 2520 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCATGGTGGCTCACGCCTGTAAT 

CCCTCCACTTTGGGAGGCCAAGGCAGGTGGATCACCTGAGGTTGGGAGTTCGAGACCAGC 

CTGACCAACATGGAGAAACCCCGTCTCTACTAAAAATACAAAATTAGCTAGGCNTGGCGG 

CACATGCCCGTAATCCCAGCTACTCCGGAGGCTGAGGCAGGAGAATCACTCGAACTCGGG 

AGGCANAGGTTGNGGTGAGCTNANATCACACCATTGCACTNCAGCCTGGCCAACAAGAGT 

GAAACTCCATCTCAAANAAAAAAAAAAAGGAAACATGAAGCCTTCCTTNAATGATGATAG 

TTTCTAAAGTGMTTATTTGAATCTCTTTGCATGTTTTGGCTCTGTTAATCTAACTCTTG 

TCTCTAAATAGATGCTGAAAGTGTAAATCTAGATGACTATAACATACACGTNATTGCAAG 

TGTATTCAA 

Sequence 2521 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACTTATTCI MIN I UM 



M M I IGGTT ATGA AAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGAT 

TGAATCTA CATTTTG GCAGAAGTTAAACATTCCCACATAATGTCAAAATTATACATCATG 

CAGTTCTG M M M I GTTTGTTTTATTTTGTTTTGTTTTTGAGTCTGGCTCTGTCACCCA 

GGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCTG 

CCTCAGCCTCCCGAGTAGCTGAGATTACGGGTGCGCGCCACCACACTTGGCTMTTTTTG 

TATTATTAGTAGAGACGGGGTTTCAGCATGTTGGCTAGGCCGGTCTNTCCTGACCTCAGG 

GTGATCAGCCCACCTCGGCCTCACAAAGTGCTGGGATTACAGGCGTGAGCCACCTTGCCC 

AGCCCACATCATACAGTTTGAMTGAAACTTTGCCACAACCAGCCTTTGCTGTAGCACAC 

AC ATATATC ACTG AACCTGTTTG N AATAAAAGTTTTTTTT 

Sequence 2522 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I M M M I M M M M M M I M M M M M I 

TTTNGGTCATTTGNCCTTAGTTTTTAAAMCAAMGTGTTAACTAGAATNTAACAACAGA 

TCAMTCCAAACNCA GCAG TCCAGNGGAGAATCAMACTTTTCCGGCTTTATTNTNTGGG 

MAACCCCTGGTCTGTTTTCATTCCTATTGGNCCAGGCCACCATTNTATGATATGAAGGC 

CTAAATTAGGAAAGCTAGGNGAGCTGNGCAAATTNTGGGTGTCTGANCCNCCTGTTGTTT 

GGCGTGTGATGGGGGTGGAGGCCCNCACAGGGGTGTTCTCGCTAGNGTTCAAATCACAAA 

AACAGGGACCGTAACTAGGGGGAGGNGAGCAAAGCNCTCACCTTGGGCACAAAATTTAAG 

GNGTGCCAAAAAACCCAGTAACCAAAGATAAATACTNTTTTMTGCAACATTTTTAAAAA 

ATCCMTTAMTGTAMMGGTTTTTGATGGACAATGTTTCNAAAATTTTNAATAAAAGG 
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GTTCCCCCCGTAC 1 1 1 I I 1 I I 1 I 1 I i 
Sequence 2523 

ACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACACACATTACACTATCCC 

TTATCAAATATACTGAATGTAAGTAGGCTGGGTGCAGTGGCTCATGCCTGTAATTTCAGC 

AACTTGGGAGGCCAAGGTGGGTGGATCACCTGAGGTGGGGAGTTGGAGACCAGCCTGGCC 

AACATGGTGAAACCCCATCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGTGGCACAC 

GCCTGTAATCCCAGCTACTCGGGAGGCTGAGGCAGGAGAACTGCTTGAACTCGGGAGGTA 

GAGGTTGCAATGAGCTGAGACTGCACCACCGCACTCCACCCTGGGTGACAGAGCAAGACG 

ACGTCTCAAAAAAAAAAAAGTMTTAMGCTTGTTTCGATTACTTAGGCTCATCAATAGT 

AAGATCAACATGGTACCTGCCCG 

Sequence 2524 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGNCCGGGGTCCTTTGTCCTNAAAACGT 

TAGATCTGGGCTCTGATTCCTAACTGCCACTCGTGATCACAGAGGTGCAGAGAGTCATTA 

GCCATTGTTGCACTTCCATTGTTGCAAGCCCCTCACCTTCTGGTTGAAGTGAC7TTCAGG 

AGAATTAAMGAGCTGGNGTTTCCCTCCCCACTCCCATTCATTTTTCTCATTTTCCCCTT 

TTGGGAGCCAGACATAAACACAATTGCNTATATGAGAGAAATCAGAAAGTCATGGCACAG 

CCTAGGGGATGGTGCAGGCTCAGAAMGACCTCTGAGAAGACCTTAGATTTATACTTCAG 

ACTCATCTTAGGCAAAGAGGGCTTACAACAATCAAAAAAACAAAAATAATAAAAACAGTA 

ACAAAAACAGCAAAC CGTG NAGAAGAGGAAGAATCTGATTTCCAGAGTTAACACATTAGT 

AAATTCTMTGTCTGGTTTTCAACAAAMTATCACAAAGGGATTACCAGAGAAACAGGGA 

AAGTGTAACCCCATTCAAATTGAAACCTAAAGAGACTTAANAAACAANAGGGATTTAAAG 

GTCAAG AAAA I I I 1 1 IA 

Sequence 2525 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCGGAAGGTCACGGGCTT 

GAGTCGTAGCCAGGGATAGNATTATGGGTGAGAATAGACAGGATGGAGGACACGAGGCCA 

CAAAGCCCCACAATCCCAGGGCCACAGCGCCAGAGGCCTAGTTTGTCCAGAGGAATGAAG 

AGATCACAGGCATTTCTGACCACGTNTAGCANAAGTGGGGGATGACCTNTAAGGGCTCGA 

GCCAGGAGCNTGACTTGCAGCCCNAGTTTCAGAGCCAGTTGGGGCANACCTCCTCCTGGA 

GTCCTGGTCCCCCMGTCCCCCAGTTTTNATTTCCTCNTGGGACTCNTCTTCAAAANCNT 

TTAATNCNCCGNTTNNANGNAAANNATTTTGGTTCATTNATANGNCNANrrTTNTNAACCT 

TNCNGGTTAAATTNATNANTNNGGAAAACNAATTGNACCNNCCCNNGCNNNAGCNGACCC 

CCNGCCCGTAC NI I I 1 1 I I I I 1 1 1 1 I I 1 1 I 

Sequence 2526 

CCGGGCAGGTACGCGGGGCTTCTGTTGGGCGTTTCTGCTGAGAGGCGGGAGGCGCCGAGA 

GTCTGTGCGMGGTCCGTGGACAGACTGCTTTGCCTGTTGTTGCTCTTCGGAGGCGGCGA 

TCCCCGAAGGCGAGCTGAAATACGGCTGCAGGCTACAATTTGCAGCCGACCATTATGGAT 

GACAAGGAGCCGAAGAGGTGGCCCACCCTCAGGGACCGCTTGTGCTCGGATGGCTTCTTA 

TTTCCCCAATACCCCATTAAACCGTATCATCTGAAGGGGATCCACAGAGTTGTCTTCTAT 

CGTGATCTGGAGGAACTGAAGTTCGTTCTGCTCACCGCCTTATGACATCAATAAGAGAGA 

CAGGAAGGAAAGGACCGCCCTACATTTGGCCTGTGCCACTGGCCAACCGG 

Sequence 2527 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTCAAGAAGTAAGTGGAA 

CACCTTTCCCTGTCATAGTTATTTTCATCCAGACATCTGGTGGAAGCATCAGATTCCGAC 

AAACAAGGATTTATGTCAGGATCTCTCAGCCTCTGTGTTACCGAGGGCATTCTAACAGTC 

TTCTTACTCCGGCCTCCGCTTCGCCAGCACCCAGGCCGTCTCCAGCTCCAAACACCCCCG 

CGTACCTGCCCG 

Sequence 2528 

ACACTACTTAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I U I 1 I 1 I I 1 
I M I I I I I M I I INGGAANGGAGTCTCGCTCTGTCACCCAGGCTGGAGTCCAGNGGCACG 
ATCTCGACTCACTGCAACCTCCACCTCCCGGGTTCAAGCTATTCTCCTGCCTCAGCCTCC 
CGAGTAGCTGGGACTACAAGGGATCTGCCACCACNCCCGGCTAATTT7TGTAATTTTAGT 
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AGAGACGGACTTTCACCATATTGGCCAGGCTGGTCTTGAACTCCTGACCTTGNGATCCAC 

CCACCTTGGCCTCCCAAAGNGTTGGGATTACAGGCGTGAGCCACCGCGCCCGGCTTACTA 

CMGTCATAAGTTTCTTAAAGGCAATGTAAACTCCGAAAACTTAATGCACTCTTATATTG 

NT MTAC ATTAAAATCCACTGGCCTGGCTTACACTTTTGAATCAATCTTTGAGCCATGCA 

TGATTTTGTMCATTCACGTACCTGCCCCGGCGGGCCGNTTrfTAAAACTAGNGGATCCCC 

CGGCTGNAGGAATTCGAATTTAAAGCTTTATCGATCCCGCNNCCTCGANGGGGGGGNCCC 

G 

Sequence 2529 

CTATAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I 1 M I I I 

I I I I I n I 1 1 GTTTTTATTTTTTTATTTATGTTGGCAGAAAAGCTGAGGCAGGGCTTCTG 

ACATAAGGTAAAAGTGTCTTGGAACATGTCCTGGGTCCAGGGTCTATAACCCCTTGNGGC 

CTATGGAACACCAAGCTCTGTGCCAAAGGGTGGAAGGCTGCCCTGCCNCACTACAATNTA 

AGCCCAGAGCATAAAACCCCTTGTAGCCTGNGGAATATATNCANACTCGCTGGCCCCTTG 

CTCNTTGCTCTNCCAAGATCACAAATNGATTGCATNTNNAATTAAAAAAACCTGNTCTCC 

CTTATCNNAAGGTAGCAAGANNCATAGCCAAACCCGTNCAGGNTACCGNTTGNNGCACCA 

ATTA CCTTTNTNTCN TNACGTCCTAACCTGGCNACCCTTACNTCANAACATCCTAATTAC 

CTGGI I I 1 1 I I I I I I GGATTCCCAATAAAAAGGGGGG 

Sequence 2530 

CCGCGGTGGCGGCCGCCCGGGCATGGTACGCGGGGTCCCTACAAATGCAACGTCTGCAAT 

AAAGTCTTCACCCAGCGCTGCTCTCTGGAGTCCCACCTGAAGAAAATCCATGGGGTGTAG 

CAGCAGTATGCCTATAAGCAGCGGCGGGACAAGCTCTACGTCTGCGAGGATTGCGGCTAC 

ACGGGCCCCACCCAGGAGGACCTGTACCT 

Sequence 2531 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGAAGACCTTAT 

TCGATATCTTGAACCAGA GAGA TGGCAGTTGGACTTAGAAGATCTATATAGGCCAACTTG 

GCMCTTCTTGGCMGGCT1TTGTTTTTGGAAGAAAATCCAGAGTGGTGGATCTGAACCT 

TCTAACAGAGGAGGTAAGATTATACAGCTGCACACCTCGTAACTTCTGAGTGTCCATAAG 

GGAAGAACTAAAGAGAACCGATACCATTTTCTGGCCAGGTTGTCTCCTGGTTAAACGCTG 

TGGTGGGAACTGTGCCTGTTGTCTCCACAATTGCA 

Sequence 2532 , 

GGAGCTCNCCGCGGTGGCGGCCGNCCGGGCAGGTACCNCGGGGGTGCCCCGNAAGCAGTT 

GTTGTTGGTTGGGGGCCTTTGGGCCGGTGACGGANACTGCCCAGGTGTTGGNCACCATGT 

TCCTCTNCGCGGTCTTTTTTGCCMGAGCMGTNANATGNMCAMTAGTCCTTTTCGTG 

GAAAAGMMAMTNCGCTCCCTTTNAACGGTGGATTGAAMTGACTNTGNTTTATAAAG 

AGAANACTGAGGGCGGGGATACTGATTCANAAATNCTGTANCGTGTAATAAAAG 

Sequence 2533 

CCGGGCAGGTACAGCTGCATCAGCTGCTCGTAGGACATGTCCAGCAGCTGGTCGAGGTCC 

ACGCCGCGGTAGGTGAACTTGCGGAAGGTCCGCTTCTTCTTCTGCTCTACTTCTGCCACC 

CGCGTACCACGGCTATCCTTATAGCTTTTTAATTAMTGAAGCCAAGTGGGATTTGCATA 

MGTGAATGTTTACCATGMGATAMCTGTTCCTGACTTTATACTATTTTGMTTCATTC 

ATTTCATTGTGATCAGCTAGCTTATTCTTGTGTACCT 

Sequence 2534 

GACATGG CGCCCGCCGCC GCTTCCCCCCCGCGTA C I I I I I I I I I I I I I I I I U I M I I I I 

TTTCTTTCl I I I I I I I I I I I I I I M I M I H I M I M I I I I I I I I I I I I 1 1 I I I I [ 1 1 I I 

n I I n M 1 1 I I I I I I I I | | | | | | | 1 1 | | | | | | | | | | NGGGGGAAATTANAAANCNTTNN 

TTCNAAAAAAAAAAAAAAAAAAAAAAANTAAAAAANTNTAAANAAAAAAAAAAAGGGGGC 

CCCNGNNCNNTAAAAAAAAANNGGNNCCCCCGNGNGGGNGNAANANTTANAAAAATNTTN 

NNANCCCNNNGANAANGGGGGGGGGNNCNNANNNAAMTTTTTTTNNTTTNANGNGGGGA 

AAAANGCNCCCTNGGGNAAAAAAA 

Sequence 2535 

CCGCGGTGGCGGCCGAGGTACTTCTC I I I I I I I I 1 1 I I I I I I I I I ACATCCCATATGACA 
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TTGACTGATTAAAMGGCCTGATCGGTTTTGTTATCTTCCCACACTTCGTTTAATTTAAC 

TTGTGTTTTGTATCTCTTATAAACCATAAGATAGGCCAGGCGCAGTGCCTCATGCCTGTA 

ATTGCAGCACTTTGGGAGGCTGAGGCGGGTGAATCACCCAAGGTCAGGAGTTCAAGACCA 

GCCTGACAACATGGTGAAACACTGTTTTACTAAAAATACAAAAAATTACCCCGGCGGNGG 

GGGNATGCCCCCTGGAAAATCCCACTTCNTCGGGANGGNTGANACAGGGAAAAATCCNTT 

TGMCCCTNGGMGGCAAMGGTTTCNANNGNCTTANAAANCCNNCCATTTGTTTNCCCN 

AGNCTNGGGNGAAAAAGGGCAAAAATTTCCTTCNCCAAAAAAAAAAAAAAAAA 

Sequence 2536 

GACTCCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCGCCCGGGCAGGACAAAACAAA 

GACTTTTCATCMCTCTTTTAGATATGCTAGAAGAGCTAAAGGAAACCATGGACAGAGAA 

CAAAAAAATTAGGAAAGCAATGCCTCATCCAATACAGAATATCAATAAAGAGATTGAAAT 

TGTAGAAAAGAACCAAATAGAAATTCTGGAGTTGAAAAGTATTATAACTAAAACTGAAAA 

TTCACTAGAGGTATTCAGCAGCAGACTGGAGAAGTCAGAAGAAAGAATCAACAGGCTTCA 

AGATAGGTCAATTAAGATTATACAGTCTGAGGAGCAGAAAGGAAAAAGAATGAAGAAAAA 

TGAACAGAGCATAAAAGACCTCTGGGACTCTATCAAGCA 

Sequence 2537 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTGTAAGCGAGCTACACCAAAGAATGGTG 

ATTAGCGCTCCCAGG GTGG GGAAAAATGAGGATTGTTTATATAGGCAAAATGGAGGTGCC 

AAACAGAATTATAACATTTTCGGMCTAAGGCTCATTTAMGATACAAATTTGATTGGCT 

ACTATTGATTACACTTGMGGGGATTTGGTTTMCTTTCTTTtGTNAAAGAAAGNACNAG 

NNNNTAAGGNNNTTTANTTTCCCANCCNTTTAAGTCTTAGGTTTTTGAAAAAAAAANAAT 

ANGGGGGGCCTGGGNTTAAAATTTTNACCCTNNTAAACCCCNAAAAGGNCCAGAAAAAAN 

AMTA AAMNGTTTGGTNGNTTTTTCCCACCGACCCCAGGCTTGGCAAGGCNCCCTAACT 

TTTTTCNTCTTTNGGCCNAAATNCNATTTGGGGMGGGC^^ 

TTTACCATTCCCCTNTTNTANTGGGNAAAANNTT 

Sequence 2538 

CCGCGGTGGCGGCCGCCCGGGCAGGTACATCACAACAGTMMTTTTGCTCTTTGCTTCT 

GGAGGAACACCCTACAACAGGTAATTAAAAAATTTAAAAATAATATGAAAGGGGAGCTAA 

ATACCTGAAAATTTAAACAAAATGAAGCAAAAAAAAGGGAAACAAATTCATTGCAAGAGA 

TGGAAATTACATTTTAAAGGCTGAAAATAATAAAGAAATTCACTAAACAACNGGCCCTAT 

TGNCCTGGCTCCTTCAGAAAAGATTACTGGAANGGAACCTGGGAAGGGGGTTTGNGGAAA 

TACAAAATTTTTGGGANTTNGGGGGGAAANTTANCACNGNGNGGAAANCCCCCCNGNACC 

CCCNI I I I I I I I I INGNGGNCCTTTTTTNAAAAAAAAANNGGGTTCCCCNGGGNCCCNAA 

AAAAAAAAANTAAAAAAA I I I I I I I I CCCCCCCCCCCTTTNAAAAANNGGGGGCCCCCCC 

CCNNNNGGNGGGGNATTTTTNNNTNNNN I 1 I I 1 I I I NCCCCCCCCC 

Sequence 2539 

AGCCTTGGAGCAGGCAATGATCTCCACCCTCCTGGGTGCGGGTGCAGACTCAGGCATCCC 

TGCACTTTTGGGGGCCTGCTTTTGCAGGCTCGGAAGTGCTTGCCTCTGCAGCTTGGCTTC 

TCCCTGCTATTGGCACCAGCTCTCAGATCAAAGCAGGGGGGTAAGCCTGGAAGCCATGAA 

CAGGGAGGGCCTGAAGGCTGAGGGCCAGGCTGCCAGTCCCACCTGTAGGAGTGGGAACTG 

GTGCCTTTTCCAGGCCAGTAAGTATGCACTTNCTCCCCCGAGGCCCGTAAAATGCCCCAG 

GCTCAGCCAGAACTGAGCAGAGGGTGGGACGACCAGCAACAGG 

Sequence 2540 

CCGCGGTGGCGGCCGAGGTACTAGTCTCAAAAGCTGGGGACTCTGAGCCTTACCTAGAGT 

CTCAGCAGGTGGACCATTAAGATTAACATTTCTAGTAGGTGAGTTCAATCACAAAAATAT 

TTC TTGT TCCATAGATTTTATTGTGGCCATGTCAGTGMCACCCACMGTTTTGCTCAGA 

ATATTTTAGGTGTAAGCTAAATCCCTAAATTGTTCAGAGTTCCCACAGCCCTGTAGCAGC 

AGAGCGAGMCTTTMCCAGACTTTTTCAATCCCAAAGCTAATCTGGAGGCCAACAGTGT 

TCAAAACCTTGGTGACTGAGGMCCATTTAGAGTTTTTTCAGGCTCAGGAATCACATGGG 

TCGTTGTTGGGCTTGGGGGTMGTTTCACAGGCGATGAAGCTTGACGTTGAGTCACTTGA 

CTTCTGGAGCCATAATTTATTTTCTCCCAGCAACCTCCT 
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Sequence 2541 

CCGCGGNGGCGGCCGAGGTACTCAATGGAGTTCCTTGGGGGGATCCAAAACCNNGGAGAG 

GAACACAGGCGCTGAGTATGGAGCAGNNTNGAGACCCACCCCACCTGCAGGCCTGACNAG 

AGCGCCNGGATNTCTGGTTCNGATGCCAATACTAGACACCCCAAGNAGNCNGCTCACAGN 

AACCNTTACCTNNTTTTNAACGCCCCTGGGCCTCGCNACGCGCACAGAAANGGGANCATA 

ANCNNCNGAAAGGNACCCACTGAAGCCCATTTCTCCGNNGAGNCANCNNGNAANNNNCAG 

CTNNCNCCCCGCTAAGGAAGANANCNAGNGNNGGGGANCCNGGCCCCNCAACCNCCNGNG 

GAACCCAGCAGNNCNGNNANANCCCCNGCNACCCNAGAAGAANNAAGACGGNNAAANACN 

GGGNGGAACNNNNAANAGANAACACACNNNNNAANAGAAAACGCNNAAAGGAAGNNAGNA 

CCNGNCCGGGCGGCCGNNCNAGAACAAGGGANCCCCCGGGCNGAGGAANNNGANACAAGC 

NAANNGAACCGNCGACCNNGAGGGGGGGNCCGGACCCAGNNNNGGNCCCNNAANGAGGGN 

AAANGNGCGCNNGGNGAANCANGGGCANAGCNGNNCCNGGGNGAAANGNAANCGNNANAA 

NNNNNANAAAAANNANCCGGAGCAAAANGGGAAAG 

Sequence 2542 

ACTATAGGNTTTTATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGGACATTATTTTT 

TCTATTCATCTAGCACNGAAGCCAAATTTGGGTCTAAAATCTTGTGATGGCAACCAGGAG 

CTTTTAAACTTCCTTCGTAGTGATCTCATTGAAGTTGCAACAAGGTTACAAAAAGGAGGA 

CTTGGATATGTGGAAGAAACATCAGAATTTGAAGCCCGNTTCATTTCATTAGAGAAATTG 

AAGGATTTTGGTGAGTGTGTGATTGCCCTTCAGGCCAGTGTCATAAAGAAATTTCTCCAA 

GGCTTCATGGCTCCCAAACAAAAAAAAAAAAAGAAAAAAAAAAAAAAAAGTACAAACTTA 

GAAGAAAATTGGAAGATAGAAACANGATAGAAAATGAAAATATTGNCAAGAGTTTCAGAT 

AGAAAATGAAAMCANGCTAAGACAAGTTTGGGAGAAGTTTAGAAAGATAGAAAAATNTA 

AANGCCCAAAATTGGGATAAATAGCNCTGAAGAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2543 

CCGGGCAGGTACTAGANNACCTTCCTCGCCACTCTCTCCACATGAGAGAGTCAGCTGCCC 

TTTCTCCTGTGCCTCTGCAGGAAGAACTCTCTTGCATGGCACATCTCAGCTCCTCATTGA 

GGGATAGTTTTCTTTGATAAGAAACCTGGAGTCCATTTACTCTGACCTCTCTTTAAATCT 

ATATCCAGAGCCACTAGCCCAGGAAAMCTTGGGTGACCCGTAATTTCTCTTCTCCTGCT 

GTCC1TTTGCTCTTACGCCCCACCCCMCTCCCCTTAAATTTTACAGGCTTATGACAGTT 

TGTATGTGCTCAGCCAATGAGCAGAAAACCTGGAAAGAATTTCTGGACTTTAGCCCACCA 

GTTTGTCTGGTTGACTAACCTGCTGAGAGCTAMMTTGGCACCCATTGCXJCCGTGCXTT 

CAGGCAGTCTCCTGGGGCAGAGTATGCCACCATCCGAATATCAGGCACTGAGTGGGGATG 

TGGGTG ATGCTCAACATGACTGGCTAGAGCTTTGGGGGTGGGNNGGGGGTTAAACTACTA 

TTTTTTTGGCCATGANCTNTTTCCCCTTCC I I I I ! I I I 1 1 AATTAAATAAANGGNTCAAN 

TAAATANTTCAAGGCCTGCCTTTNAAAANAAA I I I 1 1 I I I I A 

Sequence 2544 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTATTTCA 

TATATTGTGTGAGCCCCACAAATGTCTATTTTAAAAAGAGTATAGTCCCTGGCCAGGCGC 

GGTGGCTCACGCCTGTAATCCCAGCAGTTTGGGAGGCCGAGGTGGGCGGATCACCTGAGG 

TCTGGAGTTCGAGACCAGCCTGACCAATATGGTGAAACCCCGTTTCTACTAAAAATACAA 

AATTAGCTGGGCATTGGGGGAGCATGCCTTGTAATCCCAGCTACTCGGGNGGGTTGNGGN 

GNGGAAMNANCTTTGAACCCCCCNANGGCCAAAGGTTTTTTATTTGGGGCCCAAAAAAA 

ACCNCCCTTTTGCCCCTTCANCCCNNGGGGNNAANAAAAAAGGNGGAAACCNTTCCNCTN 

CCCCCAAAAAAAAAAA I I I I I I I I AAAAAAAATTTTAAAAATTTCNAAATTTAAA 

Sequence 2545 

ACTTAG G GCGAA TTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I I I I 1 1 I CI I II I I I I 

mum AGTTTTTGAAATCCCGTCTCTACTAAAAATACAAAAAAAAAAAAAAAAAAAAAT 

TGGCCGGGCATGGNGGCGCATGCCTGTAATCCCAGCTACTCAGGAGGCTGAGGCAGGATA 

ATTGCTTGAACCCGGGAGGNGGAGGTTGCAGTGAGCCGANATCGTGCCACTGCACTCCAG 

CCTGGGCAACANAGTGAGACTTTNTNTCGGAAAAAAAAAAAAAAAGATCTGGNGGGTGAA 

MTMCCNGAATGAAAATAGCTTGAAAACTCACANGNGGGMGCTCCCTTTTACCCTTTT 




WO 01/070979 



PCTAJS01/09126 



TABLE 1 

415/467 



TNTTCC CTTGGCCNGATGGAATCCNNNCCCTTTTTTANAAAAAAAAGGGAAAGNCCNCTT 

TTTTTTNAAAAAANANTGGTTNTNMNATTTTNCNCNCCCCGGGGGGGGGGGNGTTATTA 

AAAAAAANGGGGNCCCCCCCCGGGGNGGGGAAAATNNNTTNAAATTTNTTTNCCCCCCCC 

CCCCCNGGGG 

Sequence 2546 

CCGCGGTGGCGGCCGCCCGGGCAGGTACCTCCATTGTTTCTAGGAAGTAACTAACTTGCT 

TTTGATT TTACA GGCTTGTAGGTGGMTGGGCTTGCCTTGTCTCAGATGAGACTTTGGAC 

TGTGGA CTTTT GAGTTAATGCTGAMTGAGTTGAGACTTTGGGGGACTATTGGGAAGGCA 

TGATTGGTTTTGAAATGTGAAGATAGGAGATTTGGGAGGGACCGGGGTGAAATGATATGG 

TTTCGTCCTGTGTCCCTCACCCAAGTCTCATGTTGCAAGTCCCACAATTCCCACGTATTG 

TTAGAGGTGATTGAATMTGGGGGTGGGTCTTTCCTGTGCTGTTCTTGGGATAGNGAATG 

GGTCTCATGAGATCTGATGGTTTTAAAMCGGGAGTTTNT^GCACMGCTNTCTCTTTT 

NCTGCTGCCATCCATGTAANATGTGACTGGC 

Sequence 2547 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGACCATGACACCA 

AGCATTCCTGTCC CCTCC CCCGCAATGGCAATCAAGTGTCACAGAAAAATAGCCTTTTAA 

MTTACCATGAGCCTTTTATTTTTATTTATrTTrACACCCAGG 

GATCACGGCTCACTGTAGCCTCAAACTCCTGGGCTCAAGCAATCCTCCTGCCTTGGCCTC 

CCAMGTTGGNTNAGGGTTGATGAGCCACTGTGCTCAGCCAACCATGAGGTTTTTAGAGA 

TGATCTTGTNTAMCCCTTCTCGCCTGTTTTGCAGAAAAGAAAATTGAGATCCCAGAGAA 

ATGAAGTAACTTGCCCAAGGTCATTCAGCAGGCAAGATAGAACTAGATCCCCAAATTGCA 

MCTAGCTGNCCAGAGTTCTTTCCTCAATGAGCAATTTAAAAGC 

Sequence 2548 

CCGCGGTGGCGGCCGAGGTACTAACTGTCTGGGATCGTAGTCGATTAAACAGAGCCACCT 

TTGTTCCTGA GGCM TGCATAAGTCAGCATTTTTCAATGACTGCTTCTTTTTGGMGGTT 

TGGAGATGACTTTTATCCGCTTGCTGAGGAACACACCAATGTCATCACTGTTGCCATAGA 

ACATCTTTACAGACMCATGAAGTGCTTTCGCTTGTCTGAGTCAGATATATACAATGGTT 

TGGCTGGTGCCATMGTTCTTTTCNTTCCAGGTTMGCTGGCTGCATTTTNTTGGTCACT 

ATTTCCTATNCCAATAAATGCACACNGGTGAGACTCTTGTTCAAAAACAACCATCGCGGT 

CCATTTGGTC I I 1 I I I I I ITC TTCCATTCNCTGGCCATAAGATATCCAAGGGGGGNGGGC 

AAAAAACTGGGAGTTATTTTGTATGCCAAAAAGACACAGCCAAGAGGACTTGNGGATCAT 

GCCCCC 

Sequence 2549 

ACTTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 I 1 1 I i 1 I I 

I I I I 1 11 M I rGG CATAAMAGTGCACMGTTTGTCCTATGTGTTAAAACATCTTAATTT 

GGATGTATTTTTATCCAAAATGATGCATACTTGGATGCATTAAAAGCACACCAACAAAAA 

TNCNCNCTNTAACAATAANGGGNNNNTTTNNAANGGNGNCGGGGAGNGGNAACAANATTT 

NGGTANANGGGANNTTNTTNTTTTTTTCNAAANAAMGAGMTTTGCTT^ 

AAl I I I l I iCCCCNNCANATTTAATTTGAAAATNTGAAAGNTATNGGAAGGACANNCNCC 

MTTGGMCACNNTGTGCAAAAGTTCNANAACNAAANAAAAGATNNTTTTCTNGGTTNGG 

GCNCTNANNCCNTGNAATTCAANATTTGGGGNNGGNNGGNNCCCCN I I 1 1 1 I I I GGGGGG 

GNGGGGGNTTTCCCNCCCCCCCCCCCCCANNNNGGGGGGAAANCN ) I 1 1 I I II I AANNAA 

AAAAAA 

Sequence 2550 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTGCTGAAATC 

CGGCATGTTCTTGTCACACTGGGTGAGAAGATGACAGAGGAAGAAGTAGAGATGCTGGTG 

GCAGGGCATGAGGACAGCAATGGTTGTATCAACTATGAAGCGTTTGTGAGGCATNTCCTG 

TCGG GGTG ACGGCCCCNTGGGNGGACNNCCCCCANNGGNCNTAAANGGGTNANAACCNTT 

CCNGTTTTCCCCAAANGCCCGNCCCCTTTTCCNTTGGGANAANTTTTTNTTTCT^ 

AAANGTTNCCCTAGGNTTTNTTGTCNCANNMNTTTCCCATTTTGTTTNTNTGGGANGAT 

GTTTNGCCCGTCANNTTCCACCMATAANANTTNCTTTTTGGNAAAAAAATNNTAAANTN 




WO 01/070979 



PCTYUS01/09126 



TABLE 1 

416/467 



NNNNCNTNNNNNNAAAANCCCCCCCCCCCCCCTATNAAAANTNAGGGCCCCCCCCGGNNN 
GNAAMM TMNANTAMTTTTTTNCCCCCCCCNCNCCNGGGGGGGNGGCCCCCCCCNCT 
I I I I I I I I I 1 INTAANNNANCCCCCCCCCCC 
Sequence 2551 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGTGTCTCACCACATCCTGGCTCCA 

GTGTGGATGCAGAGAGAGTGTGACAGAGGATCTGCCTGCGAACCACCTGGGATTAGTCAA 

GTCCCAAGGTGCCCAGAGTGGGACTAGTTCTTCACAGTGTGGCAGCTGCACTAATCTGTT 

TGTGAGGGAATA TCCAT TCCCTCACTCTACTCTCNTCATTATGGGMTTTrfTTTTTGTTN 

CAAAATAAANCCCTTTTGTATAGANAAAAAAAAAAAAAAAAAACCNCCCCCCCNNTTNAA 

ANAANGGGCCCCCCCCNGGGGGNGNANNNNTTNNA AI I I I 1 1 I I I NCCCCCCCCCCCNGG 

GGG 

Sequence 2552 

ACTTAGGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGGGATCTTTTTGGCAN 
GAGCCCCAAGTAAAGCTGTCAATCCTGATGAGGCTGTGGCCATTGGAGCTGCCATTCAGG 
GAGGTGTGTTGGCCGGCGATGTCACGGATGTGCTGCTCCTTGATGTCACTCCCCTGTCTC 
TGGGTATTGAMCTCTAGGAGGTGTTTTTNCCMCCTTr^ 

NNCCCAAAAAGNGCCGGGTTTTTTTNNTCCCCCNTNNNGGGNAACCCNGGGGGGAAAAAA 
A MNGTGGTT CNNGGGNAAAAAAAAMNTNTTTTAAAAAAAAAAANATNTTTTGGNNNNN 
Alliu m IT GG GANAG ANNNCCCCCCCCCCCCCCNGGGGGGGNNTNTNTNAAAGAANN 
TTTNNffTTTTNCATTTTTNNCCCCNNNNNGATAAATNCCCCCCCCGGNNGCCTATATAAA 
AAAAAAANCNCCCCCCCCGGGNNAANMNTTTTNAMTATTTTTTCCCCCCC 
Sequence 2553 

CTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTCI [TTTTTTT 

I 1 1 I I I I I I I I I I I I AAGACGGAGTCTCGCTCTGTCGCCCAGGCTGGAGTGCAGTGGCGG 

GATCTCGGCTCAGTTCAAGCTCCGCCTCCCAGGTTCACGCCATTCTCCTGCCTCAGCCTC 

CCGAGCAGCTGGGACTACAGGCTCCCATCACCACGCTCGGCTAAGTTTTTGTAATTTTAG 

TAGAGACAGGGTTTCACCATGTTAGCCAGGATGGTCTCGATCTCCTGACCTCGNGATCTG 

CCCGC CTCGG CCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGTGCCAGGCCAATA 

TGAAAGI I I I I MCTTATTGGCACAANAAGTTTTCATGAACCTAACAATATTTAATTAAC 

AAGTATTCTTCAATAACATTCCCTT 

Sequence 2554 

CACTACTTAG GGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTTT 

I I I I I I I I I I IGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 

TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTG 

AGTAGCT GGGA TTACAGGCATCCACCACCGTGCCCAGCTMTTTTTGTATTTTTAGTAGA 

GACGGGGTTTTGCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACC 

CTCCT CGGCC TCCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTAAT 

TCTCCI I I I I AGGNGAGTTAGGGGAACTGAGCCTCAAAAAACTTAAACGATTTCTCAAAA 

AACACCTCAAGTGATMAGTGGCCACATTGNMAGGGAGTTTTTATCTTTTTATTGNNGG 

CCCAGGGGTCATTGGACAAAATCATGCTACCTNTTGGATTTTAAAATATTCAATTGGCAA 

A 

Sequence 2555 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACTCAGAGGC 

CGCCATCAACCCGCCAGATCAACCTGGAGCTCTACGCCTCCTACGTTTACCTGTCCATGT 

CTTACTACT1TGACCGCGATGATGTGGCTTTGMGMCTTTGCCAMTACTTTCTTCACC 

AATCTCATGAGGAGAGGGAACATGCTGAGAAACTGATGAAGCTGCAAAACCAACGNNGGN 

GGCCGAATCTTTCTTCAGGATATCAAGAAACCNAACTTGTNATTACTGGGAAAAGCGGGC 

TTMTCMGGGGGGTGGGCCTTTANNTTTTGGNAAAAAAAGGGNGAATTNATTTCTNTTT 

TTGGGAACAAGNAAAAACCTGGGCCAAAAAAAAA 

Sequence 2556 

CACTACTATAGGGCGAATTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTT 
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i i m i i n 1 1 1 I II I I I I I INAGACAGGGTCTCCCTNTGTCACCCAGGCTGGAGTGCAGT 
GGCACAATCATGGCTCACGGCAACCTCGACCCCCTGGGTCCAAGTGATCCTTCCACCTCA 
GCATT CCACAAGATGATGGAACCACAGGCATGCACTACTATGCCTGGCTAATTCTTTTAT 
. TTTTTGNGCAGACAGAGGTCTCCCTATGTTGCCCAGGCTGGTNTTGAACCCCTGGGCTNA 
AGCTATCCTCCCGCCTTGGCCTCCCAMGNGTTGAATTANNNGGAATGAGCCACTTTTTG 
GGCCTNGGCCTCNANTTAATTTNAAMNGGGNGTTTGNTTTNAACCNCCGCCXCTTTAAA 
AAAAAGGGGCACCCCCCCGGGNGNGGGGNAATTTTNMNNNNAATTTTTTCCCCCCCCCC 
CCCCCGGGGGGGGGGGNGCCCCCCCCCCI I I I I I I I I I I 1 1 I AANNAAAAAAAAANCCCC 
CCCCCCCAAAAAAAAAAAA 
Sequence 2557 

ACTNCTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCTTGTCAAATT 

CCTTCTTGATATGAGCCGCAATGCCCCTCTCTATGTTGTATTTCTCCAGCGCCTGAGTAG 

CGCACTCCACCGAGTCCTGTTGCATCTCTTCCGACATGTCCGCATTTTTGATCACGGCCT 

TTCGGTCGCCCCGCGTACCTGCC 

Sequence 2558 

. cttagggcgaattggagctcnccgcggtggcggccgaggtactactgctgaggtctncag 
gacagaagncacctcctntggtagaacatncatccctggnccttntcagnccacagtttt 
gccagaaatatccacangaacaatgacaaggcttttYgcct 

Sequence 2559 

CCGGGCAGGTAC I I I I II I I II I I I I 1 1 1 II I II I I I I aggggacacggtctatccgcaa 

gctgnggattctgccattctcatcatagttttgaagccggggttctgaaatctcatgaca 

gatctccagcttgaactcacggaatggtttttctaggccctgctcataatacagctgcag 

gtaaccagngtctgtcagtttgacgaanatcggtccccagtgcctggaggacatgatgtt 

tttnttctcagggatcctcaacatnattggccacccgtcacgaggctgggaccgtgctga 

tccaggcgggtgagcatntagttcaatccaggctactgggtcatcatcaggtagagttgc 

actgccaaagtg atcag ggtcattmtttc^ 

agtgngactgggttttgctttgaatcatcnaaactggatggcatccttggnagatgacaa 
tgaagggaatnttntttggcttt 

Sequence 2560 

ccgcggtggcggccgcccggcaggtactgggccgggtgcagtggttcacgcctgtaatcc 

cagcactttgggaggccgaggcaggcagatcacgaggtcaggagatcgagaccatcctgg 

ctaacacggtgaaaccctgtctntactaaaattacaaaaanttanccgagcntggnnatg 

gnngcttgnatnccmnttnttcggaagcttagcnagaaaatngctnaaccttggangng 

gag ctttgan tgagccaaaatcccnccnttgnactcanccttgggggaaaaaaccngnaa 

NTCCI kill iaamaaaamaaaamamaamgggnccnntcggccctnttttaaannn 

ttggggacnccccccgncnnnnnggnaai I I I I I I I NAAACTTTTTTTNTNCCCCCCCCC 

CCCCCNGGG 

Sequence 2561 

NCTACTTAGGGCG AATTGG AGCTCCCCGCGGTGGCGGCCGAGGTAC I I H I I I I I 1 1 I II 

TTGATTGTTATTTAGTTTTTATTTCATAATCATAMCTTAACTCTGCAATCCAGCTAGGC 

ATGGGAGGGAACAAGGAAAACATGGAACCCAAAGGGAACTGCAGCGAGAGCACAAAGATT 

CTAGGNTCCTGCGACCAANGGGGGGGNGGGNGCCNNNNNNNCCCNAAAAAAAAAANNTTN 

GGGGGTT NAAAAAAANCCCCNTNN I I I I I 1 1 I I I I GNCCCNCCCCCANCCCNGGGGGGGG 

I I i I 1 1 i iGNGGGGGGGNTTTGCCNATNNCCGGNANCCCAAAGCGCTCATGCCNCANAAA 

NAANANTC CGNG CC I II I I I I ICNCTTGNGNCAAACCNCCANTGNGCTTTNGACNNCCAC 

CCNC TTCATTT TTGGGGGGGGGCAAMAAMMANCCCGGGCNCNCTTTNGGGGGGGGGN 

CCCNTTTTTTTNNAAAAAAAAAANANNCCCCCCCCC^ 

TTTTT 

Sequence 2562 

ACTTAGGGCGAATTG GAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I M I II II I I 
M I I 1 1 I I I I I I I J | IAGGGGACACGGTCTATCCGCAAGCTGGGGATTCTGCCATTCTCA 
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TCATA GTTTTGA AGCCGGGGTTCTGAAATCTCATGACAGATCTCCAGCTTGAACTCACGG 
AANGGTTTTTTTAGGCCNTGOTMTMNNNAGNTGCAGGN^ 

NNCAAAAANCGNNCCCAANGNCTGGAGGACANNNTNNTTTTNTTTTNAAGGATNCTTAAC 
ANTATTGGCCNCCCGTAACMGGGTGGGACCGNCCTTATCCAGGCGGGNGGGCTTTTTAG 
TTCAATCCCGGCTCTGGGGNANTANTNAGNAAAAGGTNCCTTCCAAAGGAANNGGGGCAA 
NMTTGGGNGGTNNTTNGGNGTTTTTCMCCCNNANAAMGGGGGCCGGGGGGGTTTTTT 
TTAAAANAAAAAAAAATGGGGGCNTNTTGGAAAAAANAAAAAAGGGAANN I M 1 1 1 1 1 
Sequence 2563 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCAGGTGGAGAGAGGGTGGAAAGTGA 

GCAGCGGGCTGGGCTGGAGCCGCACACGCTCTCCTCCCATGTTAAATAGCACCTTTAGAA 

AAATTCACAAGTCCCCATCCACAAAAAAAAAAAAAAAGAAAAAGTACTATTCTAGTCTTA 

AGAATAAAGACCNTGTTCAGCCAGGCGNGGGGGTTGNCNCNTGTANTCCNACNCrfTTNGG 

GNGNTTGGGGGGGCAAAATCCGGGGGTCAGGGCATTCGAGACCAGCANCCCGGGGTCAGG 

CTTTCGAGACCCAGCCTGGCCAACATGGTGAAACCCTGTCTNTACTAAAAATACAAAAAT 

TAGTTCGGCATGGGTGGTGCACACCTGGTAATCCTAGCTACTCAGGAAGGTGAGGCAGGA 

GAATCACTTGACCTAGGAGGCGGAGACTGCAGTGAGCCAAGATTATGCCAATGCACTCCA 

GTATGAGTGACAGAGCAAGACTTGTCTCCAAAAAAAAAAAAAAAAA 

Sequence 2564 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGGAGCTC 

GGATTTTAAGGCAGT AGCC TGCTGATGCTCCCAGCTGAATAAAGCCCTTCCTTCTACAAT 

TTGGTGTCTGAGGGGTTTTGTCTGCGGCTCGTCCTGCTACATTTCTTGGTTCCCTGACCA 

GGAAACGAGGTAACTGATGGACAGCCGAGGCAGCCCCTTAGGCGGTTAGGCCTCCCTTGN 

GGAGCATCCTTGAGGCGGTNTCCCCCCCCCCGGGGCCCCCTTNAAAAANANNCNCCCCGN 

AAAAAAANACCCCCGGGGGNNNCCCCNANNNAACGGNGGGGNCCCTTTTTTGGGGNNNAA 

NAAAANNCCNNCCCCCACCNCAAANAANNACTN I I I I I I I 1 1 1 CNCNTNNTNNAAAAAAA 

AAAAAAAAAAAAAAAAAAAAANNTTTCCNCGGGGGGGGGNTTTNNTAAAAAAAAAANNGN 

GCCCCCCCGNNNGGGNGAMNIsTITmNAAAATTTTTTNCCCC 

Sequence 2565 

GCGAATNGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTG I M 1 1 M I I H M I I I ( 1 1 I 
GCTGGTAGGAAATGCTTTAATAAAAATGCAATCTCTAAGGGGCCATGGCATCATTAAAAG 
AAAGGATGTCATGCCCAGATCCANAACTTGAAGGTGGCNGGCACCAGCAAGCACCATANT 
TTTGMTNGGCCTTNCTTTNCAGGGTCCTTANTTT^ 

CTTGAAAAAATGGANNAACNTGTTCNCCNCCCTTGGGTTTNTTAGTNGGGAGGGAACTTT 

TNGTCCANCTAAMAATNTTGNNCGNGGGCCCANTTTNGGGGGCNTTTNGTTTTTTNANN 

TTGGCCCCNNGGGGGGGCNGGGGGGGNNNCCCCTTTGTTNNNCNNNNTANGNNNAAAANT 

TTTTNNNTCNGGGGNAAAAAAAAAAAAA 

Sequence 2566 

NCTATAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGAAGAAGAGG 

AAGAAGAAGAAGCAGAAGGAATGGAAAGCCTGGAGAAAGAGGATGAAATGACGGATGAAG 

CAGTTGGAGACTCTGCTGAGAAGCCTCCTTCTACTTTTGCCTCACCTGAGACTGCTCCAG 

AAGTGGAGACCAGCAGAACTCCACCAGNTTGTGAACCCCCAACCCTTCAATCAAGAAAAG 

ACCTTTGATCAGGAGAAGACTTCTCGTTTCATTTTGGGGACACATTCAGGATTTCTTCAA 

AAGCAGTGAGTMCATTGAACCCCTCTTGCTTGCTATTCTGGTCTNTCTTCTNATANAAA 

TTGGMNMMCCCCCGGNCCCTTGGAGCCTTTTAAATGCCAMNGNNNGCNCTTTNAAA 

ANTTAGGGNANCCCCCCGGGCGGGGGGGNA I MINI I AAMNTTTTTTTCNCCCCCCCC 

CCCNTGGGGGGGGGGCCCCCCCCCNN I I I II I I I I M I NTTNNAAGNNNGNGNANANNCN 

CCCCNNGGGNAAAAAAAAAAA 

Sequence 2567 

CTATA GGGCGMTTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTNCTTTTTTT 

TTTTTNTNGNGGACTTGGGCTTCTCTGCCCATATTGCANTGTTGATGTTCCCAGAGTTCT 

ATCCTTACTCTAAANGATCTCCATTTTTGAGCTTATCCACACAGNGGACTGTGGNTTNCT 
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GGTCTTNGNNCTTNTGCCTAGGACNTNCTTTCCTAGGTANCCACATGCCTGCTCTCATTA 

TNTTCATGTCTTMCTCANAGGNCAACT7TGANTTMNGCTTTCTATNACTACATTNAAN 

ANNGTGNGACCTTTCNTGTCTNCCCANCTNGCCTGATNTTTNCTTNTCCATTGCACTTTN 

CTTNTGACTTCT 

Sequence 2568 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGNTGAGTGACCCTTACAAGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGNTG 

CTGCTCGTGTTGAATCTAGMCCGTANCCAGACATGGNACTGGAGGACGAGCAAAAGATG 

CTTACCNAATCCGGAGATCCTGANGAGGANGAAGAGGAAGAGGAGGAATTAGTGGATCCC 

CTAACAACAGNGAGAAAGCAATGCNAGCAGTTGNAGAAATGTGNAAAGGCCCGGNAGCGG 

TTAGAGeTCTGTGATGAGCGTGWCCTCTCGATCACATACAG 

Sequence 2569 

CCGCGGTGGCGGCCGAGGTACATTATGTTTTCGTTTTCATTCATCTCAATAGTTTCTTCT 

TiGGAGATTTGGGGGGAAAATGATAGACAGGAGGCAGGACTAGATTGCAGCCCTCACCCGG 

ACAGACAGCAGCTCACAGGGACTCGCATCATGAACTTTGGCTCCAGAACTACTGCAGGAA 

TATACCAGGAAAGCCAAGAGAATCCACAGACCCTCTGAAGGAAGCAGATTGCTCCTTCAG 

GACCCAAGAGACACCCTAAATACTGTGTTGGTATCGTTGGCAGAGAAACCTCAAGACGGT 

TCACATTACAGGACTCTGTGCAGACAACCCTCGGTACCTGCCC 

Sequence 2570 

AGGTACTACAAGAACATCGGTCTGGGCTTCAAGACACCCAAGGAGGTGCGGGGAACCTCA 

GAAGAAAGAAGGGGAACCTGGCGTTCCTGCACGTGTGCCCACGACGAGTTGCCCTGCCTG 

CATCT AAGT GGCTTCTGGGGCTGCTGGGAATTGTAGTTGCTTCCCTGAGGCCACGCCCCT 

GGCTNTTTTAAGGMCCGCCCGCCCAAGGCTCACTCCTTTATCTTTCCTATCCTTTCAGG 

CTATTGAGGGCACCTACATTGACAAGAAATGCCCCTTCACTGGTAATGTGTCCATTCGAG 

GGCGGATCCTCTCTGGTAAGTGCGGGAGTTACTGGTGTNTGGGGCCTGAAATACTGAAAG 

AAGGGTCTTGGGGCCCA 

Sequence 2571 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCNGGCCGGGTACCCCCATNACAG 

NNGNCAANCNGNCTGAGGAAGTGGCNNCTACCANGCAGGNGATCTTNCNGGAGCGGTTGG 

CNGCAGNGCCNGAGTTNCACCGGTNTTGNGCCCCTGTTCAAGTNCTNACCTGANCCCATG 

GCCCTCAC CGNGT CANAGACGGAGTNTGTNATNCGNCTGCACCAANCACACNTTCNNCAA 

CCACATGGTTTTTNANNTNGNCTGCACAAACNCACTNAATGACCNTACCTTGGANAATGT 

NNCAGTGCNNATGGAGCCCANTGNGGCCTATGAGGTGCTCTGTTACGTGCCTGNCCGGAG 

CCTGCCCTACNANCANCCCGGGACCTGCTACACACTGGTGGCACTGCCCAAAGAAG 

Sequence 2572 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGNNGNGCCCQGGACGCACAGNAAANN 

CNTGTNTTNGTNGGNNTNTCTATNAAAAAGGCAATCAAGAAAGATAATGTGAAAAAGANA 

GGAATTNATAGGTCGGGAANANATGAATGTCAAGACATTTGAAGAACTATAGTAAAATGA 

TCAACACTAAATATACTNAGAGAAANCTTTGTTAATATGCCAATGAGGTNGGCCTGATCT 

TTGAMTAGTGAATAGGAATNCMTGCATTTCCTCAGTGATCACTGATTANGAATGAGTT 

GGTNNGGATCCTTGGGA 

Sequence 2573 

TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAAGGAAGCCTGGTTGAAGTC 

TTTTCCTTGAGAGGCCTGGATTTGTTCACATATTCAGATTTCMCTGGCTTTATAACAGG 

TTTCCT TATGT GCCTCATATTGGCTCCAGGTTCACTGGGTTTAAAAGGAGCCAAAGCACC 

ATAAGGTTTTGGCAAAGGAAGAGTGGCATCTGCTTCTGGGATGTAGGCATTCGGACGCTG 

CTCT GCAG TTGTGTAAACACCATTGGGCAGCTGGCGATTATCTGCTTCCAAGAACTCCTG 

AGACTTTTCAGCGATGGCCAAGGCGTGCATTTCTTCATCTCGAAGGACTTTCAACCATTC 

TTTCTCAATTTCCTTATTGAGTGGCAGACC I I I I I CTA 

Sequence 2574 

TGCCGGTGGCGCCCCGCAGTCCGCTTGTCCGTCCTTNCTCTCTGACTGNGGTACNNCGGG 
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GCCTGTCTCTGCAGAGGTCANGGGAGGCNGGGGCCCCAGCACACGTTCCNCAGTGGCAGC 

GNGGAGCGGCAGCTTACGGGTTCGCGGGAGCCCCCGACCCCCCAAGGGCTAGAGGAGCNC 

TCGGCGGACCAAAGAAAGCCCGCGAAGCGGNTGCGCGCCNACCAATAGGTGCGGGGCTCG 

GAGCCNNACAAACTTGCGCCGTACCTCCNTCCGCCCCCGCCCCGNCNCCNCCGCCGCCGC 

CGACTCCCCCTACTCCTGCTCCTGCCTTGGCTCCTCCGCCNAACGTNTCGCACTCCGAGA 

GCCGNAGNGGCAGCGGNNCCGTCCTGCCTGCAAAGAG 

Sequence 2575 

CCACGCGTCCGTTCCCAGNCAGTTCCATCCAAGGGAGAATTAAGTAGAGAAATTTGTCTG 

CAATCTCANNGGNNNGACAAATCTACGACACCAGGAGGAACAGGAATTAAGCCTTTCCTG 

GAACGCTTTGGAGAGCGTTGTCAAGAACATAGCAAAGAAAGTCCAGCTCGTAGCACACCC 

CACAGAACCCCCATTATTACTCCAAATACAAAGGCCATCCAAGAAAGATTATTCAAGCAA 

GACACATCTTCATCTACTACCCATTTAGCACAACAGCTCAAGCAGGAACGTCAAAAAGAA 

CTAGCATGTCTTCGTGGCCGATTTGACAAGGGCAATATATGGAGTGCAGAAAAAGGCGGA 

AACTCAAAAAGCAAACAACTAGAAACCAAACAGGAAACTCACTGTCAGAGCACTCCCCTC 

AAAAAACACCAA 

Sequence 2576 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCAGCCTCCTCTATTCCTCCCA 

ATCTCAAGGTTACTCTTAAATACTAGTAAATGCAAAAAGAACTTGTAAAGTGGCAAGGCA 

TGGCCTATCAAAAGTCAGCCCAAGGGCAGTTTTCAGCCCTGCCTCACCTGGGTCTAGTTC 

AGCTGACGGATGAGCTGATTGATGCGTTCACCCCGATAGCCAGGTGTGCCCATCTCCTTG 

AGGAAGCCCACTCTATTTTTGGTAGCATGACGGGCCACTGAGAGGTGGAAAGGGCACAAG 

AACCATGAGATCTCCTGGAAATGCTTCCCTGGGAAGGCAATTTCATGAATGAGGTCTTCC 

AAGCAAATGACGCCAAACTTCCCCAGGTGCTCCTCAATCACTGTACCTGCCCGGGCGGC 

Sequence 2577 

AGGTACAGAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGG 

AAGTTGTCTCTGAAACCCGGGGACAGAGGACGCAGGACAGACTAGGAGGGAGCCGGGAGC 

TGCCCGGCGGGTCATGGGAATAACGCCGCCGCATCGCCCGGTCGGCATCGTTTATGGTCG 

GAACTACGACGGTATCTGATCGTCTTCGAACCTCCGACTTTCGTTCTTGATTAATGAAAA 

CATTCTTGGCAMTGCTTTCGCTCTGGTCCCGTCTTGCCGCCGGTCCAAGGAATTTCACC 

TTCTAGCGGGCGCAATACGAAATGCCCCCCGGCCGCTCTAGAAACTAGTGGGATCCCCCG 

GGGCCTGCAGGGAMTTTCGGATATTCAAAGCCTTATTCGAATACCCGTCGACCTCGAGG 

G 

Sequence 2578 

CCGCGGTGGCGGCCGCGCGGGCAGGTACCCCGAGTCCAGCGGAGACAAAGGAGTTAGAAA 

GAGACAGAATAAGAGTTTAAAAGGCAGGTCCAGGGGACCGGAGCGTTGGAGGCTTGCTCA 

TGGCCCAGAGCTCTTTGGCTCCGCCCAATTTATTGATTTACAAGCTCTTTGTTCTTAGGG 

CAGATGGGAGGGGTAGGAAGGGATGAGGAAAAGGATTAATCAGTGAAGGAGAACTCGTGA 

GTCATTCAATAATATGTATAGTAGTGGTGGTTTCTGTGAATTTCCTTGAGTAAAGGCGTG 

TGTCTAAACTACTCAAGATCTTTAACTTATCGGNATTGAAATGGATGGG 

Sequence 2579 

AGGTACGCGGGATAGTGAAACCCCGTCTTTACTAA I I I 1 1 1 GTATGTTTGGTAGAGACAG 

GGTTTCACCGTGTTGGCCAATATGGTCTCGATCTCCTGGCTCATGCCTGTGGNCCCAGCA 

CTGTGGGAGGCTGANGCAGGAGGATCATNTTGAGGCCAAAGAGTTCGGGATCAGCCTGGG 

CAACATAGTGATACCCTATCTCTTAAAAMGAAGMGTTTTTAMTTTGAAATAATAANA 

GGTACCTTGCCCGGGCCGGCCGCCTCTAGAAACTAGTGGGATCNCCCCCGGGCTGCAAGG 

GAATTTCGATATCAAGCCTTATCGGATACCGTCCGACCTTCGGAGGG 

Sequence 2580 

CCGCGGTGGCGGCCGGGCAGGTACGCGGGATGATTGMTTTTGTTTCCGCCTAAAATAGT 
AATCTATAAGATATAMCTCGAGTTAGGGTTTACATTTTTTACTTATGAACACAGGGCAC 
TAGGG CCAC TTCAGTCTMTTrCCTGCTTTTTMTTACTTTAACACTCCACAGGAGGAGG 
ACTGGTTTTCCTCTGTGACTTCCTAATGTATGGCAAGCAGGACTTCTTCTAATCCACTAC 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

421/467 



CCTCTTCCCCTAGCTTAACTMGGCTTGCAGTAAMTTATAMTTTCCACTTTCTTTCCT 

ACATTCTCAAATGTAGGAMTGAGGACAMCAACTTCCTCTCTCCMTTTACAACACTAT 

CMCTATTTGTCCTTTATTGTGCATTTCAGACACAGGTGTTTTAATTGNTAATCATGTTT 

TTACACTGCAGTGGATGGCAGGTTTT 

Sequence 2581 

CCGGGCAGGTAC 1 1 1 I I I 1 1 I I I I I I GAGATGGMTTTTGCTCTTTTGCCCAGTCTGGAG 

TGCAGTGGCATGNNCTCAGCTCACTGCAACCTCCACCCACTGGGTTCAAGCGATTCTTNT 

GCCTTAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCCGTGCCTGGCCCTGTCT 

CTCTTMGAGTAGGTTCATTGTCTGTCTTAGAGTCACTTCTATTGCMCTCATTTTCm 

TTCCAGGGCACAGATCGACCAAGCTGCCGGTTCCCTATTCTGCAGGGACAGGGACTATTT 

CTAGCATACCTGCTTTCGTCCACCCAGGCAGGGGTTTGGGGGTGGGTCTNTTCTGTGCCT 

GCAGTCCCCCATTTTGACACTTTGGGTTG^ 

GGGAAATGAAGGCTTCCATTGGG 

Sequence 2582 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCTTTC I I I M I I II I 1 1 I I I I 

TTTGAGATGGAGTCTCGCTCTATCGCCCAGGCTGGAGTGCAGNGGCGCGATCTTGGCGCA 

CTGCAAGCTCTACCTCCTGGGTTCACACCATTCTCCTGCCTCAGCCTCCCAAGTAGCTGG 

GACTACAGGCGCCTGCCACCATGCCTGGCTAATTTTTAGTAGAGACGGGGTTTCGCAGTG 

TTAGCCAGGAAGGTCTCAATCTCCTGACCTCCTGATCCGCCCGCCTCGGCCTCCCAAAGT 

GCTGGGATTACAGGCGTGAGCCACCGCGCCCAGTTGTGCATTTCTGGTTTCTAAGAATCA 

MCCACTTGGG CTGTTTTTAGAAGTTACTTCCCATGTTATAAAGCTGAGGMGCTTTTTT 

1 1 I I 1 1 1 I I I I IGAGACAGAGTCTTTGTCNCCCAAGCTTGGAGTGCANTGGTGCAATCTT 

AGCTCCCGGGGTTCAAGCMTTTTTCTGCCTTAAGNCTTCTGAGTAAGCTAAAAATACAG 

GNGNGCCNCCACCCCGNTNGGCTT A i MM 

Sequence 2583 

AGGGCGAATT GGAG CTCCCCGCGGTGGCGGCCGCCCGGCCAGGTACGCGGGGGCACTCAG 

GGAGCTCAGATTTTGAGACAGTNGCTGGCCGATGCTCCCAGNTGAATAAAGCCCTTCCTT 

CTACNAAAAAAGAAANNGAAAAAAGAAN ACAGGATATCTGAAATTAAG ACNGCNGATGG A 

GNNGTTTCTNNAAATGACAGGGNCCAAGGNGNGACCACGGGACCAAGNGGCTGAACTGGN 

ATGAAGTTAAGAAGCAGNAANAAACATCCNATAATATGGTGATCAGNTCAACAGAATGAC 

ATATTNACCATCaTNCCCNAGGAGGNGATGACTGAGATTTCAAATT 

Sequence 2584 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCTGGATAGCCTCCAGGCC 
AGAAAGAGAGAGTAAGCGCGAGCACAGCTAAGGCCACGGAGCGAGACATCTNGGCCCGAA 
TGCTGTCAGCTTCAGGAATGCCCCCCCGCGTAC I I M M I 
Sequence 2585 

CGGCTGCGGCGAGCCGGTATCAGCCTNACTCAAAGGGCGGGTAATACCGGGTTATCCACA 

AGAATCAGGGGAATAACCGCAAGGAAAAGAAACATGTGGAGCCNAAAAGGCCAGCAAAAG 

GCCAGGGAAACCCGTTNAAAAAGGCCCGCGGTTGCTTGGGCGTTTTTTCCAATAGGGCTT 

NCGNCCCCC 

Sequence 2586 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGGATT 

TCCGCTGAGTGACCCTTACAAGTCCTTCTTGATCCTGAACTGGGTTAGGTGCCGCTGTTG 

CTGCTCGTGTTGGATCTAGAACCGTAGCCAGACATGGGACTGGAGGACGAGCAAAAGATG 

CTTACCGAATCCCGGAGATCCTGAGGAGGAGGAAGAGGAAGAGGAGGAATTAGTGGATCC 

CCTAACAACAGTGAGAGAGCAATGCGAGCAGTTGGAGAAATGTGTAA 

Sequence 2587 

CTATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I M M M M I M I M M 
TTTTTTNGGGTATGAAAACTTAGGGACTAAAATTAATATAAAAATTGGCATAATGTTGGA 
TTGMTCTACATTTTGGCAGAAGTTAAACATTCCCACATAATGTCAAAATTATACATCAT 
GCAGTTCTG I MM I I GTTTGTTTTATTTTGTTTTGTTTTTGAGTCTGGCTCTGTCACCC 
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AGGCTGGAGTGCAGTGGCGTGATCTGCAACCTCTGCCCCCCGGGTTCAAGCGATTCTCCT 

GCCTNAGCCTCCCGAGTAGCTGAGATTACAGGTGCGCGCCACCACACTTGGCTMTTTTT 

GTATTATTAGTAGAGACGGGGTTTCGGCATGTTGGCTAGGCCGGTCTCTCCTGACCTCAG 

GGGGATCAGCCC 

Sequence 2588 

TAGGGCGAATTGGAGCTCCCCGCGGIMGGCGGNCGNNGNACANAATAANGCCTGTCACATA 

TTMGT^GTAATMCGCATTTATTATTACTTATCAGGGTATGATTTATGMTTGNGGAA 

C CTGN GATTATGGGAGAGTCTGGCTTCAATCAAGGGCTGAAATTCCATTTCCACTGACAT 

CTTTTNCTTCCCCATCCCCCGATTCTGTCCTGCAACAGGGTAACAAGAAGGGGCCCTTAG 

GCCGTTGGGACTTTGATACCCAGNAAGAATACAGCGAGTATATGAACAACAAAGAAGCTT 

TNCCCAANGNTGCATTNCAGTATGGTNTCAAAATGTCTGAAGGGCGGAAAACCA 

Sequence 2589 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCA 

AATCGGCATTTAGATAGATCCAGTGGTTTAAACGGCACGTTTTTGCCTATAAAAAAAGTG 

C AAAA AAGATGTGGTTTACAAGTTAAAGCTACAGMTCCCTTTTTGCTGTAATTGCACCA 

GTTTTAAAGCCTCTGGACAGAGCAGTATTTCGTTTAAAACTTTG 1 1 I I 1 CTTAAAAGCTT 

ACAGTGTTTGGCTMTTCTCCTCCCCTTTTTACAAGACGGGGGCCGGAGGGTGGACACTG 

GTGGCAGGTTAAGGGATACTGTCACTTTAAGAAGCCTGCAGATTGAAGTGTAAACATGGA 

GAMTTAGGGGCTGATTTTTTAAACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAA 

TCAGGGAMTCCAMCAGCGATTTACACCCGATTAACACCCCX^TTTATATATTTTTTACA 

AAAATACACTGGGGAAAATAATCNAACCGNTTTCATCTCTCTTGG C I Mil l GG I I I I I I 

AA 

Sequence 2590 

TTAGGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTT C I I I I I CI 1 1 1 ICTT 

I I I II I I I I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCNGNGGCGCA 

ATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTCC 

CAMTAGCTGGGATCACTGGCACAMCCACCATGCCCAGCTMTTTTGTATTTTTTGTAG 

AGACAGGGTTTCACCATGTTGCCTAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCTC 

CTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATCCAGCCCCACA 

CCCTCATTTATACCAATTACCTGCCCAGTAACTGTGGACTTTTGCTTCCTCACCCCTGCT 

CTGATCTGGAAiGGAGAGGGATTATGTTATAGCTTGTCAGCACAAGTCCCAAGTTCAATAT 

TTCTGCGGCAAAAACTTCTTCAAAAAATAAATGT AC 

Sequence 2591 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCCCTCTTCCCGCTGAAG 

GGAAATTCACAGAAGCTATAGTTGATGCAGAGCCGAAATATCTGATAGTTGNGCGACCTG 

CTCCACCTCCAAGTCAAAAGAAGTCATGTTCAGGTAAAACTCGTTCTCGCAAACCTCTGC 

AGCTGGTGGTTGGCACTCTGACACCGAGCTCGGTCTTCCTGTCATGGGGTTTCCTCATCA 

ACCCACACCATGACTGGACATTGCCAAGTCACTGTCCCAATGACAGATTTTATACAATTC 

GCTATCGAGAAMGGATAAAGAAMGMGTGGATTTTTCAAATCTGTCCAGCCACTGAAA 

CMTTGTGGAAMCCTAMGCCCMCACAGTTTATGAATTTGGAGTGAAAGACAATGTGG 

AAGGTGGAATTTGGAGTMGATTTTCAATCACAAGACTGTTGTTGGAAGTAAAAAAGTAA 

ATGGGAAAATCCA 

Sequence 2592 

CTATA GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCTG 

ACTCCTTTTTCGGACTCAGTCTGCCTGCACCCATGTGATTAAAAAGCTTTATTGNTCACA 

CAAAGCCTGTTTGGTTATCTCTTCACACAGACATGCGTGACACTTGGTGCTGAAGACCCG 

GGATGGGGGACTCCTTCGGGAGACTGGTCCCCTGTCCTCACCCTCACTCCATGAGGAGAT 

CCACCTACAACCTCGGGTCCTCAGTCCAACCAGCCTAAGGAACATNTNACCAATTTCAAA 

TCAGATCTTGGCTTAGTGGCTGAAGACTGATGCTGCCCAATTACCTCGGAAGCCTCCTGG 

ACCATCACAGATACTTTGAGTAATCTCTTATAGTGGAAGGATGCAAAGTTGGAATAA 

Sequence 2593 
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CCGGGCANGGTACCATATGTTTCAAAGTAGCTGTTCATCCACAGATAAAGAGATCAAGAA 

ACTCTCATATACATACACTAGGAAATATTCTCCAGCCATCAAAATAATGAAGCAGTGTCA 

TTTAGAGCAACACAGATGAACCGCGGAGACCTGCCTCCTACTCCACCATCACATGGAACC 

CACCACTGCTTCTCCGAAGCTCGCTCTGACCACGCCGCTGCTGCTGCAGGGGCCTCGCAG 

GAAGTGCAGTCTCCCCCGCGTACCTN 

Sequence 2594 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTGGTATG1 I 1 1 1GTCTTG 

AAAAGAGGTTTTCTAGCTAGAAATTCGGGGCCCAGAAGGTCAAGGCTCCAGAGATCCCCT 

TCTCATCATCCTTCAGTTGTCTGAGGAGACAGAGGATCACATGTGTCACAGTGAGTGCTG 

ATTCCCCAGAATCTGCAGTCTTAAGCTCCCCTGCTGACGCACACAGCCAGTTCTCTGCTT 

CATATTCATGTGACCCACTGTGAAGTGACAGGCAGCTGCCAATTCTGTCATATAAGAAGC 

AGCTCTGCCCATGCMCTTTGAAAGTTCCTGCACTTCTCCATAGACTGCTCCTTCATGGA 

MTAAACACATCATCACACTGTGGGTGCCTAAGCTMGTTTGTTTGCAGTTTGATGATGC 

CCAGCTCCTTTTCAACTCAGCCTTAATTGGCACTTTA 

Sequence 2595 

CGAATTGGAGCTCCCCGCGGTGGCGGCCGCCGGGCAGGTAC i I I I 1 I I 1 1 1 I 1 1 I I I 1 GA 

GACAGAGTCTAGCTCTGCGCCCAGGCTGGAGTGCAGNGGCACCATCTCGGCTCACTGCAA 

CCTCCACCTCCTGGGTTCAAGCAAGTCTCCTGCCTCAGCCTCCTGAGTATCTGGGATTAC 

AGGCACATGCCACCACGCCCGGCTAATTTTTGTATTTTTAGNGGAGACGGGGTTTCACTT 

GTTGGTCAGGCTGGTCTCAATCTCCTGACCTCGGGATCCATCTGCCTGGTCCTCCCAAAG 

NGCTGGGATTACAAGGCGTGAGTCACCGTGCCCGGCGAGAAGCAACTCTTAAAATACTTT 

ATTCCTTCCTCTAGGACCCTTTAAATGGTGAAAATGGGCAGATGAGTAGCAATAAGTGGA 

CCTTTGTTACTCTTCTGAGTTAGAAAAATTCTAATTTAAGTACCTCGGCCGNTCTAAAAC 

TAAGTGGATCCCCCGGGC 

Sequence 2596 

ATTG GACCT NCACCGCGGTGGCCCGCCCGGGNANNAACCATAAACCGTNGCAGNCTCAGC 

ANATATTTTCCCTTCCTTAAGTCAGTAACTTTCACCTTTTCACTTACAGGMGCACTTTA 

CGGCTTCTCTTTAGCATATGCAMTTGCCAGCATTACCACTCTTGGACTTTGGGGCCACT 

GTTAAGTAAAGTAAGGGTTACTTGAACATAAGCACTGTAGGCCGGGGCATGGTGGCTCAC 

GCCTGTAATCCCAGCACTTGNGGTAGGCCAAGGTGGGTAGATCACCCGAGCTCAGGGAGT 

TCGAGGACCANCCTGGGCCAACATGGTGGAAAACCCCACCTTTACTAAAAAATACAAAAA 

TTAGCCCAGGTTTTGGTGGGCTGTATACCTGGNGATGCGCANCCTATTCAAGGAAGGGCT 

ANAGGGCAGGGAGAAATCGCTTGGAACCTGGGGAGGCCGGNANGTTGCAGNTGAGCCCAA 

GAATCCGTGGCCCACTGCACTCCAANCCTGGGGTGACAAGAAGCCGAGGACTCTGGNCTC 

AAAAAAAAAAAAA 

Sequence 2597 

CCGGGCAGGTACTTCTTGATTTCATCATACAAGACAAGCACAAAAGCACCACCCATGCCT 

CTGAGAACATNGGACCATGCACCCTTGAAAAAAGCTTTGCCTNCTTCATCACGAGCAATC 

TTCCGCCAGCAGTCAAGCCGTGCCTGTGTACAGATGGGGTTTTGCCATGTGGACCAGGCT 

GGTCTCGAACTCCTGGCCTCAAGTGATGCACCTGCCTCGGCCTNCCGAAGTGCTGGGATT 

ATAGGAGTGAGCCACCACGCCCGGCTACAGAGTTGGGTTTTAACAGAAGAGGACCTTGAA 

TGCTGMGCTTNACAGGGCGGCCAAACTAACTCGCTGATTTTTGCAAGACCACAGTGTAA 

AGGTCGGATGTCCACCTGAAGAAGGGGTGGGTGCAACTCTCTGGGTGCTGCACACACCAT 

GACCAAGCCTGGGCATGCAGCACCCCAGCTCCCATNCATTCACACTGGTTGCTTTGTGAG 

GTCATTTTGAGAGGGCTTTCANAGCCTTTTAATGAGAA 

Sequence 2598 

AGGTACCTTTGACCCATCATCTTGGGAGGTGGGGAGGACCNCGAGGGNCCAGGCAGGGTG 

TAGGGGAATGTATTAGNCCAANGAGATTTCCCTCTTCATCCGCAGCAGNGTATCTATTCT 

ATACCTGGCTATGGGAGAAGACCCCTTGCATGGGAGGGACCCCTTGCTATGGCCCCTTTA 

AGCCAGGCAGTGGGGATCTACCTGNGGCCCCGGCCTCCCTAAAGTCATTCACATTGAATG 

GGGGGATGAAGGNTCGGGACAGTGGCTCATAAGAGCCCGAGTATTGAGCCCTAANCTGTG 
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TNAAAAAAAANNAAANGTACCTTGCCCGGGCGGGCCGCTCTANAACTAAGNGGGATCCC ■ 
Sequence 2599 

GGCAGGTACGCGGGTGGACACMTTTCCACAACCTTTCAGATGCTGATGTAGAGCTATTG 

GGAAAGAACTTCCAAACTCAGGAAGTTTGCAGAGAGCAGACAGCTAGAGATAACTCGGGA 

CCCAGAGTTGGTCGACAGATGTTAGGATGTATCCTAGCTTTTAGCTATAAACCACTCAAA 

GATTCAGCCCCCAGATCCCACAGTCAGAACTGAATCTGCGTTGTTGGGGAAGCCAGCANT 

GGGCCTTGGGGGAAGGAAAGCCCATGGCTGTGGTTCAAGAAGAGGGGTGGGGCCTGGGCA 

AGCCCACTTTTCGGGGGAAMACTCCTTCCGCCCCAGGGTTTCTTCTTCTCTTAAGGGAG 

GAG ATTAT TTNTTCACCAAACCCCGCTGCCTTTCATGCCTGCCTTTCAAAAGCTAAGATC 

CATGTTTTG CCTTT GCTT MGAG NAATTACCTGCAAAATCANGCCCCAAGTGGCTTTGGG 

CGMTNCCATTTTAAAAGATTTTCTAGGCCCCTTCMGGGTTTTTGTAGAAGTGTNAAGC 

CCTTGGTGGGGCAAGGGNTTG 

Sequence 2600 

GGGCGAATTGG AGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGGACTTNN 1 1 1 1 1 )1 1111 

I I I I I 1 II I I 1 I I ACGGAGTGTNTGTGGACAGTGCCAACTTTATTGAAGACAGNTCCCTN 

TGAGACTACAGGCCCAGCCACTCANAAGGCTGAGGCAGGAGGATCGCATGAGCCCAGGAG 

TCTGAGGCCGCAGCGAGCCATGGTCACACCACCGCACCCCAGTCTGGCAACAGAGCAAGA 

CCCTGTCTCCAAAAAACAAACACAAAAACAAAACAAAACAGCCAAAAGCAGGCTAGAGGA 

TGCTAMTGAGACTTTCCTAAATCTCCATTTTGGCTGGGACCCAAAGCTGAGGACAAAAA 

TTGGCAMTATATCCATATGTTGCAGATTTGCTGCCNATCCATGAATGATTTGTTAAGGA 

Sequence 2601 

AGGTACATGGAGTTTTGATCAACGGACTTATNCCNCCTCTCCTAGGAACTAAAATGCCAG 

GGCCAGGCTGTGTATTTCTTTCCCAGGAMTTAGCTTTCCAGCCCCTTTATATATGGAAG 

GCTGGGTTAAAGTTATGGTTCCAGMGCTTCCAAATCCTGAAATAGATGTTTTAAAGATG 

CAACCTCAAACACCAATGCTGTTGTTAAAGAGCCTATGGGGAATTGCTGCTGCTCTAGAA 

ACTAGTGGATCCCCCGGGCCTGCAAGGGAATTCGGATATCAAGCTTAATCGATACCCGTC 

GACCTCGAGGNGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGGGGTTTAN 

ATTTGCCGCCGCCTTGGGCGGTAATTCATGGGTNCATAGGCTGTTTCCTGGTGGTGGAAA 

ATTGGTTTATCCCGCTTCACAAATTCCCACACAACATNACCGAAGCCCGGGGAGCCATAA 

AGTGGTAAAGCCTGG 

Sequence 2602 

TGGAGCTCCACCGNGGNGGCGGCCGAGGNACCAAAAAGACTCTCAAAAACCAATACTCCC 

CCGGGCMGGGAATAGCCAAGTTTGTTGCGGTTTNCAATGAATGACATCAGCCCTGTGTA 

GGTCTCAATCAAAATGGGTTCAGTTAACACCATCAGTTTCTTTCCTCTTCCAGATCCAGT 

TGAATTCTTGTGGGCATTCTGGATAGCTGGAACAAGCTTAGACATGAACCCAGACAACTT 

GCAMTTTCAAGGAATTTCTCACTGGTGNATTTCATAGGATGCTCAGTGAAAGTAGNATA 

AGGAACTTCAGTGGACCATGGGTTCCAGCGGGGACAAGAAAGAGACTGCTCCTCCGGGAC 

TCCCCCAAGTAGAATCTTAAGGGCCTTCTCCTTGGTCTTCTTGTCCAGGGGACATCCCAG 

GGAAAGGTGGAACTTTGCCCAGGC 

Sequence 2603 

CCGGGCA GGTAC GCGGGGGGCATATCAGATACGAGGGCACCATGTAGCACAGCTGGACCC 

CCTGGGGATTTTGGATGCTGATCTGGACTCCTCCGTGCCCGCTGACATTATCTCATCCAC 

AGACAAACTTGGGTTCTATGGCCTGGGATGAGTCTGACCTCGACAAGGTCTTCCACTTGC 

CCACCACCACTTTCATCGGGGGACAGGAATCAGCACTTCCTCTGCGGGAGATCATCCGTC 

GGCTGGAGATGGCNCTACTGCCAGCCATATTGGGGGTGGGAGTTCATGTTCATCAATGGA 

CCTGGAGCAAGTGGCCAGGTGGGATCCCGGCAGGAAGNTTTGNAGACCCCTGGGATCATG 

GCAGGTTCACAAATGGAGGGAGGAAACCGGACCCTGCCTGGGCCAGGCTTGTGCCGGNTC 

C ACCC AANGTTTNGAAGGAAGTTTNCTACAGCCGGAAAGTGGGNCCCTCTGANGAAGCCG 

GTTTTGGTTCTAAGAAGGCTGGCGGAGGGNACCCTTCGGCCCGCTTCTAAGAACTAGNTG 
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GGATCCCCCCCGGGCTGGCAGGGAAATTCGGATATCNAAGCCTTATCGATACCCGNCCGA 

ACCCTCGANGGGGGGG 

Sequence 2604 

CTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGCGTGGAGGGTTTAGGCAGCGTGTTCT 

GATTCTTTGCGGGACGGCGAGCGCATTTGTGCTTTGCCCGCCGCGGCCTAGGAGGCCTTT 

TGAGGCCGCGTAGTCGGTGTTTTTGAACTTACTCTACAGCTTCTGGCAGGCCGTGCGGCG 

CCCTGACCCGGCCTCACCATGTTGGTGCTGTTTGAAACGTCTGTGGGTTACGCCATCTTT 

AAGGTTCTAAATGAGMGAAACTTCMGAGGTTGATAGTTTATGGAAAGAATTTGAAACT 

CCAGAGAAAGCAAACAAMTAGTAAAGCTAAAACATTTTGAGAAATTTCAGGATACAGCA 

GAAGCATTAGCAGCATTCACAGCTCTGATGGAGGGCAAAATCAATAAGCAGCTGAAAAAA 

GTTCTGAAGAAAATAGTAAAAGAAGCCCATAAACCGCTGGCAGTAG 

Sequence 2605 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTGGGTCTGAAGGGGTCAC 

TCAAGCGCTCTGCTATCTCTGCAGCTAAAACGGGTGTCAGGTTTTCAGCTGCTACTAAAG 

ATAATGAGCATAAGCGTTCACCGACCAAGACTCCAGCCAGAAAGCCTGCACATGTGACCG 

TGTCTGGGGGCACCCCAAAAGGCGAGGCTGTGCTTGGGACACACAAATTAAAGACCATCA 

CGGGGAATTCTGCTGCTGTTATTACCCCATTCAAGTTGACAACTGAGGCAACGCAGACTC 

CAGTCTCCAATAAGAMCCAGNGTTTGATCTTAAAGCAAGTTTGTCTCGTCCCCTCAACT 

ATGAACCACACAAAGGAAAGCTAAAACCATGGGGGCAATCTAAAGAAAATAATTATCTAA 

ATCAACATGTCAACAGAATTAACTTNTACAAGAAAACTTACAAACAACCCCATCTTCGGG 

ACAAAGGAAGAGCAACGGAAGAAACGCGAGCAAGAACGAAAGGAGAAAGAAAGCAAAGGT 

TTTGGGAATGCGAAGGGGCC 

Sequence 2606 

CCGGGCAGGTAC I 11 I M I I II I 11 I I AACATCTTTG I 1 I 1 1 AAAATAGAATG ATAGAAC 

TTTGCCAGTCTTTAAGATCTTGGCTTMTTTMTGTATTAATCTGTTTGTGCAAACATAA 

TACCACCATTTAAAAATGTTAGGGGGATGAGTTGCAG I 1 I 1 1 ATAATAG A I 1 1 1 I I I I I A 

MGTTTGGTATTGTAAAACATTCACACCTCTGTCCCTCAAAATTGATAATTACCGTTTAA 

AGNGCAGNCATTTGNGGTNTAGAAATCTTGTTTTGTTTTGCTTCCATTATTGAGTTCCT 

CTAAGGGAAAATTGGAGGAGAAGGGGACTGGAATATGAAAGCCCCAAATTCATATAAAAA 

AGTTTGCGGTTNTMGGTTTGTATTAAAAAATAGNATATTATTAANGAAAAAAI I I I I I I 

TCACTTGATGTTTTGGTTAG 

Sequence 2607 

ATATGGGCGAATGGACTCCCCGCGGTGGCGGCCGGGCAGGACI I I I 1 1 1 1 I C I I I I I I I I 

I I I I I I I GGAAACTTCCTTTTCTTAGTTGTTGTATTCTTGAAGAGCCTGGGCCATGAAGA 

GCTTGCCTAAGTTTTGGGCAGTGAACTCCTTGGATGTTCTGGGCANGTAAGTGTTTATCT 

TGGGCCTGCAATGAAGCCAGCCGAAGTCCATCCCTTGGGCAAGGGCCGGGCTTGTTGGTG 

GGGTTTTTGGAAAGAAGTTTGGGACCAAGGGGTCCTCCTCAAGGGGGAAGCCGGGGGGGG 

GGTTCTTCCCTCGGGCTCTNGGGCCGCCTGCCATATTTCTTCCTTGCCTTGGGCCGAACC 

GCCTGCTTGAATACCTTGGMTTGTmCTTGCCTTGCTTGGTTGTTTTACCT 

TTTCCCTCMTGGTMTGTGGTTTGGTMTTTTAMCCTMTMTCCTTTGGGCTCAATT 

TTCMTCCCACCTCTTGTCCMTCAAGCMACTTGNTAANGATTCTTTCCCCCAAAAATN 

GAATTTGGCTGGCTTGGGCAMGGGCCTTGMGCMATTCNATTGGTTTTAATCTTGCAA 

ACMGCCTGAACTTTCTTTTTTCMGGGTTCCCCACCATTTMGGGAACATm 

AGGATGTGGAAATT 

Sequence 2608 

CGCGGTGGCGGCCGCCGGGCAGGTCAAACGATGTGTCCGTGATGACTGGTGGGGCTCATG 

TMTCCCCACCTMGAAAAGTAGAAAGTGCMCTTTATCTTTTAGGTTAATAAGTGCTGA 

GAGATGGAGGTTTTTCCTTCTCATTTTGATGGAGATGCCTAGAAAACCTCGCCTGACACT 

CTTTGTCCAACGCAGGATAGAGAACATAGCAACAGAAAGGGTGAGGCAAAAGGCATGGCT 

GGTN AAA AGGCA CTGCATGTTATTAAGGATGNGGGGCCTGGTCCTGTTCGNTTCACATGT 

TTTTCTT^r^TTTTATACAGAAATAGGAATCTACCAGACAGTAATAAATGCCACTTCTCAC 
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AGAAAGTCTGACAGGCTTCCACTGCCTCTGAGAGAACAACAACATGTTGGCTCCATAACA 

TAAAGAAAAACAATGCTGGGTGCGGTGCGGTGGCTCGTGCCTATGATCCCAGCACTTTGG 

GTGGCTGAGGCAGGAGGACTGCTTGAGCACANGAGTTTGAGACCAGCCTGGCCAACANGG 

CGAAACCTTCTCTNTACTAAAAATNCAAAAAAAAAAAAATTAGCTG 

Sequence 2609. 

TCACCGCGGTGGCGGGCGGCCGCCCGGGCAGGTACI I 1 I 1 1 I I I I 11 i I 1 M 1 11 I IGTG 

GAGACAAGGATCTTGCTCTGTCACCCAGACAGATGTGCAGTGGCATAATCATGGCTCACT 

GCAGCCTCAACCTCTGGGGCTCAAGCAATCCTCCTGCCTCAGCCTCCCAAGTAGCTGGGA 

CCACAGGTGTGCACCCCCACTCCTGGCTAA I I I II I IAI 1 1 I J I GTAAAGATGGGGNCTT 

GCTATGTTTNCCAAGGTTAGTCTNAAACTCCTGGGGTCAAGCGATCCTCCTGCTTAAACT 

NCCAAAGCACTTGCGATTCAGGNGTGAGCCACCAAGCCCAGCCCCAGCACCACTTTTAAC 

TAGCTGCATAATCTTGGGTAAACTACTAACCTTTCCAAAGTACCTCGGCCGnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNATTGCCCGCTTGGNGNAA 

TCATGGCAMAGCTGTTTCCGGGNGAAAATGNTTATCCCGTTACAATTNCCNAAANATAC 

NAACCCGGGAGCNTAAAGNGTAAAGCCNGGGGGCCCANTGGGGAGCTAACTACAATTAAT 

GNGTTGGGCTACTGGCCNNTTCAATGGGGAAAACCNGGNGCCACTTNINITANTGAATCGNC 

ACCCCCCGGGGGAGGGGGTGGTTTTGGCCTTTTCCTTCTNGTAATANTNNTNGCCGGGGT 

NG 

Sequence 2610 

GCTCCCCGCGGTGGCGGCCCGCCCGGGCAGGTACAATGAGATGGATACAATTAGTNAAAC 

CTTAAAATTAAAAAAGCTGTAGACAACAGAAGGNAAACTGGAAATCCATTTACAATTCAA 

AAGAACTCACTAATAACAAAATTAATGTTCATCAACTTCATTTATAATCCATTTNGGCCT 

ACATNGCNTAACTAAANTGACACATGTCCCCGGGGGCTGCAGGCGTNGCNCCAATCTTCG 

CTCTGAGGNGCTNTCTTAACCGNNANACCCTGGAAGCGGGCAAGTCTCTTGCTGTGTCGG 

GACCTTGCAGNCCCTGGCCCTTCCGCCACCATGGGAATACCTNGGGCCGTTCTAGAACTA 

GGTGGGATCCCCCGGGCTGNAGGGAATNCNIMTATCANGCTTATCGATNCCGTCGACCTTG 

AGGGGGGGGCCCG 

Sequence 261 1 

CCGCGGTGGCGGCCGAGGTACCAAGTGTGGGAAGATGTTGAGCAACTGGAACTCATGCGT 

GGCAGGTAGGGATGTAAAATGGCACMAGACTTTGTAAAATACTTTGGCAAATTNNNAAA 

AAGNAAACACATAGCTACCATACAACCCAGCCATCCCACTCCAGTATTTAACCCAGGTGA 

AATGAAAACTTATGTCCAMCAAAGACTTGTACTTTTCTATGATGACCCGGGCCGGCTTC 

TTTMCGNTTTTNGGTGCGAACCGCNGCCCATGTTGGCGGGTCCTTGGTAAAAGACCCCG 

CGTCCTGCCCGGGCGGCCGN 

Sequence 2612 

GGAGCTCCCCGCGGTGGCGGCCCGAGGTACANGAAAGTCTAGATGATCTTGTAGTGCCAG 

AAAGTAAGAAAGTAATAAAAAGATGACAGGTCTGTCACAATGATACAGAAGCCAATGTGA 

CAAAGCTCTCAATAGTTAMTCTCGAATTTGAGTAGTAAAAGTGACACAGTTTTAGATTA 

TAACCCAAAGAACTAAATAAATATCCATGAGCCCTATTGATATAAATGACAATTAAGGGT 

TTTTTGTTTTGG I I I I I I I GGTTGCTTTGGTT 

Sequence 2613 

GAAACCTGTCGTGCCCAGCTGCATTTANTGAATCGGCCAACGCGCCGGGGAAGAGGCGGT 

TTGCGTATTGGGGCGCTCTTTCCGCTTTCCTCGCTCACTGACTCGCTTGCGCTCGGGTCC 

GTTCGG 

Sequence 2614 

GGAGGCGGCCGCCGGGCAGGTACTTTGTTT C I I I II I I ICCTTTTTCI Mill I AAGACA 

TGGTCTTGCTCTGTTGCCCAGGCTGCAATGCAATGGTGCCACCTCGGCTCACTGCAANCC 

TCGACCTCCTGGGTTCAAGCMTTTTCCTGCCTCAGCTTCCAGGTAGCTGGGATTACAGG 

CGCCCGCCACCACACCAGGCCACATCTATGTATTTAGAGACAGAGTCTTGCTCCACCTGG 

GAGACAAAANCNGACCTCCGTTTAAAAAAAAAAAAAAAAGGGAAAGGAAAGGAGGAATTT 
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TTGAACCCAGTGCCCGTTCCTTGCAGATCTGCGAACAGTGGGCACCAGCCTCTAGTCTCA 

AGTGTGAGGTTTGACTCCAGGACGNTGAATTTA 

Sequence 2615 

CGCGGTGGCGGCCGCCGGGCAGGTACGCGGGTGCTACACGGAGGATGAAGCTAAGGAGC1 

GGCTGCGGAGGTGGAGGGTTCAAGACGGCCCCAATGAAGATGGGGAGATGTTCATGCGGC 

CAGGGAACTTGTCGACTATTTCCCAAAACCATACCCCAACAGTGGAGGGCTGNTNGAGNT 

GCCAACAACGGAGCNTNGCCCCCTGACCTCAGCTACATCGTGCGAGCTAGGCATGGTGGN 

GANGACTTCCNTTTTTTTCCTTGTTNACGGGCTNTTTGGAACCNCCCCCCG^ 

NTGGGGNAAGGGTNTTTATTAAAAANCCNTTTT 

CCNCCTTTNAACAAA I MUM I ANNTTTTTNGANGNGGGCCCCCCATCTTNCTNTGTTC 
CAANATAGCCAAGGGTTGTGTGCNCCTTCCTGCCGCTGGGCATTTTGANCCAGAACNCCA 
CCATAAAAAACGCNTGGGCTAAANATTGTTNATAATGATGGNTTTGGGGNNNCCCCTNTC 
TANMNATAMGGGGGACATGGGNTTTTTTCTGAAMGGNGGNATTTTTNTNCCCCCCAT 
TNAAAAAATGCNCCCCCCGGGGGAAAAAATTTTTN I M M M M I I CCCCCCCCCCCC 
Sequence 2616 

CCCGCGGTGGCGGCCGGGTACATTGCTA I I I I I CCACCTATGTCCACACCTTCTCATGAA 

CTTTCAAAGTCAGAGGCAGNTACCAGTGCCATTAGAAATACAGATTCAACAACGTNGGAT 

CAGCACCTAGGMTCAGGAGTTTGGGCAGAACTGGGGACTTAACAACTGTTCCTATCACC 

CCACTGACAACCACGTGGACCAGTGTGATTGAACACTCAACACAAGCACAGGACACCCTT 

TNTGGAACCGATGAGTCCTACTACGTGACACAAGTCACTCAAAGATCAAACATCTATCCA 

GGCCTNAGCATTCCCTTTCCATCTTACTGGAAGTCTACCTGNGCTCGGGACACANGGGAG 

MGCTCTNTTGAGGCACCACTTTTTGGGAAACCATCTACAGACACACTGTCCAGAGAGGA 

TTGAGACTGGCCCAACAAACATTCAATCCACTCCACCCAT 

Sequence 2617 

TATTNAGAAGTGGTTCCAGGTCTTAGCACAAAACATGGATGCACTTTAGGCCANNCATGG 

TGGCTCACACCTGTAATCCCAGCACTCTAGGGGGCCAAGGTGGAAGGATCCCTTCAGCTC 

AAGAGTTTGAGACCAGCCTGAGCAACATAGGGAGACCCTATCTCTACCAAAAAAAAAAAA 

AAAAAAAAAAAAGTACCTGCCCGGGCGGCnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnCTCGAGnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnTAATTGCCCCCTTGGGCGTAATCANTGnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnTCCGNTCACAATTNCACACAACATACNNGCCCGGG 

AGCATAAAAGTGTAAAAGCCTGGGGTGCCTAATGAGNGnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnGAAACCTGTCGNGCCCAGCTGCATTAAATGA 

ATCGGCCAACCCCCCGGGA 

Sequence 2618 

CCCGCGGTGGCGGCCGGGTCTTTAGGATGCTGGTAAACTATGGAATTTCAGACGTGGACA 

TCTGGCAGACATTTTCTTTAAAATGAATAAAGTGAGCTTATCACCTCAAGGAAAACAACT 

NAAGCCATCTGTTGCCAATGATMAMTATGAGATTTAAAAAAATGTTGAAGNTTCAGGC 

TAAMTTAAAATTTTGAAAMCTTATATTGGCCACCACANGTTTGACACCTTACCAATTA 

TTAATGGGANGGTTTTNTrTMGAGGNCMCAGTGATNTTAACAAATC 

TNTTGGNTTATTGGGAGGTTTANCNATTGGGAAGATCTGTAAGACTCAAGNGAACCATGG 

NTNTCCAAATGACTAAGACATGATGGNTACAAAATCACCCCTGGGTAAAAGATCCATTCA 

AAGTGCAGGACAGACCAATGGAATTTAGTGTAACAGAGGTNTGAAAGGTTCATGAAGGCT 

GGGCATGGTGGCTCACGCCTGTNATCCTAGCACCTGGAGGGCCCG 

Sequence 2619 

CTCCCGCGGTGGCGGCCGGGTAC I M M M CTTTTCTTTTC I M M M M M M M GAGA 

CGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCGGCTCACTGTAAG 

CTCCACCTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTTGAGTGGCTGGGACTACA 

GGTGCCCACCACCACGCCTGGCTACI M M M IGTAI I I I I ATTAGAGACGGGGTTTCAC 

TGNG TTAGC CNNGAATAGGNTCGATCTCCTGCCTTGNGATCTGCCCCCCTTGGCCTTCGA 

GCCCTnTTGACCTNAAGGACCAGCACTGGAATAATGlTTGGGANNCCCNTGCTTTTNAG 
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GCAGGCTTAGGGAGCAGGAAAGCATACAGGTGTAGCAGCCTTTCCAGCTGATCCCCATGC 
CCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAGGGCTGA 
Sequence 2620 

CTCCCGCGGTGGCGGCCGGGTACTTNGTTCCTTTTCTTTN CI 1 I I I I I I I 1 1 I 1 1 I GAGA 

CGGAATCTTGCTCTGTCACCCAGGCTGGAGTGCAGNGGCGCAATCTCGGCTCACTGTNNG 

CTCCACNTCCCGGGTTCACGCCATTCTCCTGCCTCAGCCTCTNGAGTGGCTGGGACTACA 

GGTGCCCACCACCACGCCTGGCTACI I I 1 1 I I IGTAI I I I I ATTAGAGACGGGGTTTCAC 

TGNGTTAGNCCCGGANAGANTNGATCNCCTGACCCTGGNAACTGGCCCCCTTGGCCCTCT 

AACCCTTTTTGACCCCTAAGGGAANGTACTGGAATATTGTNrTTGGGNNCCCATGCTTNITG 

NGGCAAGGCTTAAGGGAGCAGGAAAAGCATACANGGTGTAGCANCCTTTCCAGCTGATCC 

CCATGCCCTGCTGCACCTGGAGGGCTGGAACAAGCTATTCTCATATTGGGGAAAAG 

Sequence 2621 c 

NCCGCGGTGGCGGCCGCCGGGCNGGACCTGTTTTATCCCAGCTGAGAGGCAAGGAGAACC 

TTTGTTTCTTAAAAAATAAGCTGGTTTCAGCCAGGTGCGGTGGCTCACGCCTGTAATCCC 

AGCTCTTTGGGAGGCCGAGGCGGGCGGATCACCTGAAGCCAGGAGTTCGAGACCAGCCTG 

GACMCATGGTGAGACCTTGTCTTTATTAAAAATGCAAAAATTTGGCCAGGCGCCGNGGC 

TTTACCCCCTTATTCCCAGCACTTTGGGAGTCCCANGCAGGTGGATCACAAGGTCAAGAA 

ATTGAGACCCTTCTTGTTTACACCAGGGMCCCCCTTCTTTCCTAAAAATTTNTAAAAAC 

CAAANTTGCTGGCATGGNGGGGGCACCTGTAGTCCTAGCTACTTGGGAGCCTGAGGCANG 

AGAATGGTGTGAACCCGGGAGGCGGAGCTTGCAGTGAGCCCGAGATCGCACCACTTGCAC 

TGCAGCCTGGGGGACAAGAGTGAGACTCCCTCTCNAAAACANAACAAAACAAAATTNNCC 

NAGCGTGGTGGCAGGCCCCTTGAGGCANGANGAATCACTTGAACCCCGGGG 

Sequence 2622 

GGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGANGACCTTAGAAAAAGGAG 

GAAAGGAGGAGAGGCAGATAATTTGGATGAATTCCTCAAAGAATTTGAAAATCCAGAGGT 

TCCTAGAGAGGACCAGCAACAGCAGCATCAGCAGCGTGATGTTATCGATGAGCCCATTAT 

TGAAGAGCCAAGCCGCCTCCAGGAGTCAGTGATGGAGGCCAGCAGAACAAACATAGATGA 

GTCMGCTTTGCCTCCCCCCCCCCTTAGGGAGTTAAGCCAAAAGCTGGACAAATTGCCCA 

GAGCCTGTGATGCCTTCTTAGCAGGGAGAGCAGATGGAAATCCACCTTGTAGAGGCTTCC 

CCAGAAGAACCTCCAAATTCTGTCAGCTNATACCAGAAGTTAGAACTTCTGCCAGAAAAA 

AAAAAAAAAAAAAGTACCTGCCCC 

Sequence 2623 

TGGAGCTCNCCGCGGTGGCGGCCGAGGTACTTCN I I I I I 1 1 I I I I I I I I M I ICTTATTT 

AGGGGAGAACTTTTACCTTTTCACTTAATGCACTTCTCTTTGGTGTATCTGTNTTGNGNG 

CAACACTACTCTTGCTCTTNAGGGCCATTAANTAAAATAAGAGTTACTCAGGCTGGGTGC 

AGTGGCTCATGCCTGTAATCCCAGCACTCTGGGAGGCCAAGGCGGGCGGATCATGAGGTC 

AGGAGTCAAGGGCAANCCGGCNAMTNGGGGAMCCCCMNTTTTTCTAAAAATACAAAA 

ATTCNTCCNGGCAATGGGGGGCNNATGCCTNAAATCCCAANANACCCTNAAAAGGCTNGG 

GNCNGGGAAAANTNTTTNTAANCCAGAANCCCCGGGNGGNNGGGGNGNTGNANATANGCC 

CMTANNCCCCCNTTNNCNTNTNCNCCCGGGGGMNNNTTGGAANNCCTTTNTTNTNAAA 

AAAAAAACNAAAAAGGNTNTCNCTNTNCCCCCCNNGGGNAANNNGANACCTC I I I 1 1 I I I 

CTCCNCCCNNNTTTCNCTGGGGGGGGGGNNGAATNTTANTAAAANAANCCNCCCCNGGGN 

G ANANAATNTNAN 1 1 1 I I I I I CACCCCCCCNCGGGG 

Sequence 2624 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAACATACTGCTTTGATTGATAT 

ATACAAAACACCACCTNCTTTTGATGGAAACGGGGCCCAGAAGCAGATCCTGNGATGAGG 

ATTAATGTGCAATTCATGTAATNTATANCACAGTNTTCAGGANGTGCTCACAGGATAAAC 

CAATGAGGGAGNGGGTTCAGGGGCNGGACAGGGAAGGGAACTAAACTGNGCAAGGGACNN 

TTTNNGGGCANGGNNCGGNCTTAAGCCTANTCCCGTTGGGAGCTCTGGACATACATNAGG 

CTTCAAAGNTNGGNNCAACTGGGNGCAAAAAAACCTGGGCTGTTTATGCCTGCACCCCTT 

TAGGCACCCTAAGCCCTACTTTGAAGTNTTTA7TTTAGCTCTCAATGTGTTGAACTTATA 
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AGGAAATTAAACGTTTCATGAGTTAAG 
Sequence 2625 

TCCCCGCGGTGGCGGCCGAGGTACAGATAATTCAAATCTATTATCTAATGTATTAGGTTA 

ATTTCTATTTGTTCTATAC I I 1 1 I I I I I I I I I I I GAGATGGAGTCTTGCTNNGTCGCCCA 

GGCTGGAGTGCAGNGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGTTCATGCC 

ATT CTCCT GCCTNGGCCTTCCAAGNAGCTGGGACTACAGGTGCCCGCCCCGACACCTGGC 

TNATTTTrAAAAAAATTTrrTAGTAGAGACAGGGTTTACCGGGTTMCCAGGA 

NATCTCTGACCTCGGGATCCCGCCCNTTTGGGCCTCCAAAAGACTGGGGATTACAGGGCG 

GTG AGCCACCG ACCC GGCCTATTG ACA 1 Mil I AAGGGTCAAGATTTTCTTTTGTGTGTC 

TAGTMTTCGTCTTTTATTGCAAAGATAATTTGCTTATTNGACTNAGAAAAATGATTTGT 

GGGCATACAATATTGTATGTGGTACCTGCCCGGGCNGCCGCTCTAGAACTAGNG 

Sequence 2626 

TTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGATAATTCAAATCTATTATCTAATGTAT 

TAGGTTAATTTCTATTTGTTCTATAC 1 1 I I I I I I I I I I I I I GAGATGGAGTCTTGCTCTG 

TCGCCCAGGCTGGAGTGCAGTGGTGCAATCTTGGCTCACTGCAAGCTCCGCCTCCTGGGT 

TCATGCCATTCTCCTGCCTNGGCCTTCCAAGTAGCTGGGACTACAGGTGCCCGCCCCGAC 

ACCTGGCTAATTTTTAAAAAMNTTTTTTNAGGAGAGACAGGGTTTCACCCGTGTTAGCC 

AGGATGGNCTCGATCTCCTGACCTCGGGATCCCNCCCNTTTGGGCCTCCCAAGGACTGGG 

GA TTACA NGGCGGTGAGGCCACCGCACCCGGNCTATTGACATTTTTTAAGGNTTCAGATT 

TTCTTTTGNGNGTCTAGNAATTCGTCTTTTATTGGCAMGATMTTTGCTTAm 

AA 

Sequence 2627 

GCTCCCCGCGGTGGCGGCCGNGGtACAGATAATAACATCTGATATCCACATGGGGTCTGG 

AGGTGCNAGCCACCTTCCTTTCATCCCACGGTCTCACAGCAGCCCTGGAAAGAGGCTGCT 

CTCTGTTGGAGGCTAAGGGCCAGTGTTGGAAGGAGCTCTGGTGGAAAAGTGTGGTCTGCA 

TGANGGGCTCCCATGAATNAGAGGATAGGGGTGGCNGGTACCTGCCCG 

Sequence 2628 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCAGAAGGTCCCGGCAGCAGCAGGA 

AGAAGACGGACCCCGCGATGAGGGCGGCGGCANGGAGCACCTTCATGTTGCGTTCGGAGC 

CCCGCGTACCCTATGGACAGTTGTGTCCCCAAGGAAGGATGAGAATAGCTACTGAAGTCC 

TAAAGAGCAAGCCTAACTCAAGCCATTGGCACACAGGCATTAGACAGAAAGCTGGAAGTT 

GAAATGGTGGAGTCCMCTTGCCTGGACCAGCTTAATGGTTCTGCTCCTGGTAACGTTTT 

TATCCATGGATGACTTGCTTGGGTMGGACATGAAGACAGTTCCTGTCATACCTTTTAAA 

GGTATGGAAGAGTCGGCTTGACTACACTGTGTGGAGCANGTTTTAAAGAAGC 

Sequence 2629 

ACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCCGAGGTACCTCATCCCCTCAG 

TGACTMGAATTGCAGNATTTMGAGGTAGCAGGAATGGGCTGAGAGTGGTGTTTGCTTT 

CTCCACCAGAAGGGCACACTTTCATCTAATTTGGGGTATCACTGAGCTGAAGACAAAGAG 

AAGGGGGAGAAAACCTANCAGACCACCATGTGCTATGGGAAGTGTGCACGATGCATCGGA 

CATTCTCTGGTGGGGCTCGCCCTCCTGTGCATNGCGGCTMTATTTTGCTTTACTTTCCC 

AATGGGGAAACAAAGTATGCCTCCGAAAACCACCTCAGCCGCTTCGTGTGGTTCTTTTCT 

GGCATCGTAGGAGGTGGCCTGCTGATGCTCCTGCCAGCATTTGTCTTCATTGGGCTGGAA 

CAGGATGACTGCTGTGGCTGCTGTGGCCATGAAAACTGTGGCAAACGATGTGCGATGCTT 

TCTTCTGTATTGGCTGCTCTCATTGGA 

Sequence 2630 

CCGCGGTGGCGGCCGCCGGGCAGGTACAGATAGCAAAGACTGGGACCACAGTGGAGGGAT 

GCCTAATCCAGACAAGGGCAGGAATAGGCAGGGAAGGCTTCCTAGAGGAAGTGATTTCCA 

AGCTGAMCTTGACAGATGGAACAGAAGNTAGCCAGAGATGGGAAAACTATTTTTGGTCA 

ATGGAAGANCAGGTGGTTGAGATAGAATCTGACACATGANAGCAAAAAAAAAAGTCCANN 

GNTGGGAGAATACNGNGTGAGAATAAGACAATNTTAAACTGGCNATATAAGTAAGNGATC 

ATCACAAGGCTTTGTAGGACATAGTAGGGAGTTTAAGACTTTTTATTCTGAGGGCAATGG 
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GGAATCACAAGAGGGAGTTAGGCACTNTACACAACTTC 
Sequence 2631 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGCAGGCCTCCTACACCT 

ACCTCTCTCTGGGCTTTTATTTCGACCCGNGATGATGTGGCT 

Sequence 2632 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTGTGGATATCATACGAAAGTGT 

AGTTTCAMGGAGATGGACAAACGATACCTACAGTTTGATATTAAGGCCTTTGGNGAAAA 

TAATCCTGCCATTAAATGGTGTCCTACTCCAGGCTGTGACAGAGCAGTAAGACTAACGAA 

ACAAGGGTCAAATACATCTGGATCTGATACACTCAGCTTCCCATTGCTGAGAGCTCCTGC 

TGTTGATTGTGGAAAAGGACACCTCTTCTGCTGGGAGTGCCTTGGTGAAGCACATGAGCC 

TTGTGACTGCCAAACATGGAAGAATTGGCTGCAAAAAATAACCGAAATGAAACCAG 

Sequence 2633 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACC I ITTTTTTTTTTTTTTTTTT1 I GAGA 

CGGAGTCTCGCCCAGGCTGGAGTGCAGGGGCNCAATCTGGTCTCACTGGNNGCTNNGCCT 

CCCGGGTTCGTGCCATTCTCCTGCCTCANACTGTCGAGTAGCTGGGACTACAGGCACCCA 

CCACCAAGCCCAGCTACTTGGGATCGCATGAGCCTGGGANTTTGAGGCNGCAGTGAGCCA 

TGATCACTCCAGCCTGGGCAACAGANNGAGACTCTNTCACAAAAATTAACTTTACTTAAT 

TACTTTT ATT ATTG N C AT AATCG CTCC ATTTT ATTGTTGTTTGTTGTTGTTG N AAACC AA 

ATTGTAAAACCATTTTATNTTTTGTTGCAAACCAAAMGTTTCTGAGACAGGTCTTAATT 

TATTTAGAAGTTTATTT 

Sequence 2634 

TGGAGC TCNCCC GCGGTGGCGGCCGAGGTAC I I 1 I I I i I I 1 I I I I I I 1 1 I 1 IGTGATAAG 

GCTAATAi AAAAGGAGGCACAGTGAATTTAAGACATTGACATTTATAGCCGTAGC 

TCGACGCTTACTCTTGGAGGCCCAATTATCATCCCTGTCCATCTTGTAAGTGTCATGTCT 
TCGTCATCTTCTAGACCCCAGCTAACTGTGCCATCTCCTACTCCTTTCTGGCCTTCTTCG 
AGTTCTTCCAACAGTCGGAAATTGCGAGGGACTTTTACTCCCGAGCCCGTGGTGGCTGCC 

ATCTTGCGTCGCTGTTGCTTGAAGGCCGGCCCCTTCTCCCCGCGTCCTGCCCGGGCGGCC 

GCTCTAGAACTA 

Sequence 2635 

CTCCCCCGCGGTGGCGGCCGAGGTACTCTCCGCCTAAGTCCATAGAGAGGCTAACACCCT 

TCTAGTTTATTAGTCTTTGNTAMCCTATGAMTTCTGAAT^AATGGCTANNTTATATG 

CANNGTGGAGGNCACNCMAAATGTTTCTCTCAACAGCTGTGATAAGTATAGGCTTATTT 

TGATGTCTAAAGATCTGNTACCTGTATCTGNTTTCATCTTTCAACACAAATTCATGGGAA 

NNTTAACTATGNNCCTGTGNTCNGGACAAGTGTGCATGAGAACATCATNCACTAAGTTTA 

TCATNAAATGGGAAANGAAGCAGACNTTTTAAAAAGCACCCAAACCGGCCGCTCTAGAAC 

TAGTGGA 

Sequence 2636 

AGGTACGCGGGGGCTGACTCTCTTCTCAGATTCAGCCCAGTTGCACCCAAGTAAAATAAA 

CAGTCTTGTTGCTCACACAAAGCCTGTTGGTGGACTCTCTTCATATGGACTCATGTGACA 

TTTGGNGCCGAAGACCTGAGACAGGAGGACTCCTTTGGGAGACCGGCCCCTGTCCTCGCC 

CTNTTTCA TGAG ATCCACCCATGACCTTGGGGTTCCTCANCCCAGCNCGAANGAACTTCT 

TCACCAANTTTTAAATCGGGGACCGCAAAGCCAACAAGAATGAAGAAGTGNCCAGGTATG 

GTGGCTCATGCCTGTAATCCCAAGCACTTAGGGAGGCCGAGGAGGAAGAATTGCTTGAGG 

CCNAGGAATCCCANACCAGCCTTGGACAACATGGCAAAATNCCANTNTTTTCNAATNANA 

TTAAAGNAAANAAATTAAAGTTACCTGCCTNGGCGGGNCGTTCTAGNAACTAGTTGGANC 
CCCCCGG 

Sequence 2637 

TTAGGGCGAATTGGACTCCACCGCGGTGGCGGCCGCCCGGGCAGGTACACGGTCAGTCCG 
GGTCCTAAGGGCGCAGGGTAGGGCATNCCACTGGGAGTTCAAAGGGGAAACNAAGATGGT 
TCCCACTGC TGCTC AGACAGTGTCGCTAAAATTCCCTCACTCATTTTCAAGTCTTGTTTT 
ACATMTGGCTTTTAAAGCAACTTTTGTTAATGCTNCTGATNCTTTAATNCAAACTATTA 
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ATGTNCCATTTTGGTGACATCTGGGTTTTCATACANAAAAAAAAAAAGTCAAAAAGGGGA 
MCAAAAANATATTTACACATTTTATATATAAACTTAAAAACCTTGTACCTTN 
Sequence 2638 

CGAGGTACTATGGCAGTTTTGCCTCAGGTGCTGAACATTTCTCAGCCCTGGCTAAAAGGG 

AGCAGCACAGGGAGAGAAACAGGATAGGAAAGCAGAATGGCGAGCAGCCTATGGCCCAGG 

GCCTGTAATCCCTTCCCAAGACTAGCTGCTCAGGGTGGTGCAGGGACAGGACCAGACCCT 

GCGCCTATTTCCTGCCTTCTTTCCCTATAGGGAACTCTGTAGGCTGAGCCACTGTCCTGC 

TCTTATGACATTATATCTTGTGCCTTTCTCCTCAGCAGTGAGCAGTGAGCTACTCCTGGC 

CCAGGCCCTAGGGGAMTGGATCAGTCmGAGGTTTCTATTTGGGGAGGGGAGTACCTG 

CCCG 

Sequence 2639 

CCGCNTGGCGGCCGAGGTACTTG I I I I I I I I I I I I I I I GAGACAAGGTCTCGCTCTGTCA 

CCCAGGCTGGACTGCAGTGACATGATATCGGTCACTGCAACCTCTGCCTCCTGGGTTCAA 

GGGTG ATTCTCGT GCCTCAGCCTCTCAGGTAACTGGGATTACAGGCATGCACCACCATGC 

CTCGTAI I I I I I I IGCGTGTGTTTTTAGTAGAGACGTGTTTCACTATGTTGTCCCGGGCT 

GATCTCCAACTCCTGTAAGTCAAGTGATCTGCCCGCCTCAGCCTCCCAAAGTGCTGGGAT 

TATAGACATGAGCCACCACACCTGATGTCTGATGCTTATTTATTATGTGACCTTAGCGAA 

GTGTGGTAGTCATTAAGTGCTGGTCTATCTCTATACCTTCCCCAGGCAAGGTAGGATTGC 

ACTTNCCGCTCCACTTGTGATTAGGTGGAGCCATGTGACTACTTT 

Sequence 2640 

TGGGCTCCCCGCGGTGGCGGCCGAGGTACTAGAAACTTTCCAAGGAGTCTTGGGTGTGTA 

GCCAAGAGGAGCCATGAGCTATGGACTCCTCAAGCACGGGAAGAGGAGGTGTGTGCTGAG 

AACAGAGAGGCCCTGCCCTCTGTCCACTAGCGAGAATCCCTAGCTGCCCCAGCCCAGTCT 

TTCTCCCCGGCATTCACAAACTTTGCAAGCGTTGGTCCAGGGCCCTTCTCCAGATCTGTT 

NCAACTTT GNAG AGTGAAGGGCTTGAGCATACGGGGGAAGAGAGTCTGCATNANGTTAGG 

GGGAMACTTTTAAAAGATACCCTCATTGTGTCAAAAGAAGTGTGCCMTCTATTTTTGT 

ATCAGCATTGGGAAGNGCACTTTCCCCTGGGGCCGTGTGGGTGNGTGAATGTGCAAGTGT 

CTGAGAGATACTGCATCAAGCCCTAGACCCTCAAGAGCCAGTCCCGCCCTTTTACAGAGC 

ANTCCCTTATCCTGGGGCCATGGGTCAGGCTGACCTTCAA 

Sequence 2641 

CCCCGCGGTGGCGGCCGANGTACGCGGGGGACTTCGGGCTTGTTCGCTGGTGGCGTNNGA 

GCCNAGCCCGGACTGGTCAGGATNGATCACGGACGTGCAACTCGCCATNTGNNGCCAACA 

TGCTGGGCGTGTCGCTCTTCTTGNTTGTNGATCTCTATCACTTACGTGGNCNGTCAAACA 

ATTCCAAGMTGCANGAATGAAAGTTGGCGCTTTCTTCCGCCCCANGGTCCCAGGACATT 

AGTCTGNGGCANGATNGAGGGGTNTNGMGGGGCCTTTCACACTTAACTTTATTCCTTTT 

ACCCTTCACAACATACAAAAGGCAACTTACACCTGGGATTTTTNCAAAACMCCTTTTAT 

TTCCCTCAGANGNCTTTCCNTTAATCCCTATGGAACAAAGAANGCTNGNCCACTTGAANT 

AGGGGCCCNAGTATAGGGGGCTTTGCTTTTCTACTTCCNTC 

Sequence 2642 

GGACTCCACCGCGGTGGCGGCGAGGTACGCGGGTATCTGTCATAAGCTCAACATCTGTAG 

ATCAGAGGGCT ACCAG AGGAACCAGTTTTAGAAGATGAACAACACCTNTTAGAAAAGAAA 

TTGCCTGTNACGTTTTGNAAGATAAAGAAGCGGGGANAACTTCGAACGTTGCAAACCTGN 

AAACTGGGAGAAACGANGGCAAGCTCTNTTGGAACAAGCAAGCTGCAAGGGAGCAAGGNN 

CGCCTGNCCCANCATGGAGCCGGCCNNGNCNANGGAAAAGGAAGGAGCGTGAGCGCCAGG 

AGCAAGAAGCGCAACAAGACAACTTGGAACTNGGGGAAGCAACTGGGAAANGCATGCTGT 

NAGCTANNAACCTGCANTATNAGAGGGAGGAGAGGGAGGAAAAGAAATTNGANAGGCGAA 

GAGGCTNGCAAAACTGGGAAACTTGAAAGGCAACGACAACTTNGAGTGGGAACCGGAANT 

CGAAGGCAAGAACNTNCCTAAATCAAAGAAACATAGGAACAAGGAGGGACATAGTTTGNA 

C 

Sequence 2643 

CAATTGGACTCCCCGCGGTGGCGGCCGAGGTACCTTATGTAGCCCAAGAAATTCAAGAGG 
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AAATTGATGGGGCTCCTTCAGGAGCAGCGTGCAGATATGGACCAGTTCACTGCCTCAATC 

TCAGAGACCCCTGTGGACAGTCCGGGTGAGCTCTGAGGAGAGTGAGGAGATCCCACCGTT 

CCACCCNTTCCACCCCTTCCNAGCCCTACCTGAGAACGAAGACACTCAACCCGGAGTTTG 

TACCTGCCCGGGCTGGCNCGCTTCTAGAAACTAGTTGGATCCCCCGGGCTGCAGGGAATT 

CGATATCAAGGCTTATCCGATACCGTCCGACCTCGAG 

Sequence 2644 

AATTGGAGCTCCCCGCGGTGGCGGCCGGGCAGGTACI 1 1 I I I I I I I I I I I I 1 1 I I I IGGA 

AACAGGGTCTTGCTATGTTGCCCAGACTGATCTCAAACTCCTGGCCTCAAGCAATCCTCC 

TGG CTTAG CCTCTCAAAGTGCTAGGATTACAGGTATAAGCCACTGCACTAAGTCCAGTGA 

ATCATTTTCCAGACAACATTTACTGAATATCCGACACTGCCTTCCACAGACGGGGGAGGT 

TTCCCAMGAGGCCGTGCTTTTTGAGAAGCTGGCAAGATAACCCTGGGGTGGAGGGAAGG 

TGGCCCGCAGCTGAGACCACCACCTTACTGTCCCCAAATCCTCACCAGAGGTTGCTGACA 

CTTTCCTNTTCCC CATCC TCATCTACTTTTCAAATTAGAACAGTAGAAAATGGGATGATT 

AGGCTCCTTNNCATTTTTAAAAAACCCCGTTGTTTCAAATTTTTNTAGACNCCAAGTCTN 

AAANGGGNGTAATCNCCTTNGTAAAATTTAAATT 

Sequence 2645 

CCGGCAGGTACCAATCATATAATNTATATAACATTGCTATCAGACTAAAAACACATTCTT 

AGCTAMGATAACTTACCATTTAGAAGTCAAAATGCAGGGAATCTTACTCCTGTTTCCAT 

TTTNTGNCCCNCTTGCTTCACTCGNGTATGNCATGCTCTATCTCTTCTCCTATGCAGACT 

TTANGNCNGTNGGCCATTMNTCTTGMGAAATTTCCTTCNNTCTTGCTGTCACNTACCA 

NNTTANTTGGTCTGCGTGCAACAAGAAGGNGTATTATANNAAAAAAGTTCTTGCTTAACC 

ATTCANGATTAAATAAANAAAMAATTCCTTTGTTTNAACATTTTGNTATTTTTT 

TACACCAMCTTTTTTAATTGCCTTTTNCANAGNNCCTTTCCCTCCAAAAAATAAAAAAC 

AAAAAATCTTCAATCNACATAAAATCAAACACCTGTATTGATCCATGTTCATGCTAAGCT 

GGGNAA 

Sequence 2646 

ACTCCCCGCGGTGGCGGCGAGGTACAAGCGCTTTGAATATCATGGGCACCATGACTGTGA 

CCCTACAGGTAGGATTGGATCACTCCATGAGAGTAGCCGGCAGGTTTCTACAATGGCCTG 

GGAATGGACTGATTATTTTTATACATTTTCTGGCCTGAGAGAAAGCCAAGGTCCCCTGCT 

GTTCACAGCAACCCTGCCTGGGAGCTTGGAATCTTGGTAMTCTGCCCCGTTNGGATCTA 

TTGGAGGTAGGCTCACCCTTTTTNGTCTTTTGTGGGAAMTTAAAGAGAAATAATTNTCA 

GACNTATCATCACCTCCAGTGGAACTAGAGAANACCTGGACCCANCTGCACTATTTTAAT 

GTAAAAATAACAATATGGCCAGGGTGCAGTGGCTCACGCCTGTAATNCNATCACTTTGAG 

CAGCCAAGGCGGGCGGATCACGAGGTCAGGAGATTAAGACCATCCTGGCCAATATGGGTG 

AAACCCTGTNTTTACTAAAATACAAAAAAATTAACTGGGCATNGTNTTGCGTGCCTGTNG 

TCCCANCTACTTGGGANGGCTGNGACCAGGGGAATTGCTTTGAACCCCGNANGGCNTAGA 

ATTGCANTGAGCCNNAATCANGCNTCTGACTTCTACCTNGGCGACAGGANTGGGACNTTT 

TNTTAAAAAA 

Sequence 2647 

AGGTACCCTATATTCTTCTTGATTTCTAGCCTTTTATTGGCTCTCAGATTGCCAGAGTTG 

GGACTCAATAGTAAGCANCCATTCTGGTGAGGCGGAAGNGATNCTACCAGGGTGNGTTNT 

CATGACMGCANMTCACTGNGTTTTTCTCTCTACTCTGTGGCATANGACTCTATGCCAT 

AGAGNGACGTGTGAAAGGCTTGAGGCT 

Sequence 2648 

TCACCGCGGTGGCGGCCGCCGGGCAGGTACTTATTTTT Ci I 1 I 1 I I 11 1 I 1 GGGGGTGGG 

GNCCTGGGNANTTTTNTNMGGGGCTTTTTAACNNGGGANGANANCCTGTGCCGGTTCAN 

C CCAN GCCGGTNGTNAATTTTCANCC I I 11 I I CANCTNGCTTTGGGTTTTTAAGGCTTTN 

GTTTTCNANCNTTTTTCTTTNAAANCCTTCTTCACAAGTNGANGCCATTCNTCCNGGTTC 

CTTCN AAATNT GGGCCTGGGTCCCTTNTCCCCGGNTTTAAAAAANAACTTNCCAAAAACA 

AAACANTTTTTTCCCCTNG 

Sequence 2649 
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CCGCGGTGGCGGCCGCCCGGGCGGGTACGCGGGGGTCTCCAGAGTGAGTGTTCCGGAGAG 

CACCGTGTTAGGGAAGGAGGAGGCTGCGGGCTAACCTGCCGGGAGGGAGGATGCTACTGC 

CTGTTTCCTATTAGTGACAACCCACCTCCTAATCACGTCCTGCTTCAAACAAGGTAACAT 

CACAGGACAGCCTCCGAAACAATMCTGTTTGAATATCCTTAATCTTCGGCAACTTCAAT 

MCTCCCCAAAATATATGTAATCAGAGMTTAACTTTTACMCTTTTGGTTATTGTTTGC 

TTGAACCATAAAGCAGAGCTCTTTCTGGGATAAAAAAAAAAAAAAAAAAAAAAAAAAAGT 

ACCT 

Sequence 2650 

AGGTACTTCACAATACAACTCTTGCAGAAAGTATGAAGACACTCTGTGATGGTGGTGGCA 

TCCACGAAGTAGCCGGCGCATAGGCAGCMACAATGTGTTCATTCAAGTCTTTGATCTTC 

ACTCGAACCTCCTCCTGACCCTCCTTTCCCAGGGGAGACTACACAACGTCGGCGACACAA 

CGCGCAGGCCCCCGCGTACCTGCCCG 

Sequence 2651 

TATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATAACAAGCAATAATAAA 

AGTCCTCTGCCGMGACAGMTAGTGTAAAACCTTTCCTTCCCCTGCCCTTACAGATTTC 

TAGAGAAGATAATTTGGGMTAAMGGTATAAMGGTTTTTGAGAAAAGAGAGATCAAAG 

GACATTCAGAGTTCGGGAGGATATCTGGGAAGCAGACTTAGCTTAGATCCTAAAGGGTGT 

AGAAGATCTGCATTGCTGGGGAGGAGTGTGGGAGTGGTAAACCAGTCAAAGAATTCACAG 

ATGTGCTGGGAGGAGGAGAAGGGGAGCAGAAGCAGGAAAACAGTCTGGCTGAAACACTGG 

TTGTATTTCAAAGAGCAGTAGCAAATCAGGGTGGATAAATGTGTGGGCCTAAGCAGCTGG 

TGAGCATTAAATGTGTAAGMTCMATTGTATCCTTAGCAAACCTCTGAAGATTTCTGAG 

TAAGATTTTTATGAGTGTTGGTAAATTTATATAGGATGCATATGGGTAGTACCTGCCCGG 

GCGGCCCGCTCTAGATCTAGTGGATCCCCC 

Sequence 2652 

CCGCGGTGGCGGCCGAGGTACATAGTGTCGCGAACTCAAATCGGCATTTAGATAGATCCA 

GTGGTTTAMCGGCACGTTTTTGCTTATMAAAMGTGCAAAAAAGATGTGGTTTACAAG 

TTAMGCTACAGAATCCCTTTTTGCTGTMTTGCACCAGTTTTAAAGCCTCTGGACAGAG 

CAGTATTTCGTTTAAMCTTTGTTTTTCTTAAAAGCTTACAGTGTTTGGCTMT^ 

CCCCI I I I I ACAAGACGGGGGCCGGAGGGTGGACACTGGTGGCAGGTTAAGGGATACTGT 

CACTTTAAGAAGCCTGCAGATTGMGTGTAAACATGGAGAMTTAGGGGCTGATTTTTTA 

AACTGTGTGAGATATTAACCAGCCGCCCTGTTATAAAATCAGGAAATNCAAACAGCGATT 

T ACAC CGATTAACACCCCCTTTATATATTrnTACAAAAATCACTGAGAAAATAATNAAC 

GTTTTCATCTCTCTTGGCTTTTTTTGGTTT^ 

NTTAAAAATTTAA 

Sequence 2653 

CACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCGCGGGCAGGTACACCAA 

ATGGATTACAAGCAGCATCCAGCAGAAGACAGACCCCCCAACCCTGCCCACCAGGGCTCA 

CACTCTACAAAACCCTGAGGGCCTAGAAATCTGTAAATGCATCGCCAAGCACTGGGGCTG 

ATTTGCAGTAATTCTCTAAGCAAGGCAAACATGATCTAGCTTTGAAGGCAGCATGAAGGC 

AGCGGGTTGGTGAGAATAATCTCTCCTTAAGAGAAGAAGAAACCTGGGGCGGAAGGAGTT 

TTCCCCGAAGTGGCTTCCCGCGTACAGMCACAGMCTTATTTCTGTCAGTTATTTAATA 

CATTGMAATTTAGTGAMTGTTCAMGAGAATAGATGTTTCCCAAAACAACMCTTTAT 

GTTAAAAATAGTCATTAAAAGATCTGTTGTAATTAAAA 

Sequence 2654 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGTAAAGGTATAGT 

AGTTGGCAGCAGAATGGACCCATTGAGGATAACTATAAAATTACNGAAAATATTTAAATG 

CAACTTATTGCTATAGAAGCAAAGAGGACTAAAGGGCAAAATTCTAGAGAGTGGTAAATC 

TCAGAAAGCACAAGCATAAAATGCAGCTCTGGGGGCCCTTTCCACTTCTGGCTATAGGGA 

AGAACCTGAATACTGAACTTGATTCAGGCAGAGGACCATAACCTGGGGGTCAGGGGAAGG 

CATGGGGGGGACCAGAAACCAGAAGAACGATCAAGACTGCAATGAAAAAAATGGATACAT 

TAGGAGCTTCAAACACATATAATTTCTCAAGAAATTTCCAGATTCTCATGCTGCATAGGG 
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CAGGAGCCTGAAAAGCTAATTTGAGAAGATAATMGTTGGATTTTO 

TTTGCMGTACCTGCCCGTGCCGGC 

Sequence 2655 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGGAGGTACCACTGAATCCAAGG 
CTCTCTTGGGTAGCCTATGTGCCTCTTGGATGGTATGTGGMGCCAAGGACTGTCTGAAC 
GTGCTGAACAAGAGCAACGAGGGGAAAGAATTACTCGTCCCACTGACGAGTTCTATGTAT 
GTCCCTGGGAAGCTGCATGATGTGGAACACGTGCTCATCGATGTGGGAACTGGGTACCT 
Sequence 2656 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCCCCGGGCAGGTACTTGAACAGTAGGAGG 

AGGTGGTTCCTCATTCGTCTCCCGGGAGCGTCCTCTTCTCAGTCAGGCTGGCACCATGAC 

CCAAGGAACCTCGGCGAGTGACGGATAAACACCAGTCGGCCCGCGACTAAGAGCTGCGCC 

CCCGCGTACGCGGGTCACCAGGGTCAGTTTCTTTAATGATGGTTTCCAACTGGCCTAATA 

CATTAAGTAAGACTGGCTGATAACATGACCAGACAGACATAAAGACCCTGTTGGGAATGA 

CATTGAACTCTCAAAGTCAAGATTTCTTACACAAATCTATCAGCTGGAGAAAATGAAGGC 

AGTGTGGTATATGTGTGCAAATAAGGACATTATGAAGCTTAAATATGGAATGTCTCTTGG 

ACCCCCGATGTCATCTGNATTCTCTTTTTCTTCTTGTACCCT 

Sequence 2657 

CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACCATCTTGGCTCACTGCAA 

CCTCAACCTCCTGGGTTCAAGCGATCCTGCTGCCCCAGCACCACCCCCTCTCCAAGTAGC 

TGGGACTACAGAAGTGCATCACCATGCCCAGCTAGCTAACTTGAATTTTTAGTAGAGACA 

GGGTTTCACCAGGTTGCCCAGGCTGGTCTTCTGAGCTCAAGCAATCCACCTNCCTTGGCC 

TTCCAMGTGCTGGGATTACAGGCGTGAGCCACCGCACCCGGCCCATTTTATATATTATT 

TTTGCATAGCCTTCATTGTCTTAGCAAAGTCAGAAGAAGACCANTMCATMTGTNATTT 

AATTAACAACAACGNCAACAACAACAACAAAACAACCCNGGATGGGGGCAGTGGCTCAAG 

CTANTWGGCCGNCC TTATGAA AGNCCMGGGGNGNGGATTGGTTGATTCCATTTTAAA 

ACCAMCTGMNCCCTNTTTTTTANAAAAMTMACCGTTAAMAMTCCTTGGTTTTTG 

TTTAMNGNGCCGGTCCCCCCCCCMGGAGAAAATTTTTNGGGGGGGGGGNGTTAAAAAA 
AA 

Sequence 2658 

GGGCG AATCGGAGCTCC CCGCG GTGGCGGCCGAGGTACTTTN I I I 1 11 I I I I I I I I 11 1 I 

mil iCTCCANAGGCTAGTTTTTCTTCAGTCCTTAANAACTCTGCTCCTTATATGGGCT 

TCGGTGGCAGTCATGGGGCAGCACCGCAGGTCTACAGTGGGGTGGAGGTGTTCGGTCCTT 

GCGGGGCTTCATGACCTTGATTTNTGGCGGTCGGGGGGCANCACCCNCAGGTCTACAGTG 

GGGGGAAGGNGTTCGATCCTTGTGGGCTTAATGACCTTTGATTCCTGACTACCGGGCTGT 

GAATNGGCACAACTTCACACAAGTATTGTGNTTTCNCATCNACCCTGGGAAAGAACCCTG 

CCCCGGGGGGGCGGGGTTT 

Sequence 2659 

TTAGGGCGAATTGGAGCTCCCCGC GGTGGCGGCCCGCCCGGGCAGGTACTTTNTTTTTTT 

M I 1 1 M nAI 1 1 11 1 1 1 I 1 1 I 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 I 1 1 H 1 1 1 1 1 I 

i n ( i m M 1 1 I I I I I I I I I NAAAAA'l I 1 I I I I i 1 I I I I NAAAAAAAAAANCCNCCNCCC 

CCCNMNNNCCCCCNAAANAAAAAMAMNNNNTTNAAAAAMTTTCNCGGGNNNTTCCC 

CCCNNAANNTTTNTNTNGGCNAAAAAAANNNNNNN I Mill I TNGGGGCCCNNCCNTAAA 

MAAAMNTTTAAAATTTTTTGGGGGGGNCCNGGGNANCCCCAATTTTNANNTTNGGGGG 

GTTNAAAAAAANAAANGGGGNAANNAAAAAAANGGNCCNNNNAATTNGNNTTNTNAAAAT 

TNCNCCCNNNAAAANTTTTTCTNAANCCCCMMNGGGANGGGNCNIvrrNGGTTTTTTNA 

GGTAANGG GGGGCCAGGGGGGGNTTTTCCCAANTTTTNCCCGGCCCAAAAAAGGAAAAAT 

Ml I GTTGGGGGGAGGGAGTTTTNAAAAA 

Sequence 2660 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGGAACAGGGATAAG 

TTCTTGGATMGGTGCCAACATACCTATAAMGCTGATTTTTGAGTAAATTATTGATTCT 

AACATATGTMTGGATTTGGTGTGATAATTTTCTGATCTTTMCTATAAGTGACTTTTTA 
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TTCTCCACCAGAAAAGATAAATGACTGAGAATGTAAGTCTGCGCTCTGATTAACACAATG 

GAGAAACGGAAAAACTATCTCTGTTAAAAACTGATTCCTGTCATTCTTCTGATATCAAAT 

MGAGGAAGGAAAATAMCTTTTTGTGTGTAGATAGAAAAACATACCTGAGGCCAGGTGC 

AGTGGATCACGCCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGCAGATCAGCTGAGG 

TCAGGAGTTCGAGACCAGCCTGGCCAACATGGTGAAATCACGTCTCTACTAAAAATACAA 

AAATTATCTGGGTGTAGTGGTGCGTGCCTGTAATCCCAGNTACTCGGGAGGCTGAGGCAG 

GAGAATCACTTTAATTC 

Sequence 2661 

TCGGAGCTCNCCGCGGTGGCGGCCGGGCAGGTACTCTGGCTGCAGACTGACCTTGCTCAG 

GTCCGAGAAGGATGGGGCAGCCACTGGAGTGGATGCCATCTGCACCCACCGTCTTGACCC 

CAAAAGCCCTGGAGTGGACAGGGAGCAGCTATACTGGGAGCTGAGCCAGCTGACCAATGG 

CATCAAAGAGCTGGGCCCCTACACCCTGGACAGGAACAGTCTCTATGTCAATGGTTTCAC 

CCATCAGACCTCTGCGCCCAACACCAGCACTCCTGGGACCTCACAGTGGACCTTGGGACC 

TCAGGGACTCCATCCTTCCTTCCCMGCCCTACATTTGNTTGGCCCTTTTCTGGTGCCAT 

ACACCCTTA 

Sequence 2662 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGGAGGGACATACAAT 

ATTTMTAGGATATTTCTACAGAACAATAACTTATATTATGTCCTTGTAAAAATCTGTAC 

CTCTTTAAAACATTTAACTGAAACATCCA I I I I I I I I I AGCTTTGCTAATCAAAATTGTT 

TTAAGAATTAAAACTAGGTTGTAACTAATGTCAGTACCTGCCCGGGCGGCCGCCCGGGCA 

GGTACATAGGCATCCTATTCACTGCACCCTGTCACACCCGGCACCCCCCGCCCCGCACAT 

TATTTGAAAGACTGGGMTTTMTGGTTAGGGACAGTAAATCTACTTCTTTTTCCA 

Sequence 2663 

AGTTCGGTGTAGGTCGTTTCGCTCCAAGCTGGGCTTGTGGTGCACNGAACCCCCCGTTTT 
CAGCCCGACCCGGNTGCGCC7TATTCCGGTAACTATTCGTTCTTTGAGTCCCAACCC 
Sequence 2664 

AG GTACT GCAACTGCCAGAACTTGGTATTGTAGCTGCTGCCCGCTGACTAGCAGCTGGAC 

TGATTTTGAATAAAAATGAMGCATTAMGGGTTTCCCTACAAMCATTTTTCTT^ 

TACTTTTGAAMTGGCTATMNCAGTTGACTTTCACCCTTGGAGAGCATCACACTGTGTG 

AGGTTCAGTGATTGTTGACCCTNCCCAGCCCCTNCTGCTTCTTTAAGTTATCTGTGTGCG 

TGCNCTTCCTCTCMTCTTNTTTTGCACCGCTCATTTNTTTTTCTCTGACCCATGAAGAA 

AAGGAAMACTTTACTGATTGATAAANTT7TTAAAANAA 

Sequence 2665 

GGGCG AATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTAAI (TTTTTTTTn 

TTTTTTGGTAGMGTGGTGTCTCACTATATTGCCTTGGCTGGTATTGMTTCTTGGGCTC 

AAGCAATCCTCCCCTCTT GGCCT CCCAAAGTGCTGGGATTACAGGCATGAGCCACCTCAC 

CCAGCCCACTTATTCATCTTTTTGCCTGCAAGCTACACCACCAAAGCCCCAGGTCAAACA 

TCTTTCTCCACAGACTGTGAGAAAAGAGCCTTTCTTCCTCCTMTTTGTMTGGCTTTCA 

GCTCTATNTGTCTAGCTTCAATCCTGACATCTGCAGCTTACA 

Sequence 2666 

GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAATGAGGCTTGGCTCTGTGTGAT 

GCACTCTMCCAGCCCACACTTCAGCAGCGGCATCATTTGGATTCAGGGAGAGTTTCTGC 

AGCATGAAGGAGTAGGAAGAACACAGGGTGAGTTCACAGGAATCCCTACTTTTCCATGTG 

ACCTTGTTTGGCCTGMGTGATTTTCTTGACCTCTCTGGGCCTGACTTTCCTCATCTGTA 

AAGTGGAGGGTTTGAAAATATCACAGAAAACAAAATACCTTAAAGGATGCTCTGGCACAC 

AGCAGGTATTTCCATATMTGATACCTCCCATTCCTTTTTATGTGAGCTATATCCCCTGA 

AAC CCAG GTTTGACTAAATTGAGACCAACTTTCATAATATACACAATGACTGNTAGATAT 

GAATTTTGGTGTGGTGAAGATGGGGAGTGAAAAAAGTAGAAAAAGTCAAATCTCATTGAA 

TAAAAAAGGG 

Sequence 2667 

NNGGCGGCCGCCCGGGCAGGTACTTTCTTTNCTTTTT7T N I I I I I I I I GAGGCAGAGTCT 
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NGCTCTGTTGCCCAGGCTGGAGTGCAGCGGTGCGATCTTGGCTCACTGCAAGCTCCGCCT 

CCCGGGTTCACGCCATTCTCCTGCCTNAGCCTCCCAAGGAGCTGGGACTACAGGCTCCCG 

CCACCACGCCTGGCTAATTTTTTTGTATTTTTAGTAAAGACGGGTTTCATCGTGT^ 

AGGATGGTCTCGATCTCCTGACCTNATGATCCGCCCGCTCTGTCTTCCCAAAGNGCTGGN 

ATN ACAG GGCCTGANCCATTGTGCCCAGCCAAANTGTNCCTTTGNAAAGTTNGCGAAATC 

AGATTTTGTTTCCCMTAGMCCAAMTTTTTATGAGGGATC 

ATANTMTTAGTTTACAACCTGMNAMTATTATGTTTTGTANTAGATAAATATTAAGGT 

GNGCATTTTAA 

Sequence 2668 

CTAT AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACAT 

ATGAl 1 1 1 I ATTAGTTTCTGGAGGCAAATTTMTTTTTATTTTAMATCMAATCTATTT 

TAAAAGAAATAGTTCTCAAAAAGACAACGATGACTGGGTGTGGTGGTGTGTGCCTGTAGT 

TCAAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTG 

GGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTG 

AGTTACAAAAAAGTGTAACTTATGAAAAGTCCGTATTTATATTGAC 

Sequence 2669 

AGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGGGCAGGGGT 

GGGAGC ATTTA TGGTGATCAGATGTGCCTGGCAAGCCCCTGTTCAGACATTGCTCACATT 

CCMATGTTTTCGTGTAGAATATTGCACAGGTCTGGGGACGCTCTACCTGTGCCCTGTGA 

GTGTTAATAATGGTGGAGAAAGAGTGTAGCTGTGCCCTTGAGAGAGAAGGTGAGGGAAAG 

AGTGCACCAGTCAGCTGACCGTCAGCTGGCTAGGCTCTTCACTGAGTCCTATGTCGCAGT 

GCACAAATCACTGCCCATCANGCCTCAGTTTCCTCATCTGGTAAATGGTGATAACATCAA 

TCTGCCCCCCCGCCAGGGTGCTGTTATGAGGGTCAAAAGTGGTAGTGGAGGGTAATACTG 

GNTGAGTCCATTTGTGTGTGGGAGGMGAAAGGCTTTACATTNACCTGGTACCTTGGGCC 

GCTTTAAGAACTAGGTGGATNCCCCCGGGCTGCANGAAATTTNATATTAAAGCTTATTCG 

ATTCCCGTCCACCTTNGAGGGGGGGG 

Sequence 2670 

AT AGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCTTATTACATAT 
GATTTTTATTAGTTTCTGGAGGCAAATTTMTTTTTATTTTAA 

AAAGAAATAGTTCTCAAAAAGACAACCGATGACTGGGTGTGGTGGTGTGTGCCTGTAGTT 

CAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTTGAGGCCAGTTCAGTCTAGCCTGGG 

TAACATAGCAGGACCCTGTCCCTAAAATAATAAAAAATTTAAAAACCACAATAATGTGAG 

TTACAAAAAAGTGGTAACTTATGAAAMGGTCGGMTTTAATATTGACCTTTGGTAAGAG 

CTCMTTANTTTCAGGGAAAGGCAAGGGAGTATCACCATTCTGAGTAATACAATTTCAAC 

TMTCTTTATTTCTTCTACTTGAAGTCAGTGCCTATCTACCACAAACATTCCTATATCAG 

TGTGCAAATTMTTTNGAGACMTAGGCTTTTTACGACAATGMTTGGTTACTTTNAA 

Sequence 2671 

ACGGCGA A TTGGAG CTCCCCGCGGTGGCGGCCGAGGTACTTNN I ITTTTTTTTTTTTTTI 
n i 1 1 i mCAI II I I ATAAGAATATATAAAAAATGATATAAANGGACATTTACGGTAGTG 
GGGGAAGGCATATATNTACGTTAAAAGGCAGGACATTTTTAA 
Sequence 2672 

TACGAGCCATTTACAAAAAATCGACCGCTTCAAGTCAGAGGGTGGCNGAAAACCCGACAG 

GGACTATTAAAGAATACCAAGGNCGTTTCCCCCCTGGGNAAGCTCCCCTCGTGCCGCTCT 

TTCTNGTTTCCCGMCCCCTGTCCGCTTTANNCGGGAATACCCTGGTCCCNGCCTTTTTC 

TTCCCTTTTC 

Sequence 2673 

CCGCGGTGGCGGCCGAGGTACGCGGGATGTGGTCTAATCAAAGCCATCTCAATTTGTAGA 

TGAAGAAGGCAAGGACTAATGACAAGAAATGAATTGTTGGCCGGGCATGGTGGCTCACGC 

CTGTAATACCAACACTTTGGGAGGCCAAGGCTGGTGGATCACCTGGGCTTGGGAGTTCGA 

GACCAGCCTAACCAACATGGAGAAACCCCGCCTCTACAAAAAAAAAAATACAAAAATTAG 

CTGGGCATGATGGCGGGCGCGCCCCCTAATTCCAGCTACTCCTGAGGCTGAGGCAGGAGA 
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ATTGCTTGAACCCAGGAGGCGGAGGTTGCGGTGAGCTAAGATCGCACCATTGCACTCCGC 

ACTCCGGCCTGGACMCAAGAGCAAAMCTTTTGTCTTAAAAAAAAAAAAAAAAAGTACC 
TGC 

Sequence 2674 

CCGCGGTGGCGGCCGACGNACCNGGTNCTCTGTGGGATACNCATTAGAGTTGCTCGNTGG 
ANATGGMTGATGGTGGGGTGCATTTNANCATGGCTGANTGCTTTCTGCTTAAGGACCTG 
ATGTATTAATGCTCTCCANGTCATTCATATTTGGGGGAAGGAACAAAGAAGGGTACCTN 
Sequence 2675 

TAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGTAGCATGATCTTGGCTTA 

CTGCAACCTCCGCCTCCCTGGTTCAAGCAATTCTCCTGCCTCAGCCTNCCGAATAGCTGG 

GATTACAGGCATGCACCACCATACCCGGCTAATTTTTGTATTTTTAGGAAGAGATGGGGT 

TTCACCATGTTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATTCACCCGTCTTGG 

CCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCCGGCCTCCATTACACCTCT 

TTAT7CTAGTTCAACTCAGACCGTGAAGTTAGCATACAGGTCCTCAGGAGTTTGAGGCCA 

CTTTCC MGG ATMGGGCCACCTTCMGGGCACATCTTGCCCTTAMCMTTAAATTTNT 

GAAAGCTTTTGGGAAAAGGGGNNGAACATGCCTTTT 

Sequence 2676 

ACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCCGGCCGCCCGGGCAGGTAC I 1 1 1 1 1 1 1 I 

1 I I I I I I I I I ! I IGAGATATAAATTGGTATGTTCACATGTTGGTAAGAAAGATCTAATAG 

CTAGAAAAGGAACTGCTGAAGTTGCAGGAAAGAGAAGAGAAATNTAAGGAATGAAACCTG 

AAAAATAGAGGGCAGAGTCAGAGGGGTGGAGAAGGAAGGGGCCCTGACGGGAGGNGGACC 

ACTTTTTNCAAGGCACATGCAGGCAGGCACCTNTNTGGAGGGTCCCATGGCTGGCTATGT 

GTGGCAGTGCATGAAATAGTCACTGAATATGCTCCCTTGCTGTCCTCTGCTTGTGCTCAC 

CGAGGAGTGACTGTGTCTACTGGAMCAAGTTTGCCTTTTCACTGGTCTAAATAAANGGA 

TCAATGACCTCAAATCTAAAGAGCATTAGGAAACCTTNTNAAGCCANAACTGTACCCTTG 

GCCGNTNTAAAACTAAGGGGGATCCCCCCGGCTGCAG 

Sequence 2677 

GGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGGGCTGCGTGCGGCGGG 

AATCATGGCTGCTCGCAGAGCTCTGCACTTCGTATTCAAAGTGGGAAACCGCTTCCAGAC 

GGCGCGTTTCTATCGGGACGTCCTGiSGGATGAAGGTGGAGTCTTGCTCTGTCGCCAGGCT 

GGAGTGCAGTGGCGCGATCTCGGCTCACTGCTCTGACTACACCCGGATAACAGAAGACAG 

TTTCTC AAAG CCTTATGATGGGAAATGGAGTAAAACAATGGTGGGATTTGGGCCTGAGGA 

TGATCATTTTGTCGCAGAACTGACTTACAATTATGGCGTCGGAGACTACAAGCTTGGCAA 

TGACTTTATGGGAATCACGCTCGCTTCTAGCCAGGCTGTCANGCAAACGCCAGGAAGCTG 

GMGTGGCCACTGACCGGAMGTTGCANGAAAGGTGTTTTTTTAAACCCAGG 

Sequence 2678 

NCCCGGGCAGGNAC I I I I I I I M I I I 1 1 1 1 I I I I 1 1 I I II II I AATGGGGATTTGTCTAT 

TAAAATAGGTTGTNGGAAGNACCCCANAGGACCCTTNTTGGATCAGGAATCAATAATCCC 

ACCNAGACGGGGNCCCAGCATTTATTGGGNCGCAGCATAAAACTTGGCCACCTCCTTATT 

GGGGNTGCCCTGGGCTGAGTCAAACCACATTTGGATGCANCnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnCTGCAGGAATTTCCATTATTNAAAGCCTTATCGGATNCCCGTTCGAC 

CTTCNAGGGGGGGGGGCCC 

Sequence 2679 

TAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGAGGTACTAACANGGGTGTCTGGTCTT 

AGAAGCCTCCCTTCAGATCCCAGCTGACCCTGGTGACTGCCTGGCCTTGATGTTGGCTGC 

AGCCTTCTGATAGAACCACATGGATTCCACCCACAGCTGGCCAGGCTTGTTACATGGGTC 

AAGGGAATACAAATGGCCCCCCCCCAGGGAGCAGGTGTTGGCCTCAGTTTTCAGGGACCC 

TTGGTGTTGCTCCTTACCTAGAGCCCATTAATCTACCCCATNAACTCTCTTGCCATGAAA 

AGCCATCTTCCAGGAGCCCTGTTTT 

Sequence 2680 

AGGTACTCAAAGACGMTCATGAAAAAGAAAAAAACTTTATTTCAAACAGGTTCAGTGAT 
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•ATATGTGGGTGCTNCAGCAMGGCTGGTTGTGGCAAAGTTTCATTTCAAACTGTATGATG 
TGGGCTGGGCAAGGTGGCTTCACGCCTGTAATCCCAGCACTTTGTGCCCGCGTACTCAGC 
TGTGTTCATGTGGNGGTCTGTGGAAAGAAAAGAAGACTCGTTTGGAAATGAAGCTGTCCC 
TTTCCMGCAAGTCTTCTGGTGGCTTTTCTTCTCTCAAAAATGGGATCCCGATAAAATAT 
TTGAATAGGAGCNGAATTGGTAGAAATGTTCGTGCCTGTCACCCCAGAAAAGCCTTGCCT 
GGTTTT 
Sequence 2681 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTGGTTTAAT 

TTCACAGTGAGGGAGTCGGTTGGTGATCTCTAAGAAATCCCCAAGCACCTGGTGTTGGGA 

AA GTCCCT CAAATAAAGAAGTGTTCTTTTC I * I 1 1 I I 1 1 CCTCTCTCTCTTTCTATTTAT 

TTATTTTTTGAGATGGAGTCTTGCTCTGTCACCCAGACTGGAGTGCAGTGGCAGGATCTC 

AGCTCACTGCAACCTCCACCTCCCGGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAAT 

AGCTGGGATTACA GACAC CCACCACCACGCCCAGCTMTTTTTGTATTTTTAGTAGAGAC 

GGGGTTTCACTATGTTTTGTCAGGCTGGTCTCGAACTCCCAACCTCAGGTGATCCACCAC 

TTAACCT 

Sequence 2682 

CCTCAGATTTTGGGCCTAGGAAGGTAGGTGATTTAAACTCACTGAAAGCATGTACACCTT 
GCTGTTGCTGCTTGCTGCCACTGCTGCTGTTTCATCTGTTCCTGCTGCCGCTGGAATCGT 
GGAGGTAAAGACTTCTGAAACTGTTTGAAATAGCCAGATAATACAGC 
Sequence 2683 

G CGAATTGG AGCTCCCCGCGGTGGCGGCCGAGGTACTTTC I MUM I CTTTTTCTTTTT 

C M II M M I AATGTGAGACAGGATCTCATTCTGTTGCCTAGGCTGGAGTGCAGTGGCGC 

AATCTCGGCTCACTGCAACCTCTGCCTCCTGGGCTCAAGCAATTCTCCCACCTCAGCCTC 

CCAMTAGCTGGGATCACTGGCACAAACCACCATGCCCAGCTMTTTTGTATTTTTTGTA 

GAGACAGGGTTTCACCATGTTGCCCAGGCTGGTCTCAACCTCCTGGGCTCAAGCAATCCT 

CCTGCCTCGGCCTCCAAAGTGCTGGGATTACAGATGTGAGCCACCGCATTCAGCCCCACA 

CCCTTATTTATACCMTTACCTGCCCAGTAACTGNGGACTTTTGCTTCCTCACCCTGTTT 

TGATCGCTTTAAMCTAAGTGGGATCCCCCCGGGCTGCAGGAATTTCGATATTCAAGCTT 

ATTCGAATACCCGCCGCCTTCGANGGGGGGGG 

Sequence 2684 

GACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTTC 

TTTTCTTTTCTTTTC I I I I I I I I CI I I I I I I I GAGACGGAGTTTTGCTCTTGTTGCCCAG 

GCTGGAGTGCAATGACATGATCTCAGCTCACCACAACGTCCACCTCCCAGGTTCAAGTGA 

TTC TCCT GC CTCAG CCTTCCCAAGTAGCTGGGATTATAGGCATGTGCCACCAAGCCTGGC 

TMTTTTCTATTTTTAAGTAGAGATGGGGTTTTCTCCATGTGGGTCAGGCTGGTCTTNGA 

ACTCCTGACCTCAGGTGATCCACCCACCTCGGGNCTCCCAAAAGTGCTGGGATTACAAGG 

CCGTGGAGCCCACCGNACCCAGCCAAAGGCCAAAATNCTNAAGTCTTTTGN I I I I I II M 

TCAMTTGAGGGGTNGTTATTTAACAACAAGCTTGGATCATTTGAGNCCMTTNTTTTGG 

TTCCCCGTTTCCCCACCCGCCAATTCN I M I I II M I C 

Sequence 2685 

AGGTACMCATTTAGAGMCCTTAGCTGCCCAGAAAAACTCAGATTTTCTGCTTTACAAA 

AGMTAAAAATCATCGMTTTATTACCCTGGACTTTATTGGAAATCAGTGAAGAATATTC 

ATACCAMTACCAGGTTTCACGMCNTTNCCTCTCTCTCTTTCNTTCAAGGTAAGGGGGT 

TTGGNCAAAAGGNTTGTTTNTTTTTNGAAAANGCCGCCCNGNTTGGNNCNAGGTGGNNCC 

TGT TTAAG GGTTGGGCNNAC GGGG GGTGGTTMGGAAAAMTTCAAATTTAMNCNTTTT 

TATATTTTCCCGGGGGCTGNGTTTTAANCTTTAAAACCCCCTNGGTTTTNCCAAAGGGGN 

MTTCCCAAGGTTTTTTAAAAAAAGGCCNN I I I I I I I I I 1 1 GGGGACCCTNGCCCNGGNC 

GGGGCCCT TTTANAA AAAMGGGGGGGNTNCCCCCGGGGGGTGGGANGGAANTTTTGGAN 

NTTTMGGCTTTTTTGGGTANCCGGGGAACCCTTTNAGGGGGGGGGGG 

Sequence 2686 

CCGCGGTGGCGGCCGAGGTACTCCAACCCAAGCAACAGAGCAAGACCCTGTCTCAAAACA 



• 
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AMCAAAACAAMCAMCAAAAAAMTGAGGTAGGCATGTTTTATTCCCATCTTACAGAT 
GAGGAGACTGAGGCAATAATAATTCAATGGCTTATCTAAAGTCACAAAGCTAGTAAGGAG 
CAAAATCCAGTTNTGTCTGCTTCCAGCCCACCTTGTCCCACTTGCTTCTTTATT 
Sequence 2687 

TACTATAGGGCGM TTGGAGCTCACCGCGGTGGCGGCCGCCCGGGCMGGTACTTTTTTT 

I I I I II I I I I I I I 1 1 I GGGTATATAAACTAT7TATTAACAGACAAGGCCTACAGACTTAT 

TTCTTCTTGGACACACCCACGGNGCGGGCACGGGCGGNNAAGTNGGTTTTGGTGTGCNTN 

GCNTTNGNACACTAAAGGGCCCAAAMGTTGACGCANTCCCTTTTNTGGGCCCNAATTTT 

TTTMNT CNTTC NAAGGTTTTTNACGGGAGNTTGTTGTCCAGACCATTGGCTAGGACCTG 

GCTTGTATTTTCCATCNTTTANATCCTTNTTGTTCTGTTCAAGAACCANTCTGGGGATCT 

TGTACCTNGGNCCGCTNTAGAAACTAGTGnnnnnnnnnnnnnnnnnnnnnnnTGGATATT 

AAAGCTTATTCGANNACCCGTCCNNCCNTTCAANGGGGGGGG 

Sequence 2686 

AACACTACTTAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAGTG 
GAGAAAGAAGGGGGACCCCACCAGGGCTATGGAGAGACAGTAGAGGCAGGACTGAACCGT 
CAGCAAAGATTAAAAGGATGACCTGAGGCTGGCAACCACAAAACC 
Sequence 2689 

CACTACTTAGGGCGA7TGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACAGAGTCT 

TTTGCTTCCTCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGGAAGTTGTCTCTGA 

AACCCGGGGACAGAGGACGCAGGACAGACTANGAGGGAGCCCGGGAGGATGGGCTGCANC 

TGTGGAGGAGGGTTTTAAAGGAGAGAGGGTCGGAGAGCAAAAGGCCTNAANAAGCCCAIVIA 

AGCAAGTTGAGAGAGGGTGGAAAAGTGAACCACCGGCTTGGGCTTGAACCCGACACGCTT 

TTCTTCCMTGGTTAAAATAGCCACCTTTTAGAAAAAAATTTNACAAAGGNTCCCATTCC 

NCNAAAAAAAAAAAAAAAAGGAAAAA 

Sequence 2690 

TATACGACTACTATAGGGCGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGACG 

GGAAGGATTCCTGCCAGGGTGATTCTGGGGGTCCGCTGGTATGTGGAGACCACCTCCGAG 

GCCTTGTGTCATGGGGTAACATCCCCTGTGGATCAAAGGAGAAAGCCAGGAGTCTACACC 

AACGTCT NGAG ATCACGAACTGGGATCCAAAAAACCATTCAGGCCAAGTGACCCTTNACA 

TGTGACATTTTACCTTCCGACCTACCACCCCACTTGACTGGTTTCAGMCGTTTTTNACC 

TAMCCTTGGCTTTCCCTTNTNTNNTGGCCAGNTTTTNACCCTGATGCGTAAAAAACGCA 

ACCGACGTGAGGGGTCCTGANTNTCCGTGGGNTTTACCCCAACTCCATNCTTNGATTAAT 

GGGGG 

Sequence 2691 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGGGCACGGTAC I I I I I I I I I 

Tl I II I I I I I I GGTAGGAGATGAGGTCTCATTATATTGCCCAGGCTGGTCTCAAATTCCT 

GGCCTCAAGTAATCCTCTCGCCTAGGCCCATAGTATTGGGATTACGGGCGTGAGCCACTG 

CGCCCAGGCTTTACTAGTTTTCCATCTATCTTTAGGCCTNCTCANATTTCTTGTGGGGCT 

CCACTTCCACCAGTCACACCTTTAA 

Sequence 2692 

TACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTGCACTCAAA 

AAMTTATTTCTCAATTTGTTTGCTCATACTATTG A I 1 1 I I I I I CTAGAATCTAAATAAT 

TGTGAATTAACTATGATAGCAGTATGCACAAACAGTAAGTGAAATCAGACCTATTAATTC 

TGAGAGGAAGGAGGTGTCAGGATTTTCACAGAAGAAGAGCACTNAGGCCAGGCGCAGTGG 

C TTAT GCCTGTAATNCCAGCACTTTGGGAGGCTGAGGCAGGCNGATNAAGGNGGNCAGGN 

GTTTTANACCAGCCANCCMACNTGGGGAAACCCCGNTTTTTANTAAMNTCCAAMTTT 

CCCCG CCTGTANTCCC AGTTACTN AG N GGGG NTTGGG NCAAAAAA I I I 1 1 1 I IACNCCCG 

GNGGGNGNGGGTTTTNCAGAGNNTATAMTCGCGCGNTTNGCTTTNAAATTTTNGGGNAA 

AA AAG A AAC G AGTTC N CTTTTG AAAAA A AAAAAAAAAAAAA AAN G G 

Sequence 2693 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACATAAGCCTAA 
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ACAATTTCACCTANGTAAAATATTGATGTCATAACCAAACTATATGGCCCCGTTTCATAA 

AGGTTACTATATTCTATAGAGAGTGAAGAGGTGGCCTTTCTATCCCAGCTTACCCTATTC 

TTGTTATTGTTCAAATTCTCCTGAAGCTTGCATAACTAGCTGCCATCAGGTAAATGCTAT 

TGGCTAGCAGAAGACTGCAGTTCTGTTAATATTAGAACCAGCAGGGGGAACTTGGGAACT 

TGACATTAAAAATCTAGAAACGGAATTTTA 

Sequence 2694 

CCGCGGTGGCGGCCGCCCGGGG AGGTAC 1 1 I I I I I I I I I I 1 1 I I I 1 1 I GGGCGGGGGTCT 
TTATTTGAGTTTAGGCATGATTCGAATGAAGAGGATCATGCTAATGAAGATGAAGCAGAC 
GATAATGAGCGTGGCCCAGAGCAGCCAGTTGACTGACT 
Sequence 2695 

GAGCTCCCCGCGGTGGCGGCCGAGGTACACCTGTGGTCCCAGTTACTCCAGAGGCTGAAG 

TGAGAGAGTCTCGTGAGCCCAGAAAGTTGAGGCTGCAGTAAGCTGAGCCATGATTGCACC 

ACTGCACTGTAGCCTGTCTACAAACAAATAAACGAAAAAACAAAAAGACTTGTGAAAAGT 

GCTGATTTTTAATTAGGAAAAGATTAAACATTGGATAGTCATGGAATTGTTTACTGAACA 

TTAGAAATTGGTTGCAAGGGTCTATGCTTCTGTAAAATAAA 

Sequence 2696 

ACTACTTAGGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACATATATGAATCACTA 

AGAGTTCAGAAATTAAGCTCACTTTAAGAAAACTCAGCCAGGCACAGTGGCTCACGCCTG 

TAATCCCAGCACTTTGGGAGGCCAAGGCGGGTGGATCACGACGTCAGGAGTCCAAGACCA 

GCTTGACATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCGAGCGTGGTGAC 

ACACGCCTGTAATCCCAGCTACTCAGGAGGCAAGGCAGGGGAATTGCTTGAACCAGGGAG 

GCGGAGGTTTGCAGTGAGCCCAGATCGCGCCATTGCACTCCAGCCTGGACAAGAGAGCGA 

GGACTCTGACTTCCAAAAAAAAAAAGGAAGAAAAAAAAAAGTTCCCTTGCCCGGGCCGGN 

CGNTTCTAAAAACTAGGNGGGNTNCCCCCGGGGCTTNNANGGAATTTNGATTATCAAAGC 

TTTNTCGANNNCCCGTCGACCCTCGAGGGGGGGGC 

Sequence 2697 

CTACTTAGGGCGAA TTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC I 1 1 I I 1 1 1 

1 I I I I 1 I I I I I I I I I IGGTANGGGACCGGGGTTTTACCATGTTGGCCAGGATGATCTCGA 

TCTCCTGACCTCATGAN CTGC CTGCCTCGGCCTTCCAAAGTGCTGGGATTACAGGCATGA 

GCCACTGNGCCAGGCCTTTTGTCCATTTTTTATTGMCTGCCTATTNCTTCTTACTGATT 

TGTAGAAAGTCTTTATTCTAGTCTGGCAGGTATTTTIsrTTTCMCACTTCCAAGATT 

ATTGGCTTTGGNAGCTMTGCTTCATTTGANAAAATGCTACCTGTATTATTGATGCCCAT 

GTGA 

Sequence 2698 

CACTA CTTAGGGCGATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACAATTTCT 
TTTTTCI I I I I I I I I I I I I I I 1 1 I M I CTTTTTCTTTGAGACTGGGTCTCGCTTTGTTGC 
CCAGGCTGGAGTGGAGTGGCGTGATCTTGGCTTACTGCAGCCTTTGCCTCCCCGGCTCGA 
GCA GTCCT GCCTCAGCCTCCGGAGTAGCTGGGACCACAGGTTCATGCCACCATGGCCAGC 

CNACTTTTGCATGTTTTGTANAGATGGGGTNNNACAGNGTTGCCCAGGCTGGNCTTAAAC / 

TCCTGGGCTCAGGCGATCCACCTNTTTTANCCTCCCAAAGTGNTGGGATACAATTGNGAG 

CCAC CACGTCCAGCTGGAAGGGTCAANAATTTTTTACATTTTTNGNAGCACAATNTGGAT 

TTTT CACCCANCC TTTCCCCTTCTTTTTCCC^ 

NCNNTTTTNTrrTANAAMAAAAAANCNTTTTTTNNCCNAAAAAAAA 

Sequence 2699 

CTACTATAGGGCGM TTGGAGCTCNCCGCGGTGGCCGGCCGCCCGGGCAGGTACTTTTTT 
n I I I I I I I I I II I I IGI I I I I I I I 11 I I I I I I 1 1 1 I CCGACCAATATGGTTTATTTNTG 
CCCCAGCCAAGCTTNTTTGGACCCTGGCTGGGGGGAAAGGCACCCCAGGCACCGGCAAGT 
TCCAGTCATTGCAGATCCTCCAGGTNTAGGNGTGACTGGAAGTANCCTGGGCACTGNTGC 
TGGACCGTNGGATTCTCCTTCCTTNTTCCGCCGGCGGGTGGTCACCAGGACACCGCANAT 

CAGGCATGTGATGAGTCCCAGGAGTCCTGCCAAGCCGATGAGGATGACAGCCCAAAAGGG 
AAGGT 
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Sequence 2700 

A CTTAG G GCGAT TGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTCAGTGGATGCAAATGA 

AATTTTATCTTTTGAAAGTAAGAGGAACTATGGCTGGGCGGGGTGGCTCACACCTGTAAT 

CTCAGCACTTTGGGAGGCCAAGGTNGGGCGGATCACCTGAGGTCAGGAGTTCAAGACCAG 

CCTGGCCAACACAGTGAAACCCTGCCTCTACTAAAAATACAAAAATTAGCCAGGCGTGGT 

GGCACGCTCCTGTAATCCCAGCTACTTGGGATGCTGAAGCAGGAGAATCTTTTGAACCTG 

GGAGGCAGAGGTTGCAGGGAGTTTGAGATTGTCGAGTATAGTGGATTGAGTGCTGTGGAG 

CCGAGATTGCCAAGTGCAGTGGATCGAGTCCACTGCACTCCAGTCTGGGCAATAGAGCTA 

AACTCAGTCTNAAAAAAAAAAAAAAAAAAAGTACCTGCCCGGGCGGCCGCTCTAGAACTA 

GGTGGATCCC 

Sequence 2701 

CACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCTTTGCTCACCAA 

GCTCTTGAGGCCMGCTTGATACGGCTGTTGTTTTTTTCTACATCGTAGCCAGCAGCCGC 

MGCGCTTTCTTAAGCGCGGCCAGAGAAAACGCCGCTGCGCTCCTTAGAAGCTGCCACTG 

CCTTGGTGATMG CTCAGA TACTGGGGGTCCGGATGCTTTGCGTTTCCCAGCAGTTGCGC 

CTGCCTTCTTCGCCTTTTTCTTCACMGGTGTTTTTTCTGCGGGTGCAGGAATGGTAGGA 

GCMGTGGAGCAGTCTCCGACATGTTTTTGTCTTCCCAGAAAAGACAATAAGTAATCTCA 

AACTGTCAGAACAGCATGTCCCCCGCGTACCTGCCCGGGCGGCCGCTCTAGAACTAGGTG 

GATCCCCC 

Sequence 2702 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGAGGAAGGGGAGG 

GCTTGGTGTTCCCTACGCGAAGGTCTGACTCAATGCTTATTCCCTCTTCTTCCTCTCTCT 

GCAGACTGAAGAATTGAACCGGGAGGTCGCTGGCCACACGGAGCAGCTCCAGATGAGCAG 

GTCCGAGGTTACTGACCTGCGGCGCACCCTTCAGGGTCTTGAGATTGAGCTGCGGTCACA 

GCTGAGCATGAAAGCTGCCTTGGAAGACACACTGGCAGAAACGGAGGCGCGCTTTGGAGC 

CCAGCTGGCGCATATCCAGGCGCTGATCAGCGGTATTGAAGCCCAGCTGGGCCGATGTGC 

GAGCTGATAGTGAGCGGCAGAATCAGGAGTACCTGCCCGGCCGCTCTAGAACTAG 

Sequence 2703 

CCGCGGTGGCGGCCGAGGTACGCGGGTGTGAGGAGGTGGGGAGACCACCCCACCCCCATG 

TCCACCATGACCCTCTTCCCACGCTGACCTGTGCTCCCTCCCCAANCATNTTTCCTGTTC 

CANANAGNTGGAGCTGAGGTGTCTCCATCTATGNCTCAACTTCATGGTGCACTGAGCTGT 

AACTTCTTCCTTCCCTATTAAAATTAGMCCTGAGTATAMTTTACTTTCTCAAATTCTT 

GCCATGAGAGGTTGATGAGTTAATTAAAGGAGAAGATTCCTAAAATTTGAGAGACAAAAT 

AA ATG G AAC AC ATCA A AAA AAAATT AA AAAAAAAGT A 

Sequence 2704 

TNAGGG CGAA TTGGAGCTCCCCGCG GTGGCGGCCGAGGTAC I I I I I I I I I I I I I I I I I I I 
MINI! CTTTCTT I I > I I i I 1 1 i I I I I I I I 1 1 I I I 1 1 I I I I I I i I N G CATC AAA AAG CT 
TTATTTCCATTTGGNCCAAGGCTTGTTAGGGATAGTTAMMAGCTNCCTNTTGGCTGGN 
GGGAGAGGCTTAGGCANAANCCCTNTTACTTTGNANGGGGCCCTT 
Sequence 2705 

CCGCGGTGGCGGCCGAGGTAC I I I I I I 1 1 I I 1 1 M I I I I 1 1 1 1 GTTTGAGACAGAGTCTC 

ACTCTGTCACTGAGGCTGGANTGCAGNGGCGCGATCTGGGCTCACTGCAACTTCCACCCC 

TTCGNTNTAAGTGATTCTNCTGCCTCANCCTCCNAANTAGCTTGGATCACAGGCGCCCGC 

CACCACACCCGGCTMTTTTTGTATTTTTAGTAGAGACATGGNTTCACCAMTTTTTAAA 

GAAAAATAAAGGTGCATGATCAACAATCAAACTTNTAGGACCGTCCCCTANCAGGAGAGC 

AGCAGCAGNAGCAGCACACANACCTGCCC 

Sequence 2706 

CACTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTGAGCTNACCACTT 
CTMGAAACTCCAANAAAGGAANCATGTGTNTTNTATTCTGACTTMCTTNATTTGTCAT 
AAGGTTTGGATATTAATTTCAAGGGGAGTTGAAATANTGNNAGATGGAGAANAGTGAATG 
AGNTTCTACCACTCTNTACTAATCTCACTATTTGMATTGAGCCCAAAATAACTATGAAAG 



# 
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GAGACCGAAMTTTGNGACAAAGGATTGNGAAGAGCTNTCCATNTTCATGATGTT 
Sequence 2707 

NCACTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGGGGAGTGGGGG 

TGAAGCGTGTTCTCTACATAGGCAACACAGCCGCCTAAGTCACAAAGTCAGTGGTCGGCC 

GAGGTACTTTGGTAAGGAGATAGGGAAGGAATTAAGGCTATTACTCTGAAGAAAGTTGGG 

GGGCCAGGGCTCCTATT7TTTTGCTGAGGAGATGGAAGATCAGGGCTTGTATTCAATAAG 

AATG GGAGGGGCCAGGGGATGCCTGGCAAAAGCCTTGCACTGTGAGGTGCAGGTAGAGGC 

TTTTATTCTGGTGAGAGGACATGGACTCTCTCTCTCCCCTCAGGTAACTGTGCCCTGTAC 

CTGCCCG 

Sequence 2708 

GM TTGGAG CTCC CCGCGGT GGCGGCCGAGGTCTNMTTNTTTTNCCCTTNNNNGAGACG 
ACCTTTTTTGGANATTTTT^ 

TAATTTGGNTAAAAGCTGATTATGTCGTACAATGNCAAAGTCAGCTAACTGNCGTCTACT 

TAAGACTTNTGGNCATTTCCAACTTATAGAGGAAGGGNGNCTCTAAAATCTCTTCTTCAG 

AAGGCACCTCACTTNTCANACTTAAAANNCCACATCAAGTGTTCCATTAAAAGAAGATAN 

GGCATTCTGAGTGCAAACAAATGGGGGCTTNTTAAACTA 

Sequence 2709 

AGGTACATGTCTCTTCTCCTCTGACTCTGGGAATGMGATTTTTCTACTCCCACAGGCTT 

GAACTCTCCTTATAGGGAGTGGTCTTNCACATGGCCCAAAAATCAAGAAGGAAGGTCAAG 

AATAAAAACCTCCCAAGGCTTGAAAACTAAGAGCTGGCACGTAAGTACCTGCCCGGGCGG 

CCCGCTTCTAGAACTAGTGGGATCCCCCGGGCTGCAGnnnnnnnnnnnnnnnnnhnnnnn 

nnnnnCGTCGACCTCGAGGGnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGTGAG 

GGGTTAA7TGGCGCCGCTTTGGCGTAATTCATGGGTCATTAGCTGTTTTTCCTGTGTGGA 

AATTGTTATTCCGCTTCACAATTCCACCACCAACATACCGAGCCCGGGGAGCATTAAAGT 

GTAAAAGCCTGGG 

Sequence 2710 

CCGCGGTGGCGGCCGAGGTACACGAGAANAGGAGACCCCAACACGCTTCAAGCTTTGAGT 

GGAG AGGACACAGCCTCTGCTGNGACAGGGAACANAGGGATGCGGAGACCCTGAAGATGC 

TTTTGGACAGTGGTCTGAGGTTGGGACAGTGGCAGGAGATACCATTCACCCAGGATCTCC 

AGGACAAGAGATCAGCCTGGCAGTTACATGTG I I 1 1 I I I I CAAACTGGTTGCCAGGTTGG 

CATGAGCGATGACATCAGGAGATTCCGACCTTTTCTTATTGGANGGGACCGGACTNTGTN 

GGAC CTGGGGAGTTNAGTTGGACAAANAAGAACCTTTTAAMGGGGNTTTCNNTTTTGI^ 

TTTTCCCCCCCNCNAAATTNTAAMNACCTTTTTTNTTTNCCCCMGGGGNCCAAAAAM 

CCCNGCCCCCCCCNNNNTNTTTTTTGNGGGGGANNGGGNNANTTTTTGGANAAAMTTTT 

TNTCCCCCCCCCCCCCCCC 

Sequence 271 1 

CCGGG CAGGTACCTTAT TACAT ATGATTTTTATTAGTTTCTGGAGGCAAANGGAATTT^ 

ATTTTMAATCAAMTCTATTTTAAAAGAAATAGTTNTCAAAAAGACAACNGATGACTGG 

GTGNGGNGGTGNGTGCCTGTAGNTCAGGCTGCTCGGGAGACAGAGGCAGGAGAACCACTT 

GAGGCCAGTTCAGTCTAGCCTGGGTAACATAGCAGGACCCTGTCCCTAAAATAATAAAAA 

M TTTA AAAACCACAATAAATGTGAGTTACAAAAAAAGTGTAACTTATGAAAAGGTCCGT 

MTTTTATTATTGGACCCTTTGTTAAGGAGCTCMNTMTTTTCAGGGAAGCAAGGGAGG 

TATCACCCATTTCTTGAGTTNAATACCAMTTTTCAMCCTAMTCTTTTAATTT^ 

Sequence 2712 

CCGGGCAGGTACCCCCTGACACTCCAGAGCTACTGAGAAATACTCCTTTGTCTTTCTGCA 
CTTGTAGCCAGGCTGAGCTTTGTGMAGMGATTGTGTATGTGTGTAC7TTCTTTTTTT 
Sequence 2713 

CCGGGCAGGTGCGCGGGTGGGTGACGCTGGTGAANTGGCCCANGGAAGTGCACATGTCTC 
TNCCTGCTCTTCCAGGGTGATTTTTTTGATGTTTGGCTCTTGGTCCTTGTTTCCACTGGC 
TTTCCATCCCCATGGGGCAGAAACAGTGGCTCCTGGGAGCAGAAAAGGAATTGAGGTGGG 
CAGGCAGMGAGCCTGGATTGCTCACTGTTTTGGGAAACTTACTTNNAGANNTANANAAG 
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ATCCNGGAAAACCCAAAANCCTTCTTGACCAAACCATTTTTTGGGG I I I I I 1 1 I I GNGGG 
CCNAAAAAI I I I I I I I I I I I AAAAAAAAAAAGNNNGTTGNGGANAAAAACCCCGGGGGGG 
GGGGGGGGTTTC I I I I II I 1 1 1 NGTNGGGGGGGGCCCNNAAAAAAANGGGGTTNAAAACN 
CCCTTTTTTNCCCCCCNNNNNNNNNNN I 1 1 I 1 1 1 I I 1 1 NAAAAAAAAAAAAAATTTTTCC 
CCNNCAAAAANNANNNNNTTTTTTNTTNNNCNNN 1 1 I II I I I 1 1 I I I NGGGGGGGGGGGN 
NNAAAAAAAAAAATN 
Sequence 2714 

CCNGGCAGGTACNCGGGGGCGCGTGAGCAGCTGCAGCGGCAGAGGCAGCATCCAGCGGCG 

GCGCCAGCAGTTCCAGTCCGTTGCTTTAC I I I I I I GCTTCACCCGACANTTAGTCNTTTA 

ATGCCCCGAAAGNAGGAAAAGGTCCTTCCCAGGAGGAAATAACNAGTAGGGGGNCAAAAA 

NGAATNGGGATTCCCCAAAAAAGGGTAAAACCTTAAAAAANCAANNNGAANGCNCCCACC 

AAAGGGAACGGGTTCTTGCCCCMGGAATTTGTTCAGNCGAAAAACCCTTNNCTTTTCCN 

ACCC C AAAAA AAACCCCTTGTNNANCCCCCCAANAAACCCCAAAAGGGAAAAAANAACAT 

TTTTGCTTTTAGGAAAAGGGMCCCTNNTGTTNGGNNAACCCCAAAAATTTTATTTCN^ 

GNNGGGGTTTGGCCCTTTAAAAAAGGGGGGAAAAAGNAAAAAGGGGAGGGGNNAAAAAAA 

AAGNNCCAAGGGGNAAAAAGGCCTTTTNGNAAAAAANGG 

Sequence 2715 

CAGTAAATGAACTAATCTACAAGCGTGGTTATGGCAAAATCAATAAGAAGCGAATTGCTT 

TGACAGATAACGCTTTGATTGCTCGATCTCTTGGTAAAATACCGGCAATCCCTCCNGCAT 

TGGGAGGGMTTTTGGATTTTCCAATGGAGGAATTCCTAANTAACCTGGTTTTGGGNAAA 

AAMACCGNCCTTTTTCAAAMAGGNANGGGNCCAAAAAATTAAMCCTTTTTTCCCCTT 

NGGTNGGGGGCCCCCCCTTTTTCCAMAAMTTTTTGGGTTCCTm 

CGNAAAGGGGGTTNGGGGGAAAAATTGGGAAAAGGGAAAAAAAAAAAAAGGGAACCCCCA 

NACCCCCCCCAAAI I I I I I I I I I I I GGGTTTAAAAGG AAAAAAAGGGGGGTTTGGGGGNG 

AAAGGAAANTTGNCCCTTGGGGCCAAACCAAGGGGGGGGAGGGGGGAACCCCCAANGNGA 

ATTCCAAAAACCCMAGGGGGGCCCTTTTTAAATTTTTAANGGAAMAAGGGAAAAATTN 

NGGAAAAACCTTTAAAAAGGGGGNGGNGGTTCCTTTAACCCCCCATTGGGAAAANTTAAA 

I I I I I I I I I I 

Sequence 2716 

AC I I I I I I I I I l t I I I I I I II I GTCAG AAAAACTCCTTTATTACCATCTCCCTATTACATT 

TCTATTCTAGGGTTAGTGTTMTCTCAGGGTCTTATTTCTTTAAGCACNGCNAAAAAGGG 

CCTTTGGTAGGAAAGTTAGGTGGTAMGGTTTTTCMTCC 

ANGGGGAATTGGAAAAAATTCCANAAAGGNCCATAGGGGNAGGCCNTNGTTGGGACCTTA 

GGGGANCAAATTAACCAGGTTTAGGGTCCTTAGGGGGGCAATAAMCTTTTCCCCAAAAC 

CCAACCATTTTTMGGGGGGGAAAAMAGGCCCCCCAAGTTTTTGGGGAAAACCCACCNA 

GGAGGCCTAATTAGGTTTTGGGGCCTTNCAAMCCCCCCTTrTAATTTTCCAAATTGGAA 

NCCAAATTTCCGGGTNGGTTTTTTTCNMCCAAATTCCMNTTTTTCCAAAAAAAAAGGT 

III I II I II I IGGGTNGGGGGNNAMAAGGTTTCCTTTCCTTTGGTTAAAGGNTTGGGAA 

NCCCCCAANCCCCC1TGGGMTTNMCCCTTNGGCCCCCTTGGGGGGGGGGMGGGTTTT 

TTAAI I I I I I I I ICCC 

Sequence 2717 

AGGTACI I I I I I I I I I I ICI I I I I 1 1 I I I I I I GGTGGGGGTGTATAGGTCTGGGGAGNGC 

CTTTCAGGTGCTGCTCCATAGACATGTGTGTGCCCTGTATAGAAACCGCCCCCCATTGTN 

TAGAAAAAACNNCCAGACNCTTNTCCCAAANAGNCNNGNTTCTTNACCCATGTAAANNGG 

AAGGNTGNGCGTAATTAGGAAAACCAAGGGCTCAACCANANAGGGCCAAACCAATTTAAT 

TAAAAATTCCGGTTCTTGMGACCCTTTGNCCAAMTGNATTACCTTGTCNCCCMTTTT 

ANAGGGGMGGCCCCTATNTANNTTTNTNTCTCTTTGTCTGGTGGGCCCCCCCCAACTTT 

CAAGGTTTGCCTTNGGNCTNCTTACCTTAAAANTNAGNNAAACCAANGGCGGNCCAATNA 

AAMANGGCCAAAAGGGGGNCTTTGGTTNAAAAAMTTAMAAAAAAAGGTTTTTANTTT 

TTAANCCAATTAAAAATTA 

Sequence 2718 
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TCGCGGTGGCGGCCGAGGTACANAGTCTTTTGCTTCCTCCCACCCCTAGGGGGAAAAACT 

GCTTTGTGCTTTGGGMGTTGTCTCTGAAACCCGGTNGNACAGATTGTACCGTCATTGTA 

CCAGGACCTTACNGCANCGGNNAANGCNCCGGGNGCAGGGCATTNGGGGGNCCTTGGCCA 

AGTCATTGNTTNGNGAMTGNGGMGGNGGGGATCTTTTACCAAGGAAAGGGGTATGGAA 

NGTTANGNAGGNTTCCCNGGTGTAAAGAAAAGGTCCCAATGTNATNAGGACCCCCCTTGG 

GMGTTAAAANGTCCCCCCATGGTAAGCGGTCCCAAGAGGGTTNTTGTANTNTAANGGGA 

AGNGGNGATNTGNGGGNAAAAAAAGGGATTGNNAAGANCCCAAATTTACCGNNGGANATC 

CCTTTGGGGNGGNGCCCCTTTGGNGTGGAAAGAGCCCNCCCGATTNTAAANCCAAACCCG 

TGCCTTA CTTTCCCISTTTCNCCNN 

TTTTTTTMNGGGNAAAAAAAAAAAAMTTTT^ 

CCCCAAANTTNCCCAAACCAAAAAAAAAAAAAAAAAAAAATG 

Sequence 2719 

GCCGAGGTACAAACGGGTTTCCCACCTTGCCAGGGATCCTGGGGACAACAGATGTAAAAC 

TCCTGAGTCTCTGTGTGTGTGCCTGAGTGGCCAGTCTGCCAGAACTCCACACAGCTCTGT 

GTATTGAACCCAAGGCCTTGGTGGCCTGGGCTCATGAGCAGGTCTCCTGATCCATGGATT 

GCAGAGATCCATGGGAGMGCATGATTTCCCAGGCAGGGTCGCACATTCACTCATTGCTT 

CCCTTGGCTGGGGATTGGGGTTCCTTTGGCTCTGTGCCACTCCTGGGTGGGCCATTGCCT 

CACCCTTCTTTCCTCAGTTCTTCGTGGGTTGTTTGCCTAGTCAGTCACAATGTGAGAACC 

TGGCGATTTCAGTTAA 

Sequence 2720 

AGGTACACTCGCCAGCGGTTTTGCCACAGGAGTGTACGGGAACAAAGGAGACAGGCTCAT 

TTATAATCTGACGCGGNCACCCTNCTGCTGCGTTCGGTTTCCATTGGCTGGGACNGNACC 

TCACCTTCTGTATTTGTCCCGACTGGCTAGCACTTAGAACTTTTTAAAAGAGGCAAAGGC 

ATACAGAGANCAAAGGAAGGAGGAAGTNACTTGTGGAATATTGAGAAAGGTAAAAACACC 

TTTAAATAAGGAAGAGGAACAGGCTATGACCTAATGCTTGTTNGGATCAGTATAAGCATG 

TTAGGGCAMTATTTANGCTAMTTGTGGGAGCTMGMCATAAAGTATATTGATTTTTT 

ATTATGGCTAGCA 

Sequence 2721 

AGGTACMTTTAATTTTTCTGCTTGCCCNGGAAACAAAGCTTCTGTGGAACCATGGAAGA 

AGATGAAAATGAGAC TGGC AAAGAACAAATGCTGAATCTGAAGAAGAGGACAACTTTGGG 

CAAAT AATCTGCATACTTTT AATTGGG AATAAGATGGAAAATATGAATGCTAAATCAAAT 

TTTTTAANNNATACACCACACGATACGACTCCCCGCGTACATCTTTGCTGTGGCTCACAG 

ATTGTTCCTCCCATTCCCCTTGCCGCTTTTTGCCTATCGATGGGTAGCAAGAGTCTTTGA 

AATAAGCCCATTTGAGCCCTGGATAACAAGGGATAAAGTGGAGCGGATGCACATCACAGA 

CATGAAATTGCCTCACC 

Sequence 2722 

CCGGGCAGGTAC I II I I I 1 1 I I I I I I I IT Cl I I I 1 1 M I I I i I U I I I I I M I 1 1 I I i 1 1 

I M I I n I I M I M M I I I I M U I I I j M M I I I (I I INTTTNl I I I I I I I I I I I IGGG 

GGGNAAAAMNNTTTTNNNAANANNNGANAAAAAANNGGGGGAACCNNNNGGGGGNNCNC 

NCCCCCNN AAAAAAA NGNNCCCCCCCCNNAANNNAAAMANNANTTTTCAAAAANNAAAA 

AANG GGAANTTTTTTANCNAAAAAAANNTTTNTNTAAANNNNNNTNTNNGGGGGNGGNGN 

TTTTNAAAAAAAAAAAAAAAAAAAAAA 

Sequence 2723 

GAGAANGACACCATGTGCCTCAGAAACTGCTCGGTCAGACGGTGATAGCGAGCCACGCAT 

TCACAGGGCCACTGCTGCTCACAGAAGCAGTGAGGATGATGCCAGGATGATGTCTGCCTC 

GCGCCTGGCTGGGACTCTGATCCCAGCCATGGCCTTCCTCTCCTGCGTGAGACCAGAAAG 

CTGGGAGCCCTGCGTGGAGGTATGTGGCTGGAGTCAGCTCCTCTGAACTTTCCCTCACTT 

CTGCCCAGMCTTCTCACTGTGTGCCCTGGTTTGTTTATTTTTGCAAAAAAAAAAAAAAA 

AAAAGTACCTGCCCG 

Sequence 2724 

CTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTCCAATGA 
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ATGGTMCTGATCCAGGCACGTTATCACACTTCCTAGTCATCTCCACCTTTCCTGTATTG 

CCTGTGGCTTGTTGTTTAAGATTAAGAATCAAAGAGATTAAGAAGTATCACTTCAAGTCT 

TGCTCTGCTCACTTCTATG7TTGCAGTCAAATTATTCCTTATGTTGGTGACCTAAAGAGA 

ATTACTTTCATTCATTTCATTTCCCCCGTAGCAGATGGAAGTGAGAAACCTCTGAGAAAA 

TGAAMCATCCTTMCCACTATCTTTCCCTTTrAmGATTATTTTATG 

CAMAGTTTTTTTCTCCTCCTTCTCTTCCTTGTTGCTTMCTTTTTM 

GCAGATATCCAATTATGTGCATCCTGTGAATAAACCACGTCTTGGTCACTGTCATATTTT 

GAACCATCTCATCAGAGATGAATAATA 

Sequence 2725 

CCGCGGTGGCGGCCGAGGTACGTATTACTGTTTCCCATTATATGTTACAAATGAAATGAA 
CAC AT TCTCATAAGTT AAAAAMTATAGAATATATATTTTTCTTTTTCCTTTTC 1 11)11 
TTTTTCTGAGACAGAGTCTCGTTCTGTCACCCAGGCTGGAGGCTAAGGTGGGAGGATCAC 
TTGAGGCCAGGAGTTTGAGATCAGCCTGAGCAACATAGTGAGACCCCCATCTCTAAAAAA 
AAAATAAGAAATAAAAATCAGCGACGAGGCATAGCGGCTCATGCCTGTAATCCCAGCACT 
TTGGGAGGCCAAGGCAGGCAGGTCGCTTGAGGTCAGGAGTTTAAGACCAGTCTGGCCAAA 
TGGGGGAAACCCCTTTTTTCTTTAAAAAANTNCC I I I 1 1 1 1 1 CCCCCCCATNGGGGGAAN 
NCCCCNTTTTTCCCCNCCCNCCNNNGGGGGGGGGGGGGGGGGAACCCCCCCCCCCCC 
Sequence 2726 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC I II I I I 1 I I II I I I 1 I I I I I I I I CCAAAACAA 
AACATGCTTAGCATGCACACTTTTACCAC I I I II I CGAGNGGAAAGTTTATTGGCAATAT 
TAMTTTCACCCTAGATAGGATATGAGMTGTTTTGATAMTCACAATTTATAGTATATT 
AATGCCATGTGAGAATTTTGTTTCCCAAGTAAGAGCTCACATGGAACTTGGTCATTAAAC 
CTTAAAGAAACCTTTCTCACATATCTATAGGCCTCAAATTGAAATAATCTATAAATGAAT 
TTGTAG ATTTC I I I I I AGTTTAATTCCTGAGTATACAGGGCAAAAGCTTATNTCCTTTAT 
ATAAACTTCTGCTTTGGTCTAAAACTGATATATCTTCACGTTGAGGTTTCATCTGAAATG 
CACCACGTTTGCTGACTTGCTTCAATATGAATTTGTATGGCTATAAAATTGNGC 
Sequence 2727 

CCGCGGTGGCGGCCGAGGTAC I I 1 1 II 1 1 1 1 I I 1 1 I II I I I 1 1 II I CGGGACTTTAGATA 

GCTGGATTATTCTTCTTTTCTCTTTTTGTCTTTCATGTAGTTATTAATGAGATGGATA 

AAACCAACTACTAGGTNCATATCCCCAAAAAATGAAGTCAATATGTTGAAGAGATATCTG 

AACTCTCATGAtTATTGCAAGACTATTCACAATAGCCAAGATNGGGAATCAACCTAAGTA 

TCCATCAACAGATGAGGNGATAAAGATAATGTAGCATATNTATACAACAGAATNTATTCA 

GCCTTAACANCAACAAAATAAATNTTGTCATTTAAGACAACACAGATAAACCT 

Sequence 2728 

GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACACAACTCCACTAACCTGGGAACCTT 

GCCCCCATCCCCCATAGCANCCACAGCGAGACCTGCCCAAGGAGAGTCTGAGCTCAGACA 

TGCCTAGCCCTGCCCCAACTTGATGGGCCTTCCTATCTACCCTGGTAGCTGAAGGCAAAG 

GACATATACCCTTGGGAGTTCTAGGGCCCCGCCCATCGCCAGTTCCTCTCCATACTACCA 

CAGCTGATGCTCTCTGGGAAGTGCCACCTCCCAGCAGCAGGCCAATCAGCACAAAAATAG 

AACATTAAACCACCAAAGCTAANAACCCTCAGAGAATCCATTTACCCCCCT 

Sequence 2729 

CCGCGGTGGCGGCCGAGGTA C I I I M I I II II I I I I I I i CCCTGTATTTTTCTTTTTCCC 

GAGATGAAGTCATGCTCTGTTGCTCAGGCTGGAGTGCAGTAGTGTGATCTCGGCTCACTG 

CAACCTCCGCTTCCCAGGTTCAAGCGATTATCCTGCCTCAGCCTCCCAAGTAGCTGTGAT 

TACAGGCATGCATCACCATGTCTGGCTAGTTTTTGTATTTTTAGTAGAGACAGAGCTTCA 

CCATGTTTGCCAGGCTGGTCTCAAACCCCTGACTTCAGGTGATCCACCTGCCTCAGCCTC 

CTAGAGTGCTGGGATTACAGGMTGAGCGACCACACCTGGTTTGTTTCTTAAAAAACATC 

TTATATTTCTCTGCTTAACGNGCTCAATGTTGAACATGTGAMTATAATAACTTTTCATA 

ATCTCTTCTGATTCTGGCTCTATATCATCTTTGGGGNCTGGTTTTAANGGMTTCTAATT 

TTCTTGGCAAT 

Sequence 2730 
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CTATAGGGCGMTTGGAGCTCCCCGNGGTGGCGGCCGAGGTACACGAGTCTTTTGCTTNC 

TCCCACCCCTAGGGGGAAAMCTGCTTTGTGCTTTGGGAAGTTGTCTGTGAAACCCGGGG 

ACAGAGGACNCANGACAGACTAGGANNGGAGCCGNGAGGATGGGCTGCANCTGTNGAGGA 

GGGTTTCANAGGANAGNGGTCNGATAGCACCAGGCCTGAGAAGCCATAGCCTAGGTGGAN 

AGAGGTTTGAAAGNGACANAGCGGGCTGATTANCTGCCGTANACNCTTTNATNCCATGTT 

ANATAMCATNNTTNAAAAACCCTTCNTNTTGTTCCCAATNNTANAAAAATTAANCCCGG 

GCNATGGGGGGGCNTGCCCCCTNGTNATTCNCTNTTTTAATNAANAGAAAGGGNTTGNCC 

CCGGGTNANTTTTANTTTTNTNACTTTNNGG 

ANMTTGGGGNGNTNTTCAMNNNATTTNNTTGTTNNMTNTTTTTATATTN 1 1 I 1 1 1 1 I 
TTNANCTNNCCNTNNNAACAAAAI NIMH I NTNTNNCATATAAAAAAA 
Sequence 2731 

GGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTACTAGTTATTTTAAATTCCACTCATA 

ACTTATCGGCCAAAAGTAGTCACATGGGTCCACNTAATNACAAGNGGAGCGGGAAGTGCA 

ATCCTACCTTGCCTGGGGAAGGTATAGAGATAGACCAGCNCTAATGACTACCACACTTNG 

CTAAGGTNACATAATAAATAAGCATCAGGACATTATGTGTGGNGGCTCATGTCTATAATC 

CCAGCNCTT 

Sequence 2732 

CTATAGGGCGNATTGNAGCTCCCCGCGGTGGCGGCCGTGNGCNCGGAGNTGGTATTGACA 

TAGCCTTTGTAGAAACAGTGCTTGAGTTCCGCTTCNTCTTCGGAATAAAACTTGGTCTGA 

TTCACCCCGGGCGTCCCGAGGAGGGTGACAGTGAACAGTGGAGCGATAAATCCGGCATTG 

GCGGTGAGATTAAA 

Sequence 2733 

AGGCGGGCGGCNGCCNCNCNCCNGGTACCTGATAAAMTTTANTNCTCCTTGGCCAGGCA 

TGGNNNCTCACACCNGTAATCCCAGCACTTTGGGAGGCCAAGGCCAGCANGTTGCTTGAG 

CTCAGGAGTTTGAGACCAGCCTGAGCAATATGGCGAAACCCCATCTCTACAAAATATACA 

AACATTANCCAGGTGTGGTGGCNAACGCTTTCAGTCCGAGCTACTGATGAGGCTGAAGTG 

GGAGGATGGCTTGAGCCTGGGMGTGGAGGTTTCAGTTGAGCTTGAGTTTNATGCCATTG 

AACTCCAGCCTTGGGCGACGGGNAGACNCTNTTTTNAAAGAATTTGGAAAAAAAAAAAGG 

GAAAAAAAAAANNNNANGNCCCTTGGGCCGNNATTTNAAAAAANATGCNCCNCCCCCCCC 

GGGGGNnTTNNGAAATTTTNANTTTTANAAGTTTTTTCNNNCC^ 

GGGGGGGNGCCCCNNAANNNC I I 1 1 1 I II I I 1 1 ATTTAAGGGGGANGGNCCCCCCCCNAA 
AAAAAMNATNTTTNTTTTNTTTNTNNNCCCNTTNNNNANNNTANAAAAAAAAAAAAAGN 
GGGGGGGGGGGGGG 
Sequence 2734 

CCGCGGTGGCGGCCGAGGTAC I I I I I I I I M II I I 1 1 I I I I 1 1 I I ATGAGATGGAGTCTC 

GCTCTGTTGCCCAGGCTGGNGTGCAGTGGTGTGATCTCAGCTCACTGCAAGCTCCACCTC 

CTGGGTTCACGCCATTCTCCTGCCTCAGCTTCCCAAGTAGCTGGGACTACAGGCACCTGC 

CACCACACCCGGCTMTTTTGGTTTTGTATTTTTAGTAGAGACAGGGTTTCACTGTGTTA 

GCCAGGATGGTCTCGATCTCCCGACCTNGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCT 

GGGATTACAAGTATGAGCCACTGAGCCCGGCCTCTCTGTAGCTTTTAAGATGTTCTTAGG 

TGACTTATGAGMTGAAMMTGGAGAATTTTCCGTCTTCCTGCCATGAAATCAATTAAT 

GGCMTTGCTACTGAAAGCTGTTCTGTTTTT 

Sequence 2735 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I 1 1 I I M 1 1 I I I » I I IGG 

GGGGTTGCTATTATTGTTATTGTTTCTAGATGAATGTGNGAAGCCAAATTATTCAGTGTC 

TTAAAGACCATATTAAAAACTCCTGCCAGGCGCAGCAGCACGTGCCCATGATCCCAACTA 

CTTGGGAGGCTGAGGCAGAAGGATCGCCTGAGTCCAGAAATGCTGGGCTATAGTGCACTA 

AGTCAATTGGGTATCTGCACTAAGTTCGGCATAAACGTGAGGGACCACAAGGTTGCCTAA 

GGAGAGGTGAACCAGACCAGCCTGGAAACAGAGCAAGTCAAAACTCCTGTCCTGATGAAG 

TAGTGGGACTGCACCTGTGCATAACCACTGTACCTGCCCG 

Sequence 2736 
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GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGGACAGACGAGATCTCGAT 
CGAAGGCGAGATGGCGGACGTGCTAGATCTTCACGAGGCTGGGGGCGAAGATTTCGCCAT 
GGATGAGGATGGGGACNAGAGCATTCACAAACTGAAAGA 
Sequence 2737 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTATTAAAAAAACAGTTTAT 

GTGCAAGGTGTATAAGAAAAGTAAAACATACCTTTGGTAAAAAGATTATAAAGGGGCATA 

AGMTGTGGATTTTTACCTACATTAMGGGTTAAAMCMTTATTGTmAAMG 

GCAAGTTTTAAMCGTTMTTATAAAGMAATTCTGTGTGTAMCATATTAGCTAAAGTT 

AAAAAGGTATCATCCAGTTTTTCTGTGAACTGGACATTAAAGTAAAAAATGCCACAGGTT 

TTTCTTAMGCATCAACCTGCTCTTTAACAAAAATTATAAAAGGTTAAAAAGAGTCTATA 

AAATCTTACCTTATGGTCAMCATGAAAMTTGGATAAATATGTCTCAAGGGTTTATTAA 

AATTTCAGTTTAAMTTMTMCACACTAATATAAAGGTAAAATTTAGCTTATCTGGTAT 

AAAAATCATACNAGAAACA7TA7TAAATATNAAATGGGGT 

Sequence 2738 

TCTAGGGC GAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I 1 1 I 11 II I 1 1 I I I M I 

I I II 1 1 1 1 IGAGACAGNCTCCCTCTGTCACCTAGGCTGGAGNGCAGTGGNACGATCTTGG 

GTCACTGAAACCTCCGCCTACTGGGTTCAAGCAGGTCTCCTGCCTCAGCCTCCCCAGTCG 

CTGGGA7TACAGGCACATGCCACCACACCTGGCTAATTTTTGTAI I I I I AGTAGAGACGG 

GGGTCTNACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTTAGCTGATCCACCTGCC 

CTGGGCTCCCAAAGNGCTGGAATTACAGGCGTGAACCACTGNACCCAGCCCNTTGACCTG 

TTNTTTATTATTTTGNGGTTMTGCCAMTNTAAMTMTGTTTATGTATAAAGCCC 

NTCACAGGGGGGAANTTTTTTTAAAACAANAATTCCI II I I I I T I AAGGAAAAAAAGGTT 

TGTNTTTTTGTTGCANCCTTNTTTATTCNAANGGNTNANCTTTCANAAAAATGANTAATA 

ATTCCI I 1 1 I I ICC 

Sequence 2739 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACGAAGCCATCTTGGCT 

CTGTGGAACCAGCTCTACATCAACATGAAGAGCCTGGTGTNCTGGCACTACTGCATGATT 

GACATAGAGAAGATCAGGGCCATGACAATCGCCAAGGTATGTCCTCAGGGCCACTTAGGC 

TGCCTGGAGGGAGGGCAGCGCTGCCCCCCGCAGTGCCTGTGTCCAACAGTTCAACCTTCT 

TGCTGTGTAGCAGTGCTTTTGTGTCTCGTNAAGCAAGTCAGCTCACCCCTCCTTAGAGGT 

TCTGGTCTGTCCAATAGAGAACGGGNGGGATTAGCATATGGCTGATTATGAGAGAAAGAA 

GCMTNCTAATTTAGGGTGGCCTGACAAGCAAGCCAGAATTGCCTGTGGAAAAGTTATTG 

CACTCCTGTAAGAATTCTGGACCCTATTGCTCCTTTGATGTAATGGAAAGTTAG 

Sequence 2740 

TNTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACGCGGGAAGTGA 

ACTGAGGGCCACCCTGGGAGGAAGCCGACTAGGCGAATTCACTTACTGACCGGCCTGGGC 

TGCTCTGAGACATGGAGGAAGCCAGTGAAGGTGGAGGAAATGATCGTGTGCGGAACCTGC 

AAAGTGGGGGTGGGGGGAGTTAANANTATTATCCCCANATTGNGGGGCGGGATCCTGCCC 

CGGGGGGAAAACTTTGGAACATCTCCGCAATAAGACAGA 

Sequence 2741 

GGGCANGGTACCCACTTGGATGACTGGGGAGAAGGGCTGGGCTGCTGCTGGGGAGTAACA 

CAGGCCTTGGGGCAGGGTTCAGGAGTTCATTAGTCTGGAGTCCAGATCGCCACCCAGGGC 

CCAGCCTGATGTAGTGTTCGCGTTCTCTCAGCGCTGCAGTTTTCCGATAAAGGAGAGGAC 

TCCTGTGTGCCAGAGCTCTGAATGGGAGCCTCTTCTCAGTCCAGCCAGGCAGAGGGTGAG 

GCTGCCACCTTATGGCCACTGGGGGAATTGGCTCTGGGCTTGGACTCCAATMGGGGCCG 

GGAGCTGCAGAGACCTCCAAAAGGTCTCTTAACTAGTCCTCCAAA 

Sequence 2742 

CCGCGGTGGCGGCCGAGGTACTGCCATACCTGGCTMTTTATTTTTTGTGGAGATGGGAT 
CTCACTTTGTTGTCCAATCTGTTCTCAAACTCTTGGCTTCAAGTGAGCCTCCTGCCTCTG 
CCTCCCAAAATATTGGAATTATTGGCATGAGTCACCATGCCAGATCAAGAAAATATTTAT 
GTATAATTTTATCATACCTCATTGGTCCTAATGTTTTTTTGCTTGTTTAGGTCCCTTCTA 
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GAGATAGGAGAAGAGAGAGATCCCTTTCTCGGGGAGAGAAATNCAAGCCGTCCCGATCCT 
CTTAGGGCTNGGA GGTAA MTCNTTTGATMCTTTGTATTNAAAACTTTGCATCCATAGT 
ATGCTMGGCIsrrTNTTTTANCCCAMAMTTTTNCCTTAAAAGTTrr^ 
AAAAAGGGCCCCNAANACCCAAGNNAAAGGGNNGGNNCI 1 1 I I I I 1 i 
Sequence 2743 

CCGGGCAGGTAC I M I I M H f I M \ 11 ITTTTC1 I I M M M 1 1 (1 I 1 1 I I I i 11 1 1 » » 

I M I M I I 1 I I I I I 1 1 M I I I I I I I I 1 1 I I I M I I I I I M I I 1 1 I I I 1 1 I I M I I I I 1 1 1 

TTTTTTCCCCNGGNNNNAAAAATTTTAANNGNNNGGAAAAAAAAAAANCNGGGGAAAANG 

NNCCNTTNNTNNNAANCCNTNAANNAAAAAAANNCCNNNAAAANNAAAAAAAAACCCCAA 

AANNNNNNrTNNNAAAAAAANCNTNGGNNNAAAAAAAMGGGGGGNGNNANTTTNNNNCCC 

CAAAAAAANGGTNNGGNNGGGAAAAAATTNNTTNNCAANCCANAANTTNANTNCAAAAAN 

CCNTNNTTCCCNTGNCCNTNNAAAAAAANGGGNACCCCCNGGGGGGANANAATANNAAAA 

AAATTTANTTTAACCCCCCCCCCAA 

Sequence 2744 

GGGCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI 1 1 I I I I M I I I I I I GAGACGG A 

GTCTTGCTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCGATCTCGGCCCACTGCAAGCTTC 

GCCTCCTGGGTTCACACCATT CTCCTG CCTCAGCCTACTGAGTAGCTGAGACTACAGGCA 

CCCGCCACCACGCCCGGCTAATTTTTTGTATTTTTAGNAGAGAGGGCGTTTCACTGTGTT 

AGCCAGGATCGTCTTGATCTCCTGACCTCGNGATCTGCTCGCCTCGGCCTCCCAAAGTGC 

TGGGCTTACAGGCGTGAGCCACCACACCCGGCCCAGTCTTGTGTCTTAAATACCATTTCC 

ACATTGACACCTCCAGAATATTTATCTCTAAACCTGACCTCCAGAAGGTGCAAGCANATT 

GTGTAAACCATGTCTCCACTACACGCCTAAAGGCA 

Sequence 2745 

CCGGGAGGTGTCCGAACAGGCAGGTTGGTGGGTTAAAGGTC7TAATCTTGACTCGAGATC 

TCTCCCCGGAGTTCACAGAGTAGGCGACGAAGCCGAAGCAGCTGGAGCGCGACCCGGAGG 

AGTCTGACTTCTCGTTGTCTTCATMTTTTCATTCGTTGCTTTCTTCATGGACTTGCGGC 

TGGGGGAGGATCCCCGCTGGTCGCCGAGCANGCGGGCGGGTAAAGGTAGGCCGCGAGAGC 

CAGGTTATNGAGAGGAGAGGAGGC 

Sequence 2746 

AG GTAC CTGTGACTAACAAGGGGTCTGGGAGGATCTGCTGCTCCCATGCCCTCCTTTGTG 

TGTTTTAMTCtGTTTGAGCCTTCTGGGCTCCTGCGAATTAGGGAGTGGCAGCTCCTCAG 

TCTMCTCCTATTGNGACCAGGTTGCCTMTTGGCCCTTTGGTTTGGGCACCCACTGTCC 

TTCTGCGTGGTTGGATAGATGCTGCTCCCAATGTCCCTGATCTCTTACAGACCCCTCTGA 

TTCTTCACTCTTGGCTTTGAGAGCCCCTGATGCCCTGCAGTCTTGACTGAGCTTCTAATG 

GTTGATCAGACCCTTGAATGTTGAGCTCTTTCCATACTAGACTTGAATATTCTCCTG 

Sequence 2747 

GGCGGCCGCCCGGGCANGGTACANGACATTTTCAMGTTGCCAGTGTTACTTTAATTGGA 

CTGCCTTCGTAATTCATTGCCTCTGCTTCAACAATGTGCAACTCATCCTTTGCACCAGCC 

CCTAMCTGACCGTTCTTAAAGATAGCTGGTGCTCATTTTCATCATTATCCACCTTAAAG 

NGATMTCTTTGTCGGCCTTTAGTTCACAACCGAAAAGATAGTTCTGGGGCCTNAGGGGG 

CTCATGTCCATGTCCATCGAATCTTCCATNGGGNGGCGGCACNCNNTTTTTNTAGNAAAG 

AAGGCGGNCGGAANATAAAAAAAACTNTNCTCCAAAAAAACACCCGGNGCANGGAGGGNA 

NTCANACCAGGGGCCCCCGTTTNCCTNGGATCNTTTNATAAAAAAATNGGGGGNCCCCCN 

CNGNGNNNGGNNGNAAATTCTAATNNTAMNTTTTTANCCCCCCCCCCCCCCCCTNGGGG 

GGG 

Sequence 2748 

CCGGGCAGGACTTGGGAAGCTGAGGCATAAGAATCACTTGAACCCGGGAGGTGGAGGTTG 

CAGAGTGAGCCAAGATCGCAGCACTGCACTCTAGTCTGGGTGACAGAGCAAGACTCTGTC 

TCAAAAAAAAAAAGACTAGAGAATGTCAGGGAACACATGTGTATATTTAAACAACTTCAC 

TTTGCATTTAAAAACTCGAAGGACAGCGAAGGTGAAATCAATTCACGGGCCACCTAAACT 

TTTCAGTCTAGGACTNAGAGCTTGTANCCATNGATCTGTNGGGNTTGNACCTCGGGCCGN 
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TTCTANAACTAGTNGNATCCCCCGGCT 
Sequence 2749 

CCGGGCAGGTACAACATGAGACATGACGCCCTTCGGGACACATGCCTGAGGTAGTGACAA 

TCCMCTTTG GAAGAG TGGMGCCCTAGTTTCAMTTCMGCATGCTTTGAGTATAAATT 

MGTTTACCTCTTTTTGCACAGCAACATGGCCMTCTTTCCTAAGCTGCTCAGCTTACAA 

GAAA AGGAATCATACTGCTAAGAATTCAAACTTCAGCAGTCATAGGTAAAGTAAGGGAAG 

TTTTN TA AACCNT ATTTTTAGCCCCCNTACCCNGMCCCTNGNAMTTTTNGCNAGGGTT 

Mil iCAAl I 1 1 iCNAGGGACAGGTTGGGGTTTCNCTTTAAATCCANAGGGCCTTTGGAA 

NACCNTGGAANAAACCAGACCCTTTTAAAAAAGG 

Sequence 2750 

CCGCGGTGGCGGGGTTTCGCCATGTTGGCCAGTCTGGTCTTGAACTCCTGACCTCAAGTG 

ATCCACCCACCTTAGCCTCCCACAGTGCTGGGATTACAGGCATGAGTCACCACACCCGGC 

CAGTAGMCTCTTTA AACCTGAA AAATCAGTCAACTTTGCAGACTAGAGGAGGATGTTGA 

ACACCTATGTGTGTA I I I I I 1 1 1 CTTTACCAACTATGCACCTATTTTTCAGACACCTAAA 

AGTAATGTCTGTGAAACAGTGGGTTTTCTTTTT 

Sequence 2751 

CCGGGCGAATTGGAGCTCCCCCCNGTGGCGGNCGCCCGGGCAGGTACAATGCTTATAAAA 
TTCMTAATTTGTA7TAAAATACAAAATCCNATAACAACCAGGAGTTCTTCGGAAGAAAA 
AAAAAATCACAAAACAACCCCAACAGTGGTGAAGAACTA 
Sequence 2752 

TA GGGC GMTTGGAGCTCCCCGCGGTGGCGGCCAAAAGCTCCCAGATTATTTGGAAACCA 

TGTTTTCTTCCTAGTCCATGGTAACAAGAAAAAGCCANTGGAAGCATCATTCCAAACAAT 

AATCTCCAAAGATGGTGGCAACCAAGTGTCAAATGGGGACTGCAGGCACAGAAGAGACCA 

CCCCAAACCCTGCCTGGGTGGACGAAGCAGGTATGCTAGAATAGTCCTGTCCTGCAGAAT 

AGGGAACGGCAGCTTGGTCGATCTGTGCCCTGGAAAAAGAAAATGAGTTGCAATAGAAGT 

GACTNTAAGACAGACAATGAACCTACTNTTAAGAGAGACAGGGCCAGGCACGGTGGCTCA 

CGCCTGTATCCCAGCACTTTGGGAGGC7TNAGGCGG 

Sequence 2753 

GGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGCAGAGACAATGGAATAAACAGCCAAG 

AATGTTAAAATAGCTATTATAAATATGATCCTTATGCTCATAGAGAAAAGGAAAAATATA 

AGTATAATGTGAAAAGAAATGGAAGATATAAAAAGAAATGCAACTTCTAGAGGTGAAAAA 

TATGTCTGAAATGAAAACACCATATAGATGGAATTACCAGGAAATTAGACACTGCAGAAG 

AAAAAAATCCATGATGTTGAAGTATATTGCAATTAAAACTATCCAAAATGAAACTGAGAG 

GGGGAAAAAAGCCCCTGAAATGTATGAAGAAAGCCTCAGTGACCTGTGAGACAATATCAC 

AATGGCCTAACATAAGTGTAATTTGGAATCCCCAGAAGGGGTAGCAGGCCCGAAAAAAAT 

AA7TGTTTG AAT AAAG G G N C AAA A G GTTTTCCC 

Sequence 2754 

CTATAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTTA i 1 1 I I I I i I I I I 1 

I I I 1 1 1 IGAGANAGTGNGTGACTNTGNCACCAAGGCTGGAGGGNNGNGGNAAGANCATAG 

N TCACTGAAGCCTCCATNTN TGGAGTTNAAGGGATCCTCCCGCCTCANTCTCCNAAGCTA 

AAI II I I M I I I I I I I I I II I I AGNAANANACATGGTTTCACTATNTTTGCCCAGGCCAG 

GTCT AMCCTTNCTNACTTNMNGCMNTNNCTTCTNCCTTTAANCCTTCCCAAAAGGGN 

TTTTGGNATANNNTNANACNCAGNCCCTTNTNCCCTGGGCTCTTTCNTTNTTAAAGAAAG 

GACAC TTTAM TCANTTTTTNCNCNCNCAGAGATNAATTGNGCAACAAACANTATTANGT 

TGGGGTTTTATTTNAGGGTTCCTNNCCCGGNNNGGTCCCTTTNANAAAANNTNNGGGAAN 

CCCCCCCGGNGTNTGGGGGGANTTNTNANNNTNCAANTTTATTTTATTNCCCCCCCCCCC 

NTCG 

Sequence 2755 

CGCGGGGGCGGCCGGGAANGGNCCNGGAANGGNAGTCANGCAGGGAGCGTCTGTCCGAAC 

GGAGGCTAGGTAAGAATATTTCACCATGAAAATGTTAAAAGACATAAAGGAAGGAGCTAA 

ACAATATGGACCCANCTCTCCTTATATGAGAAGGTTATTAGATTCCATTGCTCGTGGAAA 
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TAGACTTATTCCTTATGATTGGGAMTTTTACCTAMTCTTCCCTTTCACCCTCTCAGTA 

TCTACAGTTTAAAACCTGGTGGATTGATGGAGTACCTCGGCCGCTCTAGAACTAGGGGGA 

TCCC 

Sequence 2756 

CTACTTAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTAC I I 1 I 1 11 I 1 I I I 1 I I 
1 1 1 i 1 I 1 1 1 1 I 1 1 1 1 1 1 1GGAGNGAAACAGGAGTGCTTTATGGTCTGAGTGGAGTGTTTG 
GGAGGAGTNCCTCCCGGNTCCTGCCTTGGGGCTCACCTCCCG 
Sequence 2757 

CCGCGGTGGCGGCCGAGGTACATCTTCTCCTAAAAACAAGGGTAGAGCCAATGGAAAGTA 

ATGGTTCTGTTACATAGAATGAGTTGTCGCGTTGATCTTAAATGATGTATTGGTAGATAT 

ACTTCCCMGTGGATTAAAAAGTTAAAACTTACAGCATAACAAAGTATTAGACTTACTGA 

GGTG ACTTGM TATCTCCTTTTGATTTTCACTCTATTmCTmCA 

GATAATTTTTTMTAAACCMGGCTCTTACCATAGCTGAACTTTAAAACTTAGACTGTCT 

TTTCTGTAAACGATTCTGAGGCAAAGGGAAATGACTAGAAGAGGATGAGTAAACAATAAC 

CTGAAATGGGAAACTCGAGGGAAGCACAGGC I 1 1 1 1 I 1 1 GGTTTGTTTTGGTTG ATNCGT 

TTTTTGGTCTTTG 

Sequence 2758 

CTACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAGGCCGGTGGGCT 

CCCTGACATCCCTTCCAGGCAACCTGAAAGCACTGAAATAGCTTATGGCCCTGTGCCAGG 

GACCTTGGCCCAAGCTGCTGACCTCCAGGGTGGGGAGGGAGCTACCCCCAGGAGAAGAGT 

CACTCAGACAGCAGTATGAGCAAGCCAGCCAGCAGCTCCGTGCCTGCACCCAGCTCAGGG 

GAATCCCAGGGGGTTCAGATGCCCAGGAAGGAAAAGGGGACAGCGCTACTGCTATGGAAT 

GAGACCACCACTTCTCCTGTTGTCCTTCCCAGCTTCTCCCCAACCTCCCCTTTTCCTAGT 

TTATAAGACAGGAGAAAAGGGGAGAAAGCAAAAAGCTGGAAAGAAACAGAAGTAAGATAA 

ATAGCTAGACGACCTTGGCGCCCCACCTGGCCTGGGNGGG 

Sequence 2759 

CCGCGGTGGCGGCCGCCCGGGCAGGTACAGAMATTAGCAAGAGACATTTTCTGCATTGT 

GA GMA TCAACATAGACACCTTAMGACCCCTTTGAGAGTGTGGCTTTTTGAACTTTTCA 

GATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGGTGTAAATAGAATCTA 

ATGGCAGAATCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGATCAAGACCACAAAAT 

GTCCA GAGCTAT GACCACAGCTATACCTACCCATAAAATACGATACTGGAGTAGGGTATT 

TTTGTCT7TTTTTCTTACCTAAGAGCTAGCTAATCAGGACAGGTGATGCAGGTTCTGGAG 

CTCTACCAGGGCAAGTTCTATTTTCTTTTTTTGAGACAGAGTCTCACTGTCGCCCTG 

Sequence 2760 

CCGCGGTGGCGGCCGGGCAGGTA C I I I 1 1 1 I I I I 11 I I I I I 1 1 I I 1 1 CGAATATTNCTTT 

CAGTGTTCTCAGATTGACTTGACCAGCCTAAGACAGATGCCAGGGACATCCTCTTNTNTG 

CCTNTNAACACTTCAGTCAGATGGGAATATGGAAGGATCATATNCAAGAGGATCATATTT 

TNTGAAGCCAATCCATTANATGTCAGGAAA 

Sequence 2761 

AATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC7TGTAGATGTTCTTTGGTATCTGGGCA 

TTGMGMTTAGGTATTTATTATMTCTTCACAGTCTGAGCTTGTTCGTAGTGGTCCTTC 

TTGGAAAGGCTTTCTAGATATTCAGAAGGACCGTAGGTGTTGTGATCTAAGCTGTATCTG 

CTTTAGAGGGCACCCGAAGCCCCGTAATGCTCTGGTTCTTGCAGACTTCTGGAGATACTG 

CCTTGATGGTCTTGGATAAGATTTGGAAGAATCCTCTGGATTATCAGGCAGAGACTCTTA 

TTCTCTTTCCTCACTTTTTCCCAGAGTCTTTCTATGCTGAGCTCTCTGGAGCTGGGGGAG 

G 

Sequence 2762 

TNNCCGCGGTGGCGGCCGAGGTACTGTCCMCCAAAACTTTCCACNGNGMAATT7TCCT 
GG GTGAGC CTCCCAGAAAAGCCCAGCTTAGTGTAAGCCAAAGACCTCCCAAGTCTGTCAC 
CAAl mm iCCCTATTACTCACCTGATCATGTGGGCAATATCCCAGTTGGTCTCTGTAGA 
CAATGGTCCCTCTATTTCAACACCTTTTTCGGTGACAGTGGCGATTTGAAACTGGAAGAA 
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AGCAGAGCATTCAATAATGCCCACGCCTTAAGTCCTTAAATGAAAGGTCAGGTGGAGGTC 

TTCCCCAATGTGAAAATAGGAGTCACACAAGTAAGGCGNATCTGTTCTTCAAAGCATAGG 

CTC 

Sequence 2763 

AGGTACTATAATCTCCTTCTCCACTGGCAAACAGATTATACCAGAAAACAGCAGTTCCCT 
ATGGAGAACAATTTCACAATTATCCCCMGTATACAATGCCAGGTMTTGCTTTCGGTAG 
TTATTAC I I I 1 1 I I I I I I IGAGACAGAGTCTTTGCTCTATCGCCCAGGCTTGGAGTGCAA 
TGGCTCGATCTTCAGCTCACTGCAACCTCC 
Sequence 2764 

GCGGNCGAGGNACAANCNACTTGGGGGGGCANAAAACCNGCCCCCCCCACGANGAGAAGG 

GGACNANGAGAANNNTTTACACACAAGNGGGGGANNNCCCCCNAAAAAACCGGGCGCAGN 

GGCNNACACNNNGNAANNCCAGNACNNNNGGGAGGCCGAGGCTNTTTNAAAAAAATTNNT 

TTNGAGGGGGGGGGGGCCCCCGN 

Sequence 2765 

CCGGGCAGGTACGCGGGNTTCCGCGGGGCTTGCTGGGAAGAGAGGCGAAGCCAGGTCACC 
TTTCAAGGACCCAGAAGTAGGGTTTTGGCCTAGGTAACCGGGGCAGAGATGTGGTTCGAG 
ATTCTCCCCGGACTCTCCGTCATGGGCGTGTGCTTGTTGATTCCAGGACTGGCTACTGTT 
TNCCT 

Sequence 2766 

ATAGGGCNAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTACCCTCTGCTTTTGCAAG 

GACTCTACTGTGTCATCTTAAGTGAGACAGGTGCATATGTATACTAGTAAAATTCTCTGC 

TTTCCTAGCATTGGACAAAAACAAAAATCAACAAAAGAATTGCCTCAGTGTCTTAAACTG 

GGATCCTTACTAGTTGACTAGGCACTTAGTTACTGAAGGATATGTGTGGAATTCAAGTTC 

TTTTCAACCTATAAGAAATATCCGGCCAGGTGCAGTGGCTCACGCCTGTAATCCCAGCAC 

TTTTGGGAGGCCGAGGCAGGAGGATCACAAGGTCAGGATTNTCAAGACCNGCCTGGCCAA 

CATAGTGAAATTCCTGNCTTCTACTAAAAATN 

Sequence 2767 

CGGCCGCCCGGGCAGGTACTATTACTAGGTTCATTGTTTCCAGAGGGGTGAAACGGGGCT 

TTGGAGAGGTTAAATAACTTGCCCAGGGTCACACAGCTATTAAGTGGTAAAGCTGGGATT 

TACATGAGCCCAGACAAAGAACCCAAGAAGCTAAGCTATTNTCTTGTAATACCTNCAACA 

TAGGAGGCAAGAAGTGAGGTATTATACAGGTTGAGGAGATA 

Sequence 2768 

CCGGGCAGGTACGCGGGCATCAGCTCCGTGGGAACTCACGAGCCTGGGGAGAGTTCGTCA 
TCCCCACATGGAACCTCAGTATGGCCAACAGGCAGCCCTCTGGTGGAAAAATCAACAATC 
AGTCCTTGCANCAACTTGATGGAACNCTAGTTGACCACCATAAAGATGTCAAGCCAGGCA 
GCA 

Sequence 2769 

AGGTACTACTGCTGAGGTCTCCAGGACAGAAGTCACCTCCTCTGGTAGAACATCCATCCC 

TGGCCCTTCTCAGTCCACAGTTTTGCCAGAAATATCCACAAGAACAATGACAAGGCTCTT 

TGCCTCGCCCACCATGACAGAATCAGCAGAAATGACCATCCCCACTCAAACAGGTCCTTC 

TGGGTCTACCTCACAGGATACCCTTACCTTGGACACACCCACCACAAAGTCCCAGGCAAA 

GACTCATTCAACTTTGGCTCAGAGATTTCCACACTCAGAGATGACCACTCTCATGAGCAG 

AGGTCCTGGAGATATGTCATGGCAAAGCTCTCCCTCTTCTGGAAAAT 

Sequence 2770 

CCGGGCAGGTACAGTTGGACCTGCTGGCATTCGAGGCCCTCAGGGTTCACCAAGGCCCTG 
CTGGCCCCCCTGGTCCCCCTGGCCCTNCTGGATCTCCAGGTGTAAGCNGTGGTGGTTAT 
Sequence 2771 

CCGGGCAGGTACTGTAGATTGAGATATAACAAAAAGATGATTTCTGAAATAATATTGGGA 
AAAGAATTTACCGGATGGTATTTTGTATCTTTGCACACCTCTGTTGTTATGATGCTATAA 
GGCAATCTTGTCCAACCAGCAGCTCACAGGCCACATGCCACCCAAGATGGCTTTGAATAG 
GGCCCTATACAAATTCGTAAACTTTCTTAAAACATTATGAAA I I I M I I U I I I M M I I 
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ACAATMGGCTCTTGAAMTTGTCATTACTTGTGTTTTCCTATACATTCATCTGTGTGAA 
AGCC I I I II CTTCTTTGATTTAAAAAAATTMCTATACAGTTAATGGTTTAGAACTTAGA 
ACTAC 

Sequence 2772 

AGGTACACTCAAAGGCTATGTTCTCTCTCAGGACTCAAGATGAATTACTGGCAGAATTCC 

TCACTACATCCTTTMTGGAAACCTCTGTCGGTCCTCATCATCAGTGCTTTCATATTCTG 

ATTCTTCACTAGATAATTCCTCATCCTCGTCTGGCAAAGGAGGTTCCTCAGGTGGCTGAG 

GAGATGTGGGTGGAAGAGGTGCATGCAATGGCATATAGTCTTCATACATGGGAGGTCGCG 

CAGTAATTGGTGCAAMGGTGTGGGCAMTTCATTTTATTCATAAGATGAAGGACCTGTN 

CCTGCCCGG 

Sequence 2773 

CCGCGGTGGCGGCCGAGGTACTCAATGTTTAGCTCCCACTTAGAAGTAAGAACATGCCCA 

GCACTTTGGGAGGCCGAGGTGGGTGGATCATGAGGTCAAGAGATCNAGACCATCTTGGCT 

AACAANGTGAAACCCTGTCTCTACTAAAAACATAAAAAATTGGCCCGGGTGTGGTGGTGG 

GCACCTGTAGTCCTAGCTACTCGGGAGGCTGAGACAGGAGAATCGCCTGAACCGGGGAGG 

TGGAGGTTGCAGTGAGCTGAGATGGTGCCCAACAAGAGGGAAACTNCTTNTNAAAAAAAA 

AAAAAMGAAAACACAGGTATMTTTCCTACCATGTTCTTACNGGATCATAACTTTAAAT 

GGTCACCCGCGTNCCTTGCCCGGGGCGG 

Sequence 2774 

CCGGG CAGGTAC I I T I I I I 1 1 1 I I 1 1 1 1 1 I 1 I I I I AG AATAAAAGACCATTTTTATTCTA 

ATTTTMCTCAGAAATTATTATGCTTATTCMTTTMCACTTTCGCTGAAAGGTTAAAGA 

GATMGMGGACAGATTATMTTGCTTAATATTGCTATGGTAACTTCCATTCAAATACCT 

GTGAGTCACCAGAAGTCAAAAAGGTAGCCAGCATTGCAACACAGGATGGATCATGCAACA 

GAAACTAGCACCAGGTTACCTTATCTTATMTATTATTTGCTGTTAAAATGAAATTTTAA 

AACAGCACCAAMGTTAAGTTGGGGCTAAAACTGTTGTGCAGGAAAGATTTCATATAGCA 

G 

Sequence 2775 

TGGCGGCCGCCCGGGTNGGTACCAAAGCCAGATCCTCCTGTTTTGTAGCAGGAAGCCCTT 

TATTAGTTNNTCTCTATCAATCCATCTTTNATAATNNCCAAAATAGGATAAG 

Sequence 2776 

CCCACGCGGGCGAAATTTTGGGGAGGGCAAACACCCCCCGCGGGNGGGCCGGGGCCCGCC 

TCCNAGTAAACCTAAGAGGGGGAGGNTNTTCCAANNCCCCGCAGNTNNNTTAAAANCCCC 

ATCAAGGCCCNAANCGGAANAACCCGGGGGGGACCCATCTNAAGGGGGGGGGGGGCCCCC 

CGNGGNACNCCCAAGCCAANNANGGNNNCCCCCCGNAAAGNNGGAGGGGGGCGNAAAANA 

GGCAGCCGCCCNCNNGCCGGNAAAANCAAAGGGGGNCCAAAAAGNCAGGGGGGGACCCNG 

GGCGGGGGGAAAAAAAGGGAAAANCCCCGGCNCCAACCAAAANNNCNANCAACCAAACCA 

ATAACCAAAAGCCCCGGGGGAAGGCCAAATAAAAANNGAGGGGNNAAAANGCCCCCGGGG 

GGGGNGGCC 

Sequence 2777 

ATGGGGGAAAACCCCGGGGGCGGCCGCNCNAGAACTAGTGGAGGNNTCCAGCCCGCAGGN 

AANNAAACCACnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnNCAA 

CNGGNCCCCCCAGNGAGGGGGAAAAAGCGCGCCGGGCGGAANCAAGGNCAAAGCAGGNGC 

CCGCGGGAAAAAAGGNANCCGCCCACAACAACCACACAACAAACGAGCCGGGAGCAGAAA 

GAGGAAAGCCCGGG 

Sequence 2778 

ATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 I I I I I I I U I I 
I I I I I I II I I C CNCCC NMCCAGTTCTMCCGCANGNCAATTAGTTTTTNAAAAGTCTCT 
CTCTTCTC ATGTTTTTTCCTGCTTTG AAACTC AAG NG NTCTCTTTTACTTAATAN ATCAA 
GATCCAACACTAATTTGACATTAAATCAAGACTCTNTACAAGTNGACTATCNCTAACNGG 
CTGGGGGACTTCTGCGGAA 
Sequence 2779 
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ATTGNAGCTCCCCGCGGTGGCGGCCGAGGTACATTGCTGGCCTTTGCCCAAATTATGCTT 

NCCCCATTTGGTATGACCTGACACCATTGTGATCAGTCTGATGACCTGGCAGCATCCCAT 

C TGCCTA CCCAGTTCACTNTTGCTCCATTTCAGGGCCTCTTACAGGCAACTNCTNACATA 

TAT7TTTGACACTGACTCATGCCTTCTNAGGCTNAGCTAACATCAGCCCATTCATTGATN 

CAGCAAGACAATTTGAGAACCACTATGTGCTCTGCCAGAGTGAATAACAAAACCGCAAGT 

ACCTNCCNGGGCGGNCGGTCTAGAACTAGT 

Sequence 2780 

CTATAGGGCGA ATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTA CI M I NIM! 

nniiimi i ntncct mccagttctmcncmgtcmttagtttttcanaagtctct 

CTCTTCTCATGTT7TTTCCTGCTTTGAMCTCMGNGTTNTCTTTTACTTA 
Sequence 2781 

GGGGGGGGNTTTGGGGGGGGAAAAAGCCCCGGGCGGGAGNGGACAGAAAAAGANNTNGGG 

GNNGNGAGGGGGTTAGGAGNAANCCCNNGNGGGTTTTCCNAAACCCCCAGNGANGCCGGG 

NCGGCNGNGCGCGCANNGGAGGGCAGNGCTATGNANCCGNCNNGGGNACNCCCAGGCCCN 

GGGNNCCACGNGGCCCNNCCGGCACGNACNGNGCCGCNAAACCNCGGNAGGNCAGGCGAN 

GNANCCCNCNGCANAANNGAAGGAAAAAAAGGGGAAAAGCGGGGGCAAAGAACCAACAAC 

NCNGGGGGAACCACAAGCNAGGGAACCCCGNGAGAANAANANANCNGGGGANAANNNNGG 

GGNGGANAGNAANAACAAAGGGAAAANGGCCCCCGAGGCGNGGAAAACCCAAAAAGAGAG 

GNNAACGGGGGGCGGNNAANNGNACCANNGGCGNAAAAAAAAAAGANAAAAAGAGGGCNN 

GGNGGGGGGGNCGNAAAAAANAACACACGGGACCCCNNNNAAGGGGGANACCCNCNCGGG 

GNGCCGGGGGGNCCCCAAGNAAAACCNNAAAGGGAGGGGNANNCCCCCCCCCGNGGGACC 

CGNANNAANGGGAAAAANCCCCGGAANGAAAGNCAAAAGGCCCNNGNAAACCGGNAAAAA 

ANCNCGGGGNCAGAACCCCCCNCANAANGGGGGGGGGGGGGCCCNAGGGGNAAACCCCCA 

AGGCCCGGGGGGGGGNGNNCCCCCCNNNNCAANNNNNNNNNNAAAGGGGNNNCNAAAAAA 

AAGCCAGCCGGCCNGCGGGNNCGGGAAAAAACNAACGGG 

Sequence 2782 

TCCCCGCGGTGGCGGCCGCCCGGGCAGGTACCATGCCGATCTCTGAGAAGTTNTGTTGCA 

CCACTGTGAAGGTCTAGATGCAAGCTTGGCTCCCTCAGAMGGCGCTTCCCTTTTGCATG 

GCTGAGGATCCTTGAAGGAACCTGGTCAGTCTCCGGTTCAGCTTCCGACACCAGAGTGGA 

ACCCAGTAAGCACCATCAGGAATGGATTTCACTACMGTGTGGATAACTCTGATTTTCAA 

AGGAGTAGTTACTTGCAAATTACATCCTTGCTGAATTCAGGAGGTATGAMCCCTATTTT 

ACCATGTTAG AAAACA GCCCAGGATTTTCTCATTGCTCTGCCATCATATATGTCTATGAC 

TTGAGCCCTTATTTTTCCATCTGCAAAACMTMTGCCTATATGTCTTTGCATATAGATT 

TGGAAATCTTCATTCAAGGTTTAGTAGGATCA 

Sequence 2783 

CCGCGGTGGTCGAGCGGCCGCCCGGGCAGGTACTTTTCTTCTTGAAGTGAGTTTAGATCA 

CGTTTCAGCAAACGTCCTCTGGAGCTCCTTCCTCTGGGGAAGGAGCAGCGGAGGGCTGTC 

ATCACCACTGCACTCTTCCAACCTTCTCTTCTGCTCATTCACCAGTTCCATCCTCATCTG 

ACCATATTACTCCAGGMCTTCCCTTTTCCCTTGTAMTTCTGTTCCATTTTTGGCCAAA 

CCATGGTTTGCCTCTGCTGCATTAGTTTGAAGTC Al I M I M I CCAGACATTGTGGCGGT 

AGTTCCCAGATGTCCGAAAAGATGAAGTTATCAGTGCACGAGCAGCCAGGGCTTTTCTTT 

ATGTTGATTCACCTTAGGACACAGCCCAACAATGCCTGCAATTGTCATTAGCCCGTGAAG 

AGGTTCGCTTAGGGCTGCCACAGCACAGAGTCCCAGCTGCGCCCGTTTCTTCCCCCCCC 
Sequence 2784 

CCGCGGTGGCGGCCGAGGTACTAA I M I M M M M M M M M I I GTATTTTTAGTAGA 

GACGGGTTTCACCTTGTTAGCCAGGATGGTCTCTATCTCCTGACCTTGTCATCCGCCCAC 

CTCAGCCTCC CAAAGTGCTGGGATTATAGGTGTCAGCCACCGCGCCCGGCCTGAATATCT 

TTTTTATTTTTM GCTTT CAATAAATCTTACTGACATCTAATTGATAAAAGTTGCACATA 

TTTMTGTATACATTTTGATGAGGTTGGACATATGCATACACTNGTGGTACCTGCCCG 

Sequence 2785 

TTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI M M M M M M M M M 
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GAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAATTAGGGTTCACT 

GCAACCTCTGCCTCCCGGGTTCAAGCAGTTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGA 

TTACAGGCATCCACCACCGTGCCCAGCTAATTTTTGTATTTTTAGTAGAGACGGGGTTTT 

GCCATGTTGGACAGGTTGATCTCAAACTCCTGACCTCAGGTGATCTACCCTCCTCGGCCT 

CCCAGAGTGTTGGGATTACAGGCATGAGCCACCATGCCAGGCTGCTMTTCTCCTTTTTA 

GTGAGTTAGGGAACTGAGCCTCAGAAAACTTAAACGATTTCTCAGAAAACACTCAAGTGA 

TAMGTGGCCCCATTGGAMGGAGGTTTTTATCTTCTCATTGGCAGGCCCAGNGTTCATT 

GCACAATATCATGCTACCTCTTGGAATCTTTTAAAA 

Sequence 2786 

TCTTAGGG C GAATTG GAGCTCCCCGCGGTGGCGGCCGAGGTACTAAI I I 1 1 I I I 1 I 1 1 11 

i i l i l l 1 iGTAI I I I I AGTAGAGACGGGTTTCACCTTGTTAGCCAGGATGGTCTCTATCT 

CCTGACCTTGTCATCCGCCCACCTCAGCCTCCCAAAGTGCTGGGATTATAGGTGTCAGCC 

ACCGCGCCCGGCCTGAATATCTTTTTTATTTTTMGCTTTCMTAAATCTTACTGACATC 

TAATTGATAAAAGTTGCACATAT7TMTGTATACATTTTGATGAGGTTGGACATATGCAT 

ACACTCGTGGTACCTGCCCG 

Sequence 2787 

CCGCGGTGGCGGCCGAGGTACAATACAATCTAGATGACGGTGCAGACTAAGTCAAGAACT 

AAAGTTGTGCAGTAACCCGAGTTAAGGCATGAATGCGGACACACACATGCACACACACAG 

CACCCATGCTATCAAGACACAGGATTTTTTCAGTTGCCTCATGAGAGGCAACCTGGGCTT 

GGCAGTTAATCA GAAC TGCTGAGCATTCCAGAAAATGCCCCCCACGACTTTATGCTAACA 

GCTGTGTGTATGTTTTAATCAAAAAATTAAAGAAGAAAAAAAACCTAAAAAACAAAGAAA 

AAAACAAACAAAAAATCACCAAAAACCTAGAAACCCCTTAATCTCTTACAATGGCTCTTG 

AGCATGGMCTCATGTAGCAGCATCAATGGCTGGCTCTTTAACAATTTGGAAATAAAAGG 

TTGGTTTACTATGTATTTCTTTGGTAGTCATCACTACCAAAGTT 

Sequence 2788 

CCGCGGT GGCGGCCG AGGTA C I 1 I I J I I I I I 1 1 1 I 1 1 I I I II ) | I G CTTTNATGTCTTTA 

TGTATTTA I I I I 1 1 I I I GAGACAGGGTCTTGCTGTGTTGCCCANGCTGGAGTGCAGTGGC 

CTGGTCATGGCATNAAGGCTCACTGCAGCCTGGACCTCCTGGTTCAAGNGATCCTCTTGT 

NTGAGTCCCCTGANANAAAACCCCACCCCCNNCTACANAAATTTTNTGGAAACANGGGCN 

NNAANCTGTTNCCTANGCNTGTNTGGAACNCCTGGGCTCAAGGGANCTTGTANCCTTANC 

CNCCTAAAANAGCTGGGANTTATAAGGCATGANNNAATTGTANCCTGNCCCGGCGGGCCN 

CTTTANAMAAGTNGGAANCCCCCCGNCCTTGCAGGAAATTTCATTATNCAACCNTNTNN 

NAATCCCCCCCCCCCNCCAN 

Sequence 2789 

CCGCGGTGGCGGCCGCCCGGGCAGGTAC 1 1 1 I M M I M I II II I I < I I f 1 1 I I 1 i M AG 

NAGANATGGGGTTTCACCATATTGGTCAGGCTGGTCTNGAACTCCTGACCTTGNGACCCA 

CCCGCCT TGGT CTCCCAAAGNGCTGGGATTACAGGCATAAGCCCCCGTGCCCGGCCCACA 

TGGTATTATTTTATATAAMCGCAAGTTMGTNTTTATGTGTGAAMACTTT^ 

CTTTNTCAAAMGAANTTNACTTATTNGTTAAMCCNTTTTTTGNCCTAAGGGCCAAATA 

NGNGAAAAAAAANCCCNNATTNNNCTTTTTANATTTTTN^ 

ATTTNNCNNNTTTTTTTGGNGNNANTAAAAAAAMGGGTTTTTTTNNT^ 

Sequence 2790 

TACTATAGGG CGAATTGGAGCTCNCCGCGGTGGCGGCCGCCCGGGCAGGTAC I I I I I 1 I I 
I I I 1 1 i I i I I iGAGACGGAGTTTCACTCTTGTCACCCAGGCTGGAGTGCAATGGCGCAAT 
TAGGGTTCACTGCAACCTCTGCCTCCCGGGTTCAAGCAGTTNTCCTGCCTCAGCCTCCTG 
AGTAGCT GGGA TTACAGGCATCCACCACCGTGCCCAGCTAATTTTTGTATTTTTAGTAGA 
GACGGGGTTTTGCCATGTTGGACAGGTTGATNTCAAACTTCTGACCTCAGGGGGAATTAA 
CCTTCTTNGGCTTCCAAAAAGGGTGNGGGATNAACNGGGTTGNGCCCCCCCCCCCCNGGG 
GGGGGA AAAAATTCN NTTTTTTTNGGGGGNNGGGGNNNCAANNNCCCCCCNAAAAAAAAA 
NNNNAAAI I I I I I I 1 I ANAAAAAANNANNCCTTTNTNAAAAAATGCCCCCCNCCCCAANG 
NNGNNGGGN I 1 I 1 I I I 
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Sequence 2791 

GGTCTTCTATGTGGGTGTCAAGGATATGCTCCCTCACGGCTTCCGAAGGCCCCCAGCAAA 

AGATCTAACATTCTTGCTCAAAGTTGCGAGAGAAAGTAGCACATGGAGTAGCTGGGTTGG 

GGCGGCGGCCTCTTCTCTTCAGCTCCCTTAGCTTGGCTCCGTAAGTGGATCACTTGCCAA 

ATGCTTTAGATGATTGCCTCTCAATAATTGAAAGGTGGTGGTAAGTTGTATTGCTGCACT 

GTCGGTGTTAAGAGAMTTACTCTCACMGAGCAGAGGCCTGMGATTCTTTCTTCTGAA 

GGGATGATGAGCCTGGACTCTCTGGACTCCTAGATTATGAACTCCTGCAGTGGACCATGT 

CCTATTTTTTGGAGGGCGTTGGG 

Sequence 2792 

CCGCGGTGGCGGCCGAGGTACCTTCAATACCTTTAGTTGTCTCCCACACACGCGTGTGTG 

TGTGAMTCTTCTACMTATCTTCCCTTTTTTAGACCATGTTCACTGTCAAAAAGGTGCT 

TTAAGAGCAGTCTTTGGCTGGGCACGGTGGCTCACACCCGNAATCCCAGNACCCTGGGAG 

GCCGAGGCAGGCGGATCACCGAGGTCAGGAGATCAAGACCATCTTGGCTAACAGGATGGT 

CTTAAGGGACAGNGAAACCCTGTCTTNAACTTAAAATACAAAAAATTANNTGGGCGTGGT 

GGGNACCGCNCCTGTAATCCCANCTACTCAANGAGGCTNAAGGCAGGANTAATCACTTTG 

AACCTGGGAGGTAGAGGCTTGCAGTTGAGCCAAGATTGCACCCACTGCACTTTCANTCCT 

GGGCCGAANAANAGCCANGGACTTTCATTTTTAAAAAAAAAAAAAAAA 

Sequence 2793 

CCGGGCAGGGACCNCGGGATGGTGNCAACTTATGACAGGACCCATGGGCCCTCCCNATGC 
ACACAGNACTNTTGGAATCTNATCCTTTTCCATGGCTCTGGCTCACACTTNCACAGNATT 
TACTCCTAAATATGCCCCCTG NGTTCA 
Sequence 2794 

T ATAG GGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACANATTTGAATGGCTTTGA 

CTTTTGGCAGCTGCACAGNGCTAGGACTGGACCATGAAATATCTNTGGGCTNTGNCAATN 

ACATTTGGGTTAANCTAANCCTGATCCCATGTGTTCCTGGAAGAGAAGCCCCATGACATT 

CAAAGTCCTTGACAATNTGACACCAGCTTTTNTAACCNTATAAGGCCC 

Sequence 2795 

ACTACTATAGGGCGTCGAGCGGCCGCGCGGGCAGGTACGCGGGCACACTGAGGAATTATG 

ACTACTATGCMGCCGAGTTCCAGAATCTATTAAAAATGCACTTGTTCCTTAAGGGAAAG 

TTTCATTTGGCCGGGCGCGGTGGCTCATGCCTGTGGTCCCAGCACTTTGGGAGGCCGAGG 

CAGGTGGATCACTTGAGCTCAGGAGTTTGAGACCAGCCTGGGCAATATCGTGAGACCCCA 

TCTCTACAAAAATACAAATTAACTGGGCATCCTGTCATGCGCCTGTCGTCCCAGCTACTT 

GAGAGGCTGAGGCAGAGGAATCTCTTGGGCCCGGAAGGCGGAGGTTGCAGTGGGCTGGGA 

TCGTGCCACTGCACTCCAGCCTGAGTGACAGGAGTTAAGCCCTGTCTCAGAAAAAAAAGA 

CAAAAACCCAAAAAGTACCT 

Sequence 2796 

TACAGAAGCCGGGAGCATAAAGACGTATAAGCCTNGGGGGTTGCTCTAATGTAGGTGAGG 

NTAACATNACATTANATNTGNAGTTGNCGCCTNACTTGCCCCGCTTTTTCCAGTTCGNGG 

TAMCCTTGTCNTTGCTCAGCNTGCA7TTTAATTGTAATTNGGTCCAACNGCGCTGGANG 

GAGNAGGCTGGTTTTTGCCGTTATTTGGNGCCGCTTC 

Sequence 2797 

ACTNCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACGCGGGATGATAAT 

AAGA TGCM TTTGAATCTTCATCATACGAAGGATACTTTCTAGCTTGTGAAAAAGAGAGA 

GACCTTTTTAMCTCATTTTGAAAAAAGAGGATGAATTGGGGGATAGATCTATAATGTTC 

ACTGTTCAAAACGAAGACTAGCTATTAAAATTTCATGCCGGGCGCAGTGGCTCACGCCTG 

TAATCCCAGCCCTTTGGGAGGCTGAGGCGGGCAGATCACCAGAGGTCAGGTGTTCAAGAC 

CAGCCTGACCAACATGGTGAAACCTCATCTCTACCAAAAAAAAAAAAAAAAAAANGTACC 

TGCCCC 

Sequence 2798 

CCGCGGTGGCGGCCGAGGTACTGAGCCCTTCATTCCCCAMCTCAGACTCTTAGCTCTTT 
TGTCMCTCTGGGCAAAGGTCAGCATTGGAATCGAGCGGCCGCCGGGCAGGTACATTTTC 
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TCTCATCCACTTCCGATTCTCTTCATTGGCTG 

GTTGGCTTCTAGGAMCCMGCACATCTTGTTGTTTCTTCTTAATCGCTGTAGAGCCAGA 

GGAAGATGCAATATATACACCGGATCACCATCCTGGGAACAGCGGCTGCGGTTGTTTGGG 

TCCTGAAAAGGGCTGTGGAGCAGCTGCAGCAATGGCTGGAATCCAGCTAGGGGCTGAAAC 

AACGGTTGGCAGAGAAGGGTGGGGGAAGTGGGAAAAAGGAAGAACTCGCCAGAAGCCCCG 

CCTTCGNCTTTAGCAAGCTTCCACCGAGCTCTTCTAAGCGCTTGAGTTNCAGCCAAAAAC 

CCCCCCCCG 

Sequence 2799 

AGGTACTAACCTCTTACCTTCCAAGGTGGTAGAACATGCTTGAAAAGATAGTAAGTGAAA 

AGGGGTAGCCAGTGCTTTACTCACAAGACTGCTTGAACATGAGACTCAAGGAGGGACCTC 

AGCAGGCCTGGGGTGTTCAGCAACTATTCCTGGCCGGGGCATCTTGCAAAGGAGTTGCTG 

TGACAGTAAGCTCTTCCACTTTGAGACCGTCACCTCAGCCACGGCTCCCACCTGGGCTCA 

GGAATGGTCAGGCAACACGTGGGGCCCAGGATGGCGGTGGATGACTAAACTGCCGAAGAC 

CGCGCCGCTGCGACTACCGAAGTACCTGCCCG 

Sequence 2800 

CCGCGGTGGCGGCCGCCCGGGCAGGTACMGTATGCMGTTTTCTGTAAACAGATTACTT 

TCAAACAGTTGGGAGCCCCAGGGAGATAATAGAAGGTATCCTACTCAACAGCAGTATATA 

GAATGCTGTAGAAGAGGAATATTGTAAGAACAAGAACTATCTCTTGAAAACTAAAAATAT 

G ATAACTG AAATATAAATTTAAATAGGATTGGAAGCTAAAGCTAGGAAATTACCCTAGAA 

TGi i M I iGAGAAATGGAAAATACCGAAGACCAAAAAAATAAAATAAAACTATGCTGTGA 

MGAGAMGATTAGCATTTGAGMGGCGGGAAGTTGGTCCTGCTCAGATGCCGGTTTTCA 

GATGCCTTATTCTAAAAAATAGNGATAGAAAACAAATTTCTACAAAGCAAAATTACAATA 

AATTTA 

Sequence 2801 

CTACTTAGGGCNATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGACGGGTGCCTGTAG 

TCCCAGCTACTTGGGAGGCTGAGGCAGGAGAACTTGTTTGAACCCGGAGGTGAAAGTTGC 

AGGGAGCCGAGGTTGTGCCACTGCACTCCAGCCTGGGAGACAGAGCAAGACTCTGTCTCC 

AAAACAAACAAACAAACAAAAAAACCCTGTAGCTTGGGATCAGCCTTCTCTTCTGTTGTT 

mCTTTAAAAAATAAAAATTAAAAATAGGCTTCAAGTGATCCTCCCGCCATGACCTCCA 

AAACTGCTGGGATTGTAGGTGTGAGCACTGCACCCAGCCGTATGTTTTTTTCTACATAAA 

AAACAGCACAGGATTATCTTCCAAAGCTAACAAATATGTTCAAATAACCACAACCCCAAN 

TNNAAAAAAAAAAATNAAGTACCTGCCCGGGGCGGCCCCTCTAAAAACTAANTGGGATCC 
CCCGGGGC 

Sequence 2802 

NNANGNCCCNNNAGNGAGGGGAAAANGCGCGCCGGGCGNAANCAAGGACAGAGCNGNNNC 

CCGGGGGAAAAGGANANNCGCNCACAANACCACACAACANACGAGCCGGGAGCANAAAGN 

GGAAAGCCNGGGGGGCCNAANGAGGGAAGCCAACCCCACAAAAAANGCGGGGCGCNCACN 

GCCCGCNGNCCAGGCGGGAAACCCGGCGGGCCAGCNGCAANAAAGAAAAnnnnnnnnnnn 

nnnnnnnnnnnnnGNNNGCGAAAAGGGCGCNCNNCCGCANCCCCGGACACCNAAACGCNG 

NGCNCGGNCGGNCGGNNGNGGNGAGCGGGAANAGCNCACCCNAAGGCGGGGAA 

Sequence 2803 

NC CACCGC GGTGGCGGCCGTNCGGGCAGGTACTCCTTTCAGAGGGTCATCTCCTCCACAA 

GTAi II I I iGT TTCTT TGGCTGGTCTGGGTCCAATGCTGTTGCCATCCCCAGCTTCAGAC 

TGTTCTCCTTGTTTTGGAGAACTTTCTTTGGACTGTATCTTCAGAGACACTCCTGGTCAA 

GGGGCCTCAGAGGACCCAAACGCTCTGAAACAGCGTCTTAGCTCATCGCCGAGTGTCAGC 

TCTAGCTCTTCGGAGCGCTTTCTTCTCCCCCCGCGTACCT 

Sequence 2804 

CCGCGGTGGCGGCCGAGGTACAGAGAMATATTTTTTAAAAATCTCATCAGGCTAGGTGA 
GGTGGCTCGTGTCTGTAATCCCAGCACTTTGGGAGGCCACGCTGGGTAGGTTGCTTGAGT 
CCAGGAGTTCAAGACCAGCCTGGCCAACATGGCAAAACACCGTCTCTACAAAAATAATAC 
AAAAATTAGTGAGGCATGGTGGCACACACTTGTAGTCTCAGCTATATTACTTGAGAGGCT 
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GAGGTGAGAGGATCACCGTAAGCATGGGAGGCAGAAGTTGCAGTGAGCCTAGAATGCGCC 

ACTGCACTCCAGCCTGGGCGATAGAGCAAGACTGTNTCAAAAAAAAAANATTTGGGGAAT 

AAAGGAAAGTTCCTGCCCCGGGGCGGGCCCGNTTIfrAGAAACTTAGNNGGATCCCCCCCG 

GGGCCTTGCAAGGAATTTNCGATATCNAAGCTTANTTCGANTCCCCGTCCACCCTNGAAG 

GGGGGGGGCCCCGGNTACCCCAAATTTT 

Sequence 2805 

TAGGGCGAT TTGGAGCTNCCCCCGGTGGCGGCCGCCCGGGCATGGTAC I 1 1 1 1 1 I 1 1 I 1 1 

I I I 1 1 1 1 1 I IGCCAGCTTTTCTAAGCAAATAGATTGTCTGAATTAGTCACAGAATAATTT 

TGTGAAMTTCATGTTTMGTAGCMCTACCCTTTCTTTTTTTATATATTTTTAAGGNAT 

TAGTTTATCTTCTTCTAACTGGNGCAGTCACTTAATGTTTTCATTAATCTTCGACCTGGA 

GAGNGAAATACTGATATTTCTAGAAAAAAATTCTACTCCTCTGATTATTTGAAATGCTGA 

GGAAAATGTCCCTCCCATAGTAAAACTTGTAAATAAGGAACTATATCATATTCAGTAGCT 

GNGTTCTGTTCCATCTTTTT 

Sequence 2806 

CCGGGCAGGTACATTCTCTGTTCATTACTTAGTTCTTAAGGATATGTGTTCATCCATTCT 

GTCGCTGGCTCAGAGTTTGCT TCACTC TCTAGACCAGAGTATAATTTCATTTGGCAGTCT 

CCTATACAAATATGCATTTAAGTTTTTTGACACGTACGCGGGGGACTCAACAGAAATGGG 

TTTCCAGAAGAATAATGAAAAGTTGTGGGTAGGAAAATGAATCATTTGGACTCTTCAATG 

AAATGGAGTGAGCCCAGGAGAGCTCAGCCAACAGAGGCACTCTGGGAACCTGTTAGTAAA 

GCCAGGCTGGCCAAATGCCATTTGATTTTGAACCTCGTAGGTCCCCACTCACCCTCTGCC 
AGGAGCT 

Sequence 2807 

AGCCTCACCCGC GGTGGCGGCCG AGGTACCCCGGGTGTTCCTTTTTGTTCAAAGTCTATT 
TTTATTCCTTGATATTTTTC 1 1 I 1 1 I 1 1 I I 1 1 I GTGGATGGGGACTTGTGAATTTTTCTA 
AG GTGCT ATTTAAACATGGGAGGGAGAGCGTGTGCGGGCTCCAGCCCCAGCCCCGCTGCT 
CACTTTTCCCACCCCTCTNCTCCCACCTGCCTCTTGGCTTCTCAGGNCCTCTGNCTCTTC 
CCGACCTTCTCTTCTTCTTGAAAACCCCTTCCTNCCACAGCTTGCAGCCCCATCCTCCCC 
GGGCTTCCCTNCCTAAGTCTGGTCCTGGCGGTCCTCTNGTCCCCCGGGNTTTCNAGAGGA 
CAACTTNCCCCAAAGGCCCAAAGCAGTTTTTTCCCCCCCTAGGGGGGGGGGA 
Sequence 2808 

CCGCGGTGGCGGCCGCTCGGGCAGGGTACATGCCACCACGCCTGGCTTATTATTATTGTT 

TTGTTTTGGAGACAAGAGTCTCGTTCTGTCGCCCAGGCTGGAGTGCAGTGGCACAATCTC 

AGCTCACTGCAGCCTCCGCCTCCTGGATTCAAGCAATTCTCCTGCCAGAGCAGCTGGGAT 

TACGGGCACGTGCCACCATGCCCGGCTMTTTTTGTATTTTTAGTAGAGATAGGGTTTCA 

CCATGTTGGCCAGCCTGGTCTCGAACTCCTGACCTCAAGTGATCTGCCACCTCGGCCTCC 

CAAAGTGCCCGAGGTTACAGGCATGAGCCACTGCACCCGGCCTATTATTATTAACTTAGT 

GTTTGCTAAGTGCTTTATAGATACGGACTTGCTTAAATCTTATAATAAGTCCTGAAAGAT 

GGGGTGATAACCTCATTTAAGAAATT 

Sequence 2809 

AGGTACTGTAMTATTACCATTATTTAAAATGTTGACATTTCTGCATTAAGTAGAAACTT 

TCTAAATGCCTAAATACCACTCAAACATGACTTAAAAGAAATTGAATGACTCACCACTAT 

GACCTTCAAGAGTCTGATTCATAGAAAGGTTACTGGGGGCTGCAAGGCCCCTCAATTTTG 

CATCATCTGTCTGCGTCTCTAATTTCAAAACTTTCAGTAATCCATCTTCACCACCGCATG 

CT ATGAA CCCTTGTTCCTTGTTCCAGGATACACACTGCAGCTTCACGTTATTGGGAATGG 

AAATT7TCTTGCTCAGGTAGAAGAACATCGTGGGATCCCCGAGAGGGTCACGGCGGCCGC 
T 

Sequence 2810 

AGGTACGCGGGGGGCTCTGAGAGGAGTCTACCTTGCCTTCTTATGGGAAGGGAGACCCTA 
AAAMCTTTCTCCTCTTTGTCCTCCTTTTTCTCCCCCACTCTGAGGTTTCCCCAAGAGAA 
CCAGATTGGCAGGGAGAAGCATTGCGGGGCAATTGTTCCTCCTTGACAATGTAGCAATAA 
ATAGATGCTGCCAAGGGCAGAAAATGGGGAGGTTAGCTCAGAGCAGAGTAGTCTCTAGAG 




WO 01/070979 



PCT/US01/09126 



TABLE 1 

458/467 



AAAGGMGAATCCTCMCGGCACCCTGGGGTGCTAGCTCCTTTTTAGAATGTCAGCAGAG 
CTGAGATTMTATCTGGGCTTTTCCTGMCTATTCTGGTTATTGAGCCCTTCCTGTTAGA 
CCTACC 
Sequence 281 1 

GGCAGGTACI I I 1 1 1 1 I I 1 1 1 1 1 1 1 1 1 1 1 I GCAACATTGCCCTTTGATGTCCCCATGAGG 

GCCAGGCCCAGGCAGMCCCATCCCATTTTATCCTTAAACTCANAAGGAAATTTGTCTAA 

ATATTAAAGGATTAATATGGGAATAAAAAATGAACCTTAAACCCTGCCACTGATACACAA 

GCTGTCTNTCTTAGAGTTCMTGAACACTTCAGGAGAGTATTTCCAACAATATTTAGATA 

TTGGAATATCTAMTATTGTTGATTTAGATAACCACCCTAGATTTCTCACCACCCTAGAA 

CATTTAGTGGGGAGACATTCTTTTCTCCTTTTTCTGATMCTTGGTCAGMGTGATTGAC 

TGTGCAAATGGTATTT 

Sequence 2812 

NNCGGCGGCCGGGCGGGNACGCGGGNGAGCNNTACGAGGGNCAGGAANCCCAGANGCCGC 

NCGACCTGGAGNCAGCCTACAACGCNTTCAACCGCTGCCGGCANGCCCGAGCCCGGGGCA 

CTAGCCCNNGCACAGAAGGGCAGAGNCNGAGGCGANGGCNCCNGGNCCCCNGNCCGCCAC 

ACAGGCCTTTGGTNTTNTCACACAACNCACGGGGCGGCAGCCGCCNGAAAGNAGACNGNC 

CCCGGGGGCAGAACANNGGGGGCCGGGCCCCNCCCCACAANAAAGANGCNCNCCGACAAA 

AAAAAAAAAAAAAAAAAGNACCNCGGCCGCNCNAGAACNAGGGGANCCC 

Sequence 2813 

CGGTACCCMGCTTTTGGTTCCTTTAAGTGAGGGNTNMTTGNCGNCGNCTTTGGCCGTA 
ATTCAATGGGTCMTAGCTTGGTTTCCTGTGGTGNAAATTGGTAATCCCGCTTCAACAAA 
TTCCCACNCAACATACGNAGCCCG 
Sequence 2814 

CCGGGCAGGTACGTTCTTTTNGCTTTTCCTTTCNGTAAGATGGTCTTCAGAGCTNCTTAA 

ACACATTTAGAAAAAAGTTAAACCCCAAGACNCTTTGGGGATAGGTTAA7TTTAAGANGC 

CCAACTTTGGACTNGGATTAAAGGMNTACCTTAAAAANCCCNGGNAAAACAATTATTTT 

TTGGG 

Sequence 2815 

GGCGGCGGCCGAGGTACAGAGAAGCCATCANTTTAGAGGGCAGCANAAAACCAGAAGCCN 
GNTTTGATCCCTNAACACCAAGANGCCTNTAACAACANGNCACCAGCACCCCCAGGAAGG 
CCAAGGAGTCCCACAGAAAAACCTAGGGNNAGACCAA 
Sequence 2816 

GCGGCGGCCGAGGNACMGNAANCNCCTTTTTTGGGGGGGGGAAAAACCCCCCCNCCCCN 

NNNNCNACCCAGANGAGGAGGGTTTCNGCCCCCAGGGGAACANCNCCNAAAANCAGCNCG 

G CCNNNGNC GGGACCAGNGCCAGGNGGACAGCCAGNGNCCNGGCCAGANGAAAGGCNGCC 

GTTTTTTT1 iNTTNGGGGGGGNNCGGGGGGGGGCCCCCCCAAAAAAAANCAGCCACCACC 

AGGNGGGGNGGGGAGGA 

Sequence 2817 

AGGTACCCTGAGGTGCTCCGCTGGGGACTCTGCTCATTCTGGGGGTGCAGTTGACGGCTG 

GTCGTGATCTTTCCCCGTAATCTGTCCCCTCTTACGGAACCTAGTCTCCCGTTCTGGCCA 

TGGCCTTTCTTCTTGGACACTGCTTAGGANCCCAGAMGMGTATTGTTATCAAATTCTT 

AAAGCCTTAGGAAGAAAGTCNAGGGAGTGGGAAAACCAGGCTTCTGANAAAGAATACCTG 

NTTGGCCCACCTGNATCTTCCNAGGNCANCCACGGAANTCCCGGGCCCCTTCCAATCAGG 

NAAGGTCGGNAATCTCTGATGGTCNATCGGTTCNATGGCCAACCTGGCCAACCAGTTTGA 

AAAAAAA 

Sequence 2818 

CCGGGCAGGTACTGTTCCTGTTGGCCGAGTGGAGACTGGTGTTCTCAAACCCGGTATGGT 
GGTCACC TTTGC TCCAAGTCAACGTTACAACGGAAGTAAAATCTGTCGAAATGCCACCCA 
TTGAAAGCTTTTGAAGTGAAAGCTTCTTTCCTTGGGGACCAATGGTGGGCT 
Sequence 2819 

AGGTACI I I I I I I I I I M I I I I I I I I I GGGTTAGGATGGTTCTAACCTGATGGGTTGTGT 
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TTACAGTGGGGTTTCCCCCAMGGTTATCTTCCTGCTTTCCTCTGTAAATAGGGCAGTTG 

CTGCMCAGATTAMTACACTCGGGCCACCTGTGGGTTAGTGGGTCMGMTTTTTGACA 

GAAAGGCTATAGGCTGCCGGTGGCCCCCGTGTGTTTGGGTGAGCACCCCTAGAGCTATCC 

CTTTATTCACATT GACGA AGAGGTGGAATGGCTGTTCTAGGGAAGGTAGAACTAAGGCAG 

TTATGAGCAGGTGTTTTAATTCCTCTACTTGTTGGACCTCCCGTGGT 

Sequence 2820 

AGCTTATCGATACCCGTCGACCTTCGAGGGGGGGCCCCGGTACCCAGCTTTTTGGTCCCT 

TTMNTGNGGGNTAMTTGGCGCCGCTTTGGGCGNAANTCANTGGNCANTAGCTTGGTTN 

CNTGGTGGTGNAANTTGGTAATCCCGNCTTCNACANTTCCACACANANNACGAAGNCCGG 

GGAANCNTAAAGTGGNAAANCCTGG 

Sequence 2821 

C GGCC GCCCGGGCAGGTACCATCTCTTGGGAAAACCATGCTACCTCTTCTCTCTGTTCTC 

TATTTTGCCACTAGAGAAATAGAAAATAAGGCTGGGAGCAGTGGCTTATACCTGTAATCC 

CAGCACTTTGGGAGGCTGAGGCGGGAAGAATCACCTGAGGTCAAGAGTTTNAAGACCAGC 

CTTGACTAACANTGGTNGAAMCCCNCGTNTTTTTTNTTTAAAAAATACC 

Sequence 2822 

CCGCGGTGGCGGCCGAGGTACTCCAATCCGGGTGACAGAGGGAAGACTCTGTCTTAAAAA 

GMAAAAAAATC AATAGA AATCGGTTTTTATTTATTTATAGTATGTGCTTTGAGTGGTTC 

TCAATCAAATTGATTTTTCCCCCTTCCAGGGGATATTTGAAATATCTGGAGGCATTTTTG 

TTTGCCCCATCTTTGGGTATCATATTGGCATCTAGTAGGTAGAGGCCAGGGTTGTTGCTA 

AGCACCCTATAATCCACAGGACAGCACCCAAAAACAGTTACCCAGCCCAAAATGTCAGTA 

TTGTCAAGGTTGAGAAGCCTTMTTTAGATGTAATGTTAAGAGTTCAGTAAATTGGCCAG 

GCACGGTGGCTCACGCCTGTAATCTCCTAGCACTTTGGGAGGCAAAGGCAGGCAGATTGC 

CTGAGCTCANGAGTTCGAGATCACCCCGGGCAACAAGGTGAAACGCTCTGTCTACTAAAA 

TACAAATCC 

Sequence 2823 

CCGCGGTGGCGGCCGCCCGGGCNNGTACTCCTCTTGCTACCACCTTTGTTGCAGAAGATG 

AAGGGGAGAGGGAGCTTCTCCCTATGGCCTCATGGCTTCTTGTGAGACAGATCAGTCCAG 

CCAGATACAGAGCAAAGCAGCTTTGCATCACCGCGGGCCAGTTGCTGATGCCCAGCTTTA 

TGTCTAAAAAAAAAAAGTGGAAGCCTCAAGGGGGATGGAGGATAGCAAGAAGAATGGGTG 

CCTTGGCCCCAGAGGCATTGTAGGGAGAGGAAGACAATGTATCTCATCAGGGTTCTCAAC 

ATTATGAGATTATCACACATCAACTATCTTTGGAGGGGCTGAGTGATTGAGTTATGGCTC 

TGACTCCTCTCTGGGGGTGGAGAGTGAAAGATGACAAANAAGGCCATCTGTCCCCTAGGA 

GACACAGTTTGCAGTATAAGACAGGACANAAGAGAACAGAAAAACAAATNCAACTGGAAA 

AAAGGGGTGG 

Sequence 2824 

AGGTACCTTAGAGCAACATGCAAAGCTTCCCTNCTCAGCAATCCCAGGTTGGGGCCCCCT 

GTCTTCCTATCGTCCTACCCGTCACAACCACCACTGCAGGCTTCTGATGCTCTGTTTTCC 

TCCTCTGTCTCAGTTCAGTTGCTCTGAGTTAGAGAGGAGCTCTCTGGGACTGGAGCAAAT 

GACTGCACCGGCCCTCATGGGACTTCCATTTCAGTGGATTCAGGGGAAAGCAGCACGTGT 

TTTTGAGAGACCTTGGCAACGCAGGCGACCTTGACGTCGACCAAAANAAGAGAGATTTGG 

AATGNTTGTTTACAGCCTCCGGTCAAAAAAAAA 

Sequence 2825 

TTAGGGCGTTTGGAGCTCCCCGCGGTGGCGGCCGCCCNGTCAGGTACCANNNCTTAGCAN 

GGAANNTGGACAACANAAGCTNTAAATCCTCTTGCATCGNCACGNTNAATTTGCACTGAC 

CAATCTGTTGGCACAGTMCTGGTTATAAGCTAMTTTCTACATTTTGGCTACAAGTATN 

CCAAATNCACCTTTTAAAAAATCCTATGTNAGATGCCATCTGGTGTTAATGATTTGCACA 

CCC CTTAA ATTGAAANTATTNCAAATAAATCTNACGGATTTATATANNATNATTAATGNN 

TNTATTTTMAAAGACAATCTGANMTAACACTTCCCCTAATTGTTGTCTTAATAATGAC 

CAAGAGCTGNNGAAAAATNATTCACACTGNTACGTCGTTNTGTTGGTTTGCTCACGGGGG 

AAGGGGGGTTG 
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Sequence 2826 

GCG AATTT GGAGCNAAACCNCGGNGGCGGCCGAGGTACTTCGGGGAGAGNNTNTCCTNCC 

NTCATTTTGAACNCCAGCGGCCTCTTCCCC7TCCNGGGGCTGCTTGCCCTGGGAACNCTG 

GCACCTTGGGCTGNGGAAGGCTCTGGAAAGTCCTTCAAAGCTGGAGTCTGTCCTCCTAAG 

AAATCTGCCCAGTGCCTTAGANACAAGAAACCTGAGTGCCANAGTGACTGGCAGGGGCCA 

AGGGAAAAAAAAAAAAGTTGCCCTNANCCNTNGGGGGAAAAAAAAGNGCCCNGGAAACCC 

NGGGGGGNCCCCC 

Sequence 2827 

ACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCGGAACACTTGAGGTCATGAGTCTGA 

GACCAGCCTGGCCAATACGGAAAAGCCCCGTCTCTAACAAAAATACAAAAATACAAAAAT 

TGGGTGTGGTGGTGTGCACCTGTAGTCCCAGCTACTCAGAAGGCTGAAGTAGGAGAATCG 

CTTGAACCTGGGAGGCAGAGATTGCACCACTGCACTCCAGCCTGGGTGACAGAGAGACAC 

TCCATCTCAAAAAAAGAAAAGAAAGGAAAAAAAGAAAAGAAAAGAAATGTTGAGGCAAAT 

GMTATACMACACATTTTAGATTAGCATTTGAATTAGTAAACTGCATAAAAAAGATCCA 

ACATGAGCTGACATCATCCAATCCATTGAGGGCCCAAATAGAGCAAAAAGGCAGAGGAAG 

AGCAAATTC 

Sequence 2828 

CCGCGGTGGCGGC CGAGG TACAGAAAGAAGAGTATCCATTTCAACTACTAGGTTAACTGC 
CTTTGATGAGCTTGATTTTACCTGGGTCACTAATTCCACAGAACCAATGTAGGTGTCTGG 
GCGGAGCAAAATATG CTCCA ATTGTGTTTTCTTTTGATAGATTCTTTCAACAGACAGTCT 
TTTCTTAGCATCTTCATTTTTCTTTATmGTtGACTTG 

CAATGGTGACACTTCCATGGTGACGGTCGTGAAGGGGCTCAAGAACCCTGAAAGCGACTA 

AACAGGCAGGACCCCACGAGACCACCCCCGACCAAGCCGNTTCTCCACAGACGCGCGTCC 

CCCGCGTACCCGCCCG 

Sequence 2829 

CCGCGGTGGCGGCCGA GGTACTTTAAN I 1 I I I I I M 1 1 I I I I 1 I I I I A M I 1 1 I 11 1 I I i 

I 1 1 1 I I I 1 1 I I I i 1 I I I I I 1 1 I i ■ I I I I I I I I I I I I ■ I I I I I I I I I I ■ 1 1 1 I I I I I I ■ I 1 

I I I 1 1 I I I I I ! I I I I I I I I I I NGGGNNGNNNANNNTTNTTTTAAAAANGGNAAAAAAAAA 

ANNGGGGGGNGGNNNNNGNNNAAAANNANNNGNNNNGGGGNNAANCCCCCNNCCNNNNNG 

GGNMNCCCCNNNNN AANNN NNNmTTTNNGNGGNNNNNCCCNTTNTNAAANAANCGGGG 

TNAAAAAAAAAAAAAMTTTTNGGGGNNATTTTTTNNNGNNNCCCCCTNANCNNGGGGGG 

GNNTNTTTNCTNTTTTNANACCCCCGGGGGGG 

Sequence 2830 

CTACT TAGGGCGAATT GGAGCTCCCCGCGGTGGCGGCCGAGGTAC r 1)11)1 IGTTTTGT 

TTTGGI I I I I I I I I I I IGGCTTGACTCAGGATTTAAAAACTGGAACGGTGAAGGTGACAG 

CAGTCG GTTGGAG CGAGCATCCCCCAAAGTTCACAATGTGGCCGAGGACTTTGATTGCAC 

ATTGTTGTTTTTTTAATAGTCATTCCAMTATGAGATGCATTGTTACAGGAAGTCCCTTG 

CCATCCTAAAAGCCACCCCACTTNTNTNTAAGGAGAATGGCCCATTCNTTTCCAAGTTNC 

CNNANGGGGANAMANANCNNTTNTTTTCN^ 

CCCCCCCCCAAAAACCTTm 

CCNCI I I I I I I I I I I I I 1 1 I I CACNCCAAAAAAAAAAAAA 
Sequence 2831 

CCGCGGTGGCGGCCGATGT AC I I I I I M I II I I » II 1 1 I I I I I I | | I u | fl I I I ll I I I 

M I I M I II I I I I I I I I I 1 1 I I MCCCTTTGACTrflTTTTTAANGCCNCGGGNGGATNTC 

NTTGCCCANCTCCCANNMTGTTTGCCCTTNTAANTCTGTTCCACTTTTAGGNGGNAGCC 

ACCAGGCCTTANCCATCCCGGGTACCTNGGCCGTTTTAAAANNAGGGGNATCCCCCGGGN 

TNNANGAATTTNNAAANTNAAAGCTTNNNNGAAACCNNCCCNCNNTNGNGGGGGGGGCCC 

CGGGCNNNAAATTTTrTNTTTNNTTNAAANANGGGNAAAAAANGGNCCCCCTTGGGGNNA 

AAAAAAANGNAAAAAAN I II I I I I I I NTNGGNAAAAAAATI I I I I I P CCNNAAAAAAAT 

NNCAAAAAAAAAAAANAAANGGGNGGGAAAAAA 

Sequence 2832 
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CCGGGCAGGTACAAAGATTCCTCACTGCGTGCTAAGAAAACAGATCCAGGCCGGGCACGG 

GGGCTCACACCTATNANCCCAGCACTTTGGAAGGCTGAGGCGGGTGAATCACCTGAGATC 

AGGAGTGCGAGACCAGCCTGGCCAACATGGCAAAACCCTGTCTCTACTAAAAACACAAAA 

ATTTGCCGGGCATGGTGGCAGATGCCTGTAATCCCAGCTACTTGAGAGGCCAAGGCAAGG 

AGAMNTTGCTTTGAACCTGGGGMGGCCGMNGNTTGCMNTGAGCTTGAAGATTCGGC 

MCNMCTTGCACTTTCCAANNCCNTNGGGNTGACNAGGANGATAAGGACCTTCCNTTCT 

NCAAAAAMAAAAAAAAMGGGANAAAAAAAAAAAMGGGNTCCCTTTTNGGCCCGGTTT 

TTT ANAMC TTMGTGGGMTTCCCCCCNGGNCCTTGNGNGGGMTTTCGNNTNTNNNAA 

AGCTTTTTTCGAATTCCCCGCCCCANCCNTNNGNGGGGGGGGG 

Sequence 2833 

CTAATTGGAGCTCCCCGCGGTGGCGGCCGGGCAAGGTACTTTC I I I 1 1 II I 1 1 I 1 I 1 1 1 G 

AGATGMGTTTTGCTCTTGTTGCCCAGGATGGAGTGCAATGGTGCAATCTCAGCTCACTG 

CAACCTCCGCCTCCTGGGTTCAAGTGATTCTCCTGCCTCATCCTTCCTAGTAGCTGGGAT 

TACAGGTGCCCACCACCACACCCAGCTAATTT7TGTATTTTTAAGTAGAGAATGGGGTTC 

ACCATGTTNGGCCAGGCTGGTTTNAMCTCCTGACCTTAAGGNGAANCCCCCTTTGCCTT 

NGGCCNCCA AAAAGGG NTNGGNAATNANNAGGGGGGNNACNCNCCCCCTTNNNCNNANAA 

AAAGGGGGN 1 1 1 J I II I NTNTGGGGGGGGGGAAANATTTTNNAAGGGGGGGNGCCCCCCC 

NNC CCCTNA NAAAAAAAAANAAANCCCCCCCCCCGNGGGGANNAAAANNNTTTNTATAAA 

ANANi i i » ICCCCCCCCCCCCCNGGGGGGGGGGGCCCCCCCCCANCI I U I I 1 1 1 I I 1 I N 

TAGNAAGNGNNNCAANCCCCCCNAAAA 

Sequence 2834 

GGGGNAMCCCCGGNGGCGGCCGCCCGGTTTGGAACCNGGGTTNAAACCCCCGGNTTTNA 
ACCNCANAACCGCAAGANAACGGGNGNAAAAAAAGGGAAACANANCAGCNGTCCAAAGAA 
AACAAAANGNGGGCAAACC 
Sequence 2835 

GGTGGCGGCCGAGGTACTGATCATGGMCTCTCGGGGAGGAAATGATGTTTTCCTTCTAC 

CCATCTTATGTTCATTGGCTGGGGCTCCTGGAACAGAAGACAGATTTACAAAAGAGAAAG 

GCACACAAATTTATGTMTATAAGTTTTACATGACATGGGAGCCTTTATAAGGAAATGAC 

CCAAGGAAATGGTTAMCCTGAGTGGTTTTGNGTTAGGTTTGATGAGCAATGAAAAGCTA 

TGGAGAACTATGATAGGAGGAGTGTGAGCTAAACGCAATGAACTGGGGGAAACT 

Sequence 2836 

TACTTAGG GCGATTGGAGCTCCCCGCGGTGGCGGCCGAGGTAC i I 1 I I I I I H I 1 1 1 1 1 1 

rTTTTTI iCTCAAAACGGCCTTCCTTAGTCCTGTAAAACCTGGAATGCACATAGTCCTGT 

AATGGCCATCCCAGAAGTGAATCTACATGATTATTCAAATTCAAATACCTAANAAAAAAA 

ATCACTTGAAGTTNTGCAACTTCCCAAATGCAAATTCCTGANAAAGAAACTTGACTGAGC 

CAGTTCMCTTTCTGTATTGGATTANAAATCCTATCTTGCTGGCTGGTCTGTANATTGGG 

TTGCTCTTGAGTCAAAATGANCCTTTTTTTAAAGTTMGCCNCTTTTGCCCNTTNGGGGG 

GGGGCCCCNCT NGGGGGGG GNNAMAAMTMTTTNNCCCGGGGTTTTNTTTTTTTNCCC 

CCNAAAAANCTN I I II 1 1 I I ICNAANCCCCCCNNAAAAAAANNGGNCCCCCCCCCCCTNG 

NNGGGGGGGGGGGGGGGTNTTTTTNTCCNCCCCCCCCCNCAAAAA 

Sequence 2837 

CCGGGCAGGTAC I J I I I I I I I I II I I I I I I I ) 1 1 1 GGGTTTTTAGTAGAGACAGGGTTTC 

ACCATGTTGGTCAGGCTGGTCTTGACCTCCTGACCTCATGATCCACCGGCTTCGGCCTCC 

TGAAGTGCTAGGATTACAGGTGTGAGCCACTGCGCCCAGCCCTGAGAAATAGTTCTTCTA 

ATTGTCATCCAGTTTTCATCTGAGTCCTGTTGTTCTTTGGATATGTGCCCTTCCAGAGCA 

CAGCAGGGGTTGTTCAAGTCTTCCANAAAAGCAGCTCTTGTTTCTCCTCATGTGGTGGGA 

GTGGAGTCAGAGCGTGGCTCAGGCCCCACATTCTCAGCTGTTTGGATCTGGGGACTCGAA 

GTTTCTGGTGGTTACTTCTGAAAGTCTTTTCCAGGATMTTATTCTTGCTCGGTTTCTCT 

GCATCTCTGACAGGCTGGTTTNCCTGCTTCCCCGCTT 

Sequence 2838 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACI I 1 11 I I 1 1 I I I I 1 I I 
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TTCTTTTTTTCTCTGTGGAAAAAGAMGMTTTGACTTTATTTAGAAAGTCTACAAAATA 

CA GAAGAC GATAACTCGCTTGCTGTAAGTCAGGAAATAAATAAATTCTAGGAGCCGGGCA 

ATATTTTTTAAC I I I I I 1 I IGAGACAGGAGTTTGCTATGTTGCCCAGGCTGGAGTGCAGT 

GGTGCGATCTCAGCTCACTACAACCTCTGCCTTCTGGGCTTAAGTGATCCTCCTGCCTCA 

GCCTCCCMGTAGCTGGGACTACAGGCATGAGCCACCATGCATGGCTAATTTTTGTATTT 

TTTGTAGAGACGGGGTTTCACCATGTTGTCCAGGCTGGTCTCAA 

Sequence 2839 

AGGTACI 1 1 I I I I CI II I I II I I I II 1 1 I 1 1 GGAGACAGGGTGTTTATCTGTCACCCAGG 

CTAGAGTGCAGTGGCGGGATTACTGCTCACTGCAACCTCGACCTCCTGGGCTCAAGTGAT 

CCTCCCAGCT CAGCCTT CAAGAGTAGCTGGGACTGCAGACCTGCACCACCACGTCCAGCT 

GCCCGGTTAA I I I I I I ICTGTCGGTTTGAAGAGGGGAGAAGGTCTCACTATGTTGCCCAG 

GCTTGTCTCAAACTCCCGGGCTCAAGCAATCCTCCCACTGTTGGCGTCCCAAAGTGCTTG 

GGGTTACMGGTGTGAGCCACCACCACACTGGGGCTCTGCTCTGCCTTTCTGAGTTTTGG 

GTTTTCTGCTTATGGNGGGGGAGCTTTGTTCCCGTTCTTCCCCACAAAGAACCCAGGGAT 

GTGGCACAAGCTTCCCTGCCCGTTTTTCCTTTAACTTCAAGTTGGG 

Sequence 2840 

CCGGGCAGGTACAACTGGAAACAGCCACCGGAGAGAGAACTGTCTCGCCTTCGCCGGCTT 

TACCAGGGTCATCTCCAAGAAGAGAGTGGCCCCCCACCTGAGTCAATGCCCAAGATGCCC 

CCTAGAACACCAGCGGAAGCCTCCTCCACTGGGCAGACAGGCCCTCAGAGTGCTCTGTAG 

GAGCTGTAGACTGGGAAGAGAGGCCAGGCGTGGTGGCTCACTCCTGTAATCCCAGCACTT 

TGGGAAAGCCAAGGTGGGCCTTGATCACTTTGANTCCCAAGGAAGTTTTTGAGACCAGCC 

TTNGGCACCATGGTGAAMCCTTTGTCTTTACCAAAAAATACAAAAATTTAGCTGGGTGT 

GGTGGTGCACACCTGTAGTCTTAACTATTGGGGGAGGCTAAGGTAGGGATTCACTTTGAT 

TCCCMGGAGGCGGAGGGTTTTGCANTTGAGTTTGGCANTTCAACACCCCTTGCANTTNC 

AGCCTTGGGGTGGACAAGCTTAAAACCCTTTNTTTTCAAAAAAAAAA 

Sequence 2841 

CTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACCCATGTCAATGGAGTAAT 

GCTCCCAGGAGAGTTATGGCTGCTCTGCTATGTCATGCAGGTTGTTAGGGAAGTAGGGGA 

AAGCTAGAAGTTACAGGCCTTACCCAGCTCCCATGCAACCCAAAAGGCCAGTCTCACTCC 

CACCGTGCCCCACCCTGACAGCACCAAGTTTGTTTCCAGGCAGTGAGTGAGCAGGGCTGA 

GMCTTGTCCCAGGCTACCAGCCTGCCAGCTGAGAAAGAAAGCATGGCTTTTGCATCTTT 

TTG CCTG TTGAGTCTGCGCACTGGATTTATGCCCTCCCTCGAGTTTTGGCCGGGAGATTC 

ACGTTTTGGTTCMGTGGTTACCAAAGTTTACTTGGGGAGGTTTCCTTTTCCTTTGGGTC 

TT 

Sequence 2842 

CNAATT GGAGC TCCCCGCGGTGGCGGCCGCCCGGGCAGGGTA CI I II I M I I 1 1 I I I I 1 1 

TTTTGTATTTTTAGTAGAGACAGGGTTTCATCATGTTGGCCAGGCTGGTCTCGAATTCCT 

GACCTCAGGTGATCCGCCTGCCTTGACCTTCCAAAGTGCTGAAATTACAGGCATGAGCCA 

CGATGCCCAGCCTGAGGAACAGATTTCTATATGGCAAATAATAAAGGCAAATAAAAATTA 

ATGCTAAMTAGAATGAGGAMGTATTIsrTTTTNTTCACCAGAATGGTTGTNANCANAAAT 

GNTTTGCACCAGGGTGGNNTTNAAAAAMCCCCNCNCTTNNATTTTGGGTTNTNCCTGGG 

GGGNTNTNGGGNNCATNAAANACCNTTTTANNNTN^ 

Sequence 2843 

AGGTACTTTAACTTC I I I M II 1 1 I I I I I I GAGATGAGTTTTGCTCTTGTTGCCCAGACT 

GGAGTGCAATGGCGCAATCTCGGCTCACAACAACCTCTGCCACCCGGGTTCAAGCGATTC 

TCCTG CCTCG GCCTCCTGAGTAGCTGGGATTACAGGCATGCATCACTATGCCCAGCTAAG 

TTTGTATTTTTAGTAGAGACGGGGTTTCTCCATGTTGGTCAGGTTGGTNTCGAACTCCCG 

ACCTCAGGCGATCTGCCCGCCTCGGCCTCCCAAAGNGCTGGGATTACAAGTATGAGCCAC 

CGCACCCAGCCTAAGATCCAAGATTCTTATG7TTTCTCTCCTTGCCTTTTGGAACCTGCC 

CGGGGCGTGAGCGGCCCGCCGGGCCAGGTACCAACNAGAAACNCAAACACCTTGCNCAGT 

NTNTCNAGGCACCNTTTCCAAAMCCAAAATTTTGGANAAGGTGAAACNTTAACTTNATA 
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TNGGCTTAAATTTTAA 
Sequence 2844 

ACCGCGGTGGCGGCCGAGGTAC 1 MM 1 1NTTNTTT A I MUM I ATT A I 1 1 1 1 1 NTGGG 

GACGGAGNGNCCCTCTTGNTGCCCAGGCTGGAGCGCNATGGCGTGATCTTGGCTCACTGC 

AGCCTTCGNCTNCCGGGTATCAAGTGATTCTCCTGCCTCAGCCTCCCNAGTAGCTGGGAT 

TACAGGCATGCNTNNACCATGCCCAGCTMTTTTGTATTTTTTAGTANCAAAACCGGGGG 

TTTCACCCATATTGGGTCAGGCCTGGTCTCGAACTCCCAGACCTCAAGGGTGGATCCCGC 

CCACCCTCGGCCTTTCNCNAAACCTGCTGGGGAATTACCAAGGCCGTTNAAGNCCAACCC 

GNCGCCCTNGGCCANGGGGGACCTNATACTTCTTTTTTTAAA 

GGGGGGC^CACCACCCNTTATATTTNAAAAATTAGGTTNCCTTGCC 
Sequence 2845 

CCGGGCAGGTAC I M M M M I M M M M M M I M I CCTCATCTTTCTTTGATAATGN 

GNGMGCCMGGATAGGCAGGAACCCCMGTTCCNTNTNTCCTGGGATTGTCACAAAATT 

CCCCCTNTCCCCATCGCATCTAT^^ 

ATACCATGGGCTTTANAGGAATCATCGCTCACTGNGGGAACXJTTGCCCCCTCCCCAAGCC 

ACCCCACTGNGC CCATA AACGTGCTGCCTGTAACCAATTGTTCCTGTTGAATAACAATGC 

GAGCTGAGGGGCTTTTTNTGCCTGAGCTGCAAATANATTAGGCTGCTCCCTTTTATGTGN 

GCAAANACATTACCCAAAAGC 

Sequence 2846 

CCGCGGTGGCGGCCGCCCGAGCAGGTACCTCAGAATGTAACTGTATGTGGAGGTCTTTAA 

AGCAATGATTAAGTTAAAATAAGGCTATTAGAGTGGGGCCCTAACTCTAAATGACTGGAT 

TTATATGAAGGAGCTAGAAGAAGGACAGACACACCAGGGTCCTGTGTGCACAGAGGGACA 

ACCATGTGAAGAGGCAGCAAGAGGCCAACTGCAAGCCGAAGAGAGAGGTCTCAGGGGAGA 

CCAACCCTGCCAGCACTTTGATCTCAGGCTTCCAGCCCCCAGACCAGTGATAAAATAAAT 

TTCTGCTGTTTMGCCACCCTGTCTGTGGCATTTTGTTCTAGTGGCCCTAGCAA 

Sequence 2847 

CCNGGGGCGGCCGAGGTACCATNANGCTTGCAGGGGCTGAAGCATGGTTTGTCCANAACC 

CCAACCACCAGGTCTATCGNNCTCTTTCTGNCACCTTTTTNCTCTTTTTTCCTTCTNCCC 

TTGCACCTGAGGNCCTGGAAGGCCTTGATGAGGCCCAGCAAACAGGCATTCTCACAGCTG 

GGTTTATAGTCJTTGGGCCCCTTACTCAGTATCCTGGGAACCCTGGGCCAGGAAGTTAAC 

AGTGGTCAATCANAAANTNCTGAANAAAATCCCCCTCCCCCTG 

Sequence 2848 

CCCCCCGNG CNGGN NCANNTTTTGGGCNNTTTTTGGGTTTTCAGNANGGTTNGTGGAGNA 

TCCCCNGNGGTTTTTNNNAAACCCCCCCNCNNAACANAGACCAAANGGGGGGNNGNGNAG 

GGGGGNGGGGNCNNTNAGAGAGGNGGGGCGGGGGCAGCGGGGGGGNAAGNNGGNNGNNNA 

GGGGANGGGGAGGGGCNNNCCCCCANACNNGGAGGNGGAAGGGGAAAGAACGGCNAGGGG 

N AAAGGCCGGGGGGCCACCN AGCNCNGGGGGNNCN ACNGANGGGGGAAGG AACGGGGG AA 

ACCAAAGGCCCNNCCCCCACCCAANAANGGCCNGGGAAAAACCCACANNCNGGGNAAGGA 

NAAAGGGGCCNGANGGCGAAGGANGGCNCCCAAGCAGGAGGCGGGNNGGGGCCGCAACNG 

CCCCNAAAAAANGCCCCCGGGNNNACCCCAAGGGGGGANGGGGGNCANAAAAGNNGGGGG 

GGAAGGAACNGGGCCNNAGGGNGGCCCCCNGNNAGANGNGNNNNGGCGGAAANCCCANGC 

GCCNGGGGGGNNNANCCAAGGGGGNGGGA 

Sequence 2849 

GAGATGCAGTCGATTNCATACCTANTGGGTCCCANTCCTNNNTNNGGNCNGTTGNGAAGC 
CGGATAGTGACTGAGATCACTGGGTAGACCTTGTCCACCTTGGCATTCTTGTCTGCCAAG 
GTCCATGGCCCATGGGGATGGGGACAATTTGAGTGGG 
Sequence 2850 

GGCAGGAACTNTTCTTTNNTCTTTTTTTNTNAAGTNAGNGGTMTTTAAAAATCTGAMT 
ATAGGCTGGGCGTGGNGGCTTACGCCTGNAATCCCAGCACTTTGGGAGGCTGAAAGTTGG 
GGCNGGATTCATCTGAGCTCGGGAGTTCAGGGACCAGCCTGACCAACATAGAGAAACCCC 
GNCTCTACTAAAAATACAAAAATTAGCCANGGCGTGGTGNGCACCATGCCTGTAATCCCA 
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GCTGTTTCAGGGAAGGCCCNANGCCANGANAAATTTGCNTGGAACCCCGGGGAGGGCCGG 

GAANGTTTGTTGGATGGAGCCCCGMGATCANCCAATTTGNNNANCTNNCAMCTTTTGG 

GNTTGAACAAAGAAGCCNAAAAANTTCCCCAATTCTTTCATNAAAATACAANGAAAACNT 

TANANAAAAATTTTTTGAAAAATTAATGGGGACCCCNTGGGAAANGAGGCCCNNTAAAAA 

AAAAAAAAAA 

Sequence 2851 

CNANCGGAAGGGGCCCCNGGGGGGGGGGGANAAGGGCAAAACCCNGGNCTNAGCAACCAC 

ANGGGGGGGCCGGAGCCNNGGGGGGCCAGGACCAGGGGAGAGGGAAGCCCCAGCCCNGAG 

GCGNNCGCACCACNCNNAGCACCAGCGCACCAANACNGCAGACGAGGAAGGAAGCACAAG 

CNCCCCACNNNACAAAGGGGAAACCGAGGCGCGGGNAGCGCGGNCCCGGGGACGGCCGCA 

CAAGAACNAGGGGGACCCCCCCGGGCGGCAGGAGNGCCGGNNAGAAGCCAAAACGAANCC 

GGCGAACCNGGGAGGGGGGG 

Sequence 2852 

AGGTACGCGGGGATGCGCAGTCGTGAGTCCTCTTGTCCTTGAGCGTCAACCTTCTTTCCC 

TGAAGTGGCTG GGGTTC CTGTTTCCTTCTTTGATTGACAACTTGTGTTAACCCTCGCACA 

TCTCTGGGCCAATTTTTGCTTGAAAATGGCAGCTCCCGAGCAGCCGCTTGCGATATCAAG 

GGGATGCACGAGCTCCTCCTCGCTTTCCCCGCCTCGGGGCGACCGAACCCTTCTGGTCAG 

GCACCTGCCGGCTGAGCTTACTGCTGAGGAGAAAGAGGACTTGCCCGAAGTACCTGCCCG 

GGCGGCCGCTCTAGAACTAG 

Sequence 2853 

CGGNGGCGGCCGCCCGGGNATGGTACCCTCTGTCACGGCTTCCTTTTNCTGGAAAAGGGA 

ATTTCCCAACCCCGGGTGAGGCAATGCCCCGCCCTGCTCCGTGGGCTGCACCTGCTGTCT 

GTCAAGCCCCAATGAGATGAACCCTGTACGCGGGGGCCTGGGATCTCAAAATGGCGGCCC 

CGTGCGGAAACAGCGTNTGGGAGCANNCATGTTGCCTNCTGAACAAAGCCGTTGAAGATG 

AAGAATGGGCAAAATCGCCCCATACGGAACAAGCGCANCCTNGGGAGCCCGATACCTGGC 

NNGCGAACACCAAACGGGAGAATTTCGCCAATATGGATGTGACAGCGGTTCCCATTAAAG 

CGGTGATAGGGATTTTT 

Sequence 2854 

CCGGGCAGGTACGCGGGGTGGGCATTCTGGGTAACAGAGCTATTTACTTCCTGCGGGTGC 

ACAGGCTGTGGTCGTCTATCTCCCTGTTGTTCTTCCCATCGGACGAAGATGGCCCTGGAG 

ACGGTGCCGAAGGACCTGCNGGCATCTGCGGGCCTGTTTGCTTGTGTTCGCGTGGTCAAG 

ACTAGTACCACCAGNTTTAGAATATGATGGCTTGTNGACAACATGTTGATGTCATATGNT 

ATCAAAATCG AAAC GGNGTCANCTCCGAAGAAGGATGGGTTATTATTGACTTGCACCTTA 

GCCTNTTCGCTTTTGCATGGGGATANCCATTTNGCTCATTNATGAAGTTNCCATGTANTG 

TACAGGCCTTGGGGIsrrCNTTTCAMGGTNTNNNMNCTGCAGNTCCAGTTAAAACCTTT 

Sequence 2855 

CTTTCATGTGATCTTTGTGGCAGTGGGACAGGAAGTAGGCGCGGGCCCTCAGGTTCTCCC 

TATCGAAGCGGTCTATGGAGATAGTTGGATACTCGGCCATCTGCCCCTCGAAAGAACTCA 

TAGCGCCGTCGATCCCAGAGTCCGGGACCCCAAAACCGCAGCTGAAGCCAAGGCCAGCCC 

TGACNCGCCCCCGCGTACCTCGGCCGCTCTAGAACTAGTGGGATTCCCCCGGGCTGCAGG 

GAAT TNGATA TCAAGCTTATCTGATACCGACCGACCTTCNAGGGGGGGGCCCGGTTACCC 

MGCTTTTTTGTTCCCTTATAGTGGAGGGmAMTTTGCGCCGCTTGGGC 

Sequence 2856 

GGGCGATTTGGAGCAAACCCCGGGGGCGGCCGCCCGGNTTGGTACCAAAANTNCAAACNA 

CCANTTTNGAANCCGGCGNNGACGNNNGCGGNCCNAGCTACTCTGGAGGCTGAGGNGGGA 

GGANCGCTNGAGNCTGGGAGGCAGAAGTTGCAGNAAGCCGAGATCATGCCACTGCACAAG 

CTAGGTG ACAG AATGAGACTCTGTCTCAAAAATAATTAAAAAGCCTCTGCCCCAAACTCG 

TTAAMGATTTTATAACCACAACTGCTGNTTCTGNGNAGATGCATCTGCATGCCCAGGAG 

CAGTAAATGCAATAAAANCATTTGGNTATACTTTGAACACAAAATAAACGGGTGAGGCTT 

TTACTTTCGAAAAAAAAAAAAAAAANAAGGGNCCTCGGCCGCTCTAAAACTAGGGGGANC 

CCCCGGGCCGCANGGAAATCGATA 
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Sequence 2857 

ACATTMTTTGCGTTTGCGCTCACCTGCCCGCTTTTCCAGTCCGGGGAAACCTTGNCGTG 
CCAGCTTGCATTTAATTGNAATTCGNGCCCAACCGCTGCNGTNGGAGAGGCCGGCTNTTG 
CCGTATTTGGGGCGCCTCTTCNCGCTTTNCTCGGCTTCACTTGACTC 
Sequence 2858. 

TACTTAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACTTGTCCTCAGCTTGGG 

CTTCTTCCTCCTNCATCACCTGAAACACTGGACCTGGGGGTAGCCCCGCCCCAGCCCTCA 

GTCACCCCCACTTCCCACTTGCAGTCTTGTAGCTAGAACTTCTCTAAGCCTATACGTTTC 

TGTGGAGTAAATATTGGGATTGGGGGGAAAGAGGGAGCAACGGCCCATAGCCTTGGGGTT 

GGACATCTCTAGTGTAGCTGCCACATTGATTTTTCTATAATCACTTGGGGTTTGTACCTG 

CCCG GACACATCCAGTAGGCTMGGGGATGCTTTCCTTTTCTGGGGTTTTTCGGGGGGTT 

TTTTGGAGCGGGGAGAGGGATGMNGAGGTGCTCCCCTTMTTTCTTTATTGAGAATGAT 

GCCGTGGATACTTGAATTTAAGCANTTGTCACATGGGCAGTGTCCTACCTGGGG 

Sequence 2859 

AG GGCG ATTTGGAGCTCCCCGCGGTGGCGGCCGGGCANNTACTTTGCGGTTTTTGGGACT 

TGATTTTNGCAGAGGGATCGGGCACTGAAGGTGCAGTTCTCAAAATCACACCTGNAGGCT 

GGCTCCTCGCTGTGGGTATCCAGGTGCTTCTGGAGGTCAATAAGATTCTTGCAGCTGTAG 

TCACAACAGTCACATTTAAAGGGCCGGTCCTCACTGTGACGAAAGCGCATGTGGTTGCGG 

AGGGAGGAAGGCAGCGGGCAGGTCATGTCACACAGAGGGCACTTATAGTGATTCACATGG 

TTGCGCA 

Sequence 2860 

ATGCGTTGCNGCTCACTGCCCCGCTTTCCAGTCGTGGAAAACNCTGTTCGTGCCCAGCCT 
GCATTTAATGGAMTCGGCCAAACCGCCNCCGGGGGAGGAGGGCCGGTTTTGCCGTATTT 
GGGGNCGCTTCTTCCCGCTTCCTTCGCTCAACTGGACTTCGCTTGCGGCTNCGGGGTTCG 
NTTCCGGGCTTGCTGGGCCGAGGCCGGGTATTTCANCCTTCAACTTCAAAAG 
Sequence 2861 

CCCTAGGGCGTTTTGGAGCTNCCCCCGGTGGCGGCCGAGGTACI ITTTTTTTTTTTTTTN 

CCCGGAGNTTTNTAANAAGATTTATTTAGCAAAAATACATATAGCCATTATTGCAAGACT 

TAAATGAGA TGNT AAATGTTCAACCCAATTTTCTTTCCTGGATMGTTTTTCTTTC 

CCCTGTCAGTTTTGAAAACATAATACCAGAAGAAGGGGGGCCCAATTCCACAGAGAGCTC 

CCAAGMTGAGYTTCTGGGAGTGAGTCTGAAGTTGAGATAAACCTTTGCTGATCTTGCTT 

ACGTTCAATGCATCTGGGCAGCGTCTTTGATGAGCCCTGGCGGTTAGGCTGGTGGCACTG 

AAGCAGGCCTCCAGGGTCTCCTGTTTAAGCAGGATTTTAAGGCCAACCCTGC 

Sequence 2862 

CATCTATAGGGCGAATTGGAGCTCCCCGCGGTGGCGGCCGCCCGGGCAGGTACTTTGGGA 

GGCTGACGCAGGAGGACCGCTTGAGCTCAGGAGTTCAAGACCAGCCTGAGCACCATAGTG 

AGACCTCATCTCTACTAAAAAAAAAAATAAAATACCAGGCATGGTAGCATGTGCCTGTAG 

TCCCAGCTACTCTAGTCCCAGCTACTTGGGAGGCTGAGGTGAGAGGATCAC7TGAGCCCA 

GGAGATCGAGGCTGCAGTGAGCCATTATCACGCCACTGCACTCCAGCCTGGGCAACTAAG 

CMGACCCTGTCTCAAAAAAATTTTAMAAATTTAAAAAATAAGAAAATCCAAGCTAGGT 

TGAAATCTGAATGTTGAGCAGNTCAGTGAGGCACAAACTTAGCTTAAGAAAGTCAACCCT 

GCCCACTTGCCATTTTGAAGGTTATTACTAGCCAAAATTACN 

Sequence 2863 

TA TAGG GCGAATTGGAGCTCCCCGCGGTGGCGGCCGAGGTACAGAAAATTAGCAAGGAGA 

CATTT TCTG CATT GTGAGAAATCAACATAGACACCTTAAAGACCCCTTTGAGAGTGTGGC 

mm iGMCTTTTCAGATTTTGCTCAGTGACCTGCTAACACTTACGTGAGAGGCTCCAGG 

TGTAAATAGAATCTAATGGCAGAATCTGTAAGTGTAAACAAGCATCTTAGGAGTGAGAGA 

TCAAGACCACAAAATGTCCAGAGCTATGACCACAGCTATACCTACCCATAAAATACGATA 

CTGGAGTAGGGTATTTTTGTCTTTTTTCTTACCTAAGAGCTAGCTAATCAGGACAGGTGA 

TGGCAGGTTCTGGAGCTCTACCAGGGCAGGTCTATTTTCTTTTTT 
Sequence 2864 
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AGGT ACAGCTCTTCTCTTGTCTTTCTGAGCTCTTACCAGAATTGCCTTCCGTGTCTTTTT 

TTTTGGAGATGGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGCGCAATCTCAGCT 

CACTGCAACCTNTGCCTCCCTGGTTCAAGCGATTCTCCTGCCTCAGCCTCCCGAGGGGCT 

GGGACTACGGGTGCACGCCACCATGCCTGGCTMTTTTTTGTATTTTAGTAGCGACAGGG 

TTTCACTGTGTTGCCCAGGCTGGCCTTGAACTCCTGAGCTCGGGCAATCTGCCCGCCTNA 

TTCTCCCAACATGCTAGGATTATAGGCATGAGCCACCACGCCCTGCGGTGTCCATATTTC 

TAACAGCAGTCACTTCAGAGCCATCTAGGTTTCTTCTAACATGCACCTAAAAACTCTTTG 

GCCTNTACTACCTAGNTNTGAAACCATTTT 

Sequence 2865 

CGGCCGCCGGGCAGGTACAGNTGCATCANCTGCTCGTAGGACATGTCCAGCAGCTGGTCG 
AGGTCCACGCCGCGGTAGGTGMCTTGCGGAAGGTCCGCTTCTTCTTCTGCTCTACTTCT 
GCCACCCGCGTACCACGGCTATCCTTATAGC I I I I I A AT 
Sequence 2866 

CGCCCGGGCAGGTACTTTCCCCTCCCCCAGAAAAGCAGGACTTGCCCGCTAAGGGTGAAG 

GACCAAGGCAGGCGTCCCTGAGTGGTCTGACACCTTTGAAACGTGAGTGAATAATCAGAG 

AGGTGT CCCT GCAATGATTAAACACCAAGGAAAGGCTGCCTTCCCAGTCTGTGACCAGCG 

CCAGAGTTTTGGGTCCACGGATAAAACGTGTCTCGTTTGTCTCTACCAGAAAATGAAAGG 

AATTGAAATTAAGAGAAGGGAGAGATTAAAAAAAAAAAAAGAAAAAAAAAAAGTACCT 

Sequence 2867 

CCGCGGTGGCGGCNCGCCCGGGCAGGTACTTTCCCCTTCCCCAGAAAAGCGGGACTTGCC 

GCTAAGGGTGAAGGACCAAGGCGGGCATCCCTGCATGGTCTGACACCCTTGAAACGTGGG 

TGAATAATCAGAGAGGTGTCCCTGCAATGATTAAACAGCAAGGGAAGGCTGCCTTCCCAG 

TCCGTGACCAGCGCTGGAGTTTTGGGTCCACAGATAAAACGTGTCTCCTTTGTCTCTACC 

AGAAAATGAAAGGAATTGAAACTAANAGAAGGGAGAGATTAAAAAAAAAAAAGAAAAAAA 

AAAAGTACCTCGGCCCGCTCTAGAACTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnn 

nnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnTGTTCCCTT 

TAAGTGAGGGTTAnnnnnnnnnnnnnnnnnnnnnnnnnnnnnnAGCTGTTTCCTGTGTGA 

AAATTGTTATCCCGCTCACAATTCCACACAAACATACGAACCCGGGAGCATAAAGTGTAA 

AAGCCTGGGGGTG 

Sequence 2868 , 

CCGCGGTGGCGGC CGAGG TACAAATTGTCG 1 I 1 I \ ATTCCTCTTATTGGG ATATCATTTT 

MAMCTTTATTGGGTTTTTATTGTTGTTGTTTGATCCCTMCCCTACAAAGAGCCTTCC 

TATTCCCCTCGCTGTTGGAGCAAACCATTATACCTTACTTCCAGCAAGCAAAGTGCTTTG 

ACTTCTTGCTTCAGTCATCAGCCAGCAAGAGGGAACAAAACTGTTCTTTTGCATTTTGCC 

GCTGAGATATGGCATTGCACTGCTTATATGCCAAGCTMTtTATAGCAAGATATTGATCA 

AATATAGAAAGTTGATATTCAACCTCACAAGGGCTCTCAAAGTATAATCTTTCTATAGCC 

MCTGCTMTGCAAATTAAAACATATrrCATTTTAACATGATTTCAAMTCAGTTTTTCA 

TACTACCCTTTGCTGGAAGAAACTAAAAATATAGCAAATGCAGAACCACAAACAATTCGA 

ATGGGGTAGAAACATTGTAAATATTTACTCTTTGCAMCCCTGGTGGTATTTTATTTTGG 

CTTCATTTCAATCATTGAAGTATATTCTTATTGGAAATGTACCTGCCCGGGCGGGCGCTC 

TAGAACTAGTG 

Sequence 2669 

GCGMTTGNAGC TCCCCG CGGTGGCGGCCGAGGTCTMTTTGAATTTGTAATGAGTCTGA 

TGGTATATTTCAATTTTTTGCTTTGAGGGACTGGCTGCTACATTGCAGMTATCTTATAT 

CCCTGACTGCTTTCCACTAAATGTCAGTGGTGACCCCAATCCAATATTATGACAACTGAA 

CATGCTTATGCATCCCTCATGCCTTTAi I I I I I ATTTTGGGAAATCTTTCAGCTTCAGTT 

TTTGCTGATATTTATGTGATTCTTTGTTCTGCMTTCAMTTTCTGGGAGCCAAACAGTC 

TCCTTGGTTCAGATTACTGTTTTTTGACTAGAGCTTCTCGCTTCAGATTCTGTCATAAGA 

TTATGGCTTAACCTATGGTTGTCCTTTGATTTGGTGCCATATGAMTAAAACATTATTTT 

CTATGGCTATGTATTAAGAATTTTGTGCAATTCTGTTTTTCTTAGAAGGCTGAGGGTGTG 

TTGTCAGACACCATGACTGATGTGACAGGTGTATTTTATTATGC 
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Sequence 2870 

ATAGGGCGMTTGGACTCCCCGCGGTGGCGGCCGAGTCCTACCCTTTACTTTTTCCCCAA 

GACCATCTCAGGGTGGAGCATTCTGTCTAAGAGAAGAAAGATAAGGAGGCTCCCACCCAC 

CTCTCCCAAGAGCAGACATTAMCATCTTTGTGCTTTGAAGAGAGTGAATTTGGATAGTC 

TTGTGATTCTCMGACTAACTTCCAGAATTATACTTTAACCCCTTCCAGATATGGTCCGC 

CTTTGGCATTGTGTGTACCTGTGATGGGGCGTGTGGTTTCCGGTTGTCTCACCTTTAATT 

GTCMCCTCCAGTGTATGACTCTAGAAATATGAGGAAMGCTTTTCAGTTTTTAAAATTG 

CCATTTAAATTTAGTCTATTAAAMCMACCTAGAGGTCTTGGGTTGCAGTTGATTTCAG 

AGTATATTAATTTAGTGGGTCCCNAMGTATTACATNTATTTATATTCTGGAATGAAAAG 

G 

Sequence 2871 

CCGCGGTGGCGGCCGAGGTACTCCTTCGTAAACCATGGAGAGCCAGCCCAATGCACAGCA 

GTGGATATCATCTTTCTCAGAGTCCAGTATCACAGAATCACGACTTTGTCCAGCTGCAGG 

TGCCTGCAGGTCACACTGGCTMCTACTTCTGTGATGGGCTCTTCTTTCTGAGGTTCTGC 

CMCTTGTCTACTACATAGGGTTGATCATCCTGTTCAGGAMTATTTCTTTCATTTGCTC 

TGAGCTTMTATTGTMTTTGTATTTGATCTGCTGGGTCTTTGGAGTCAGGACTGGTTTT 

ATCAGCAGTTTGATCTTCTGAGGTCTGGTATGTAGTTTGCTGGCCCACAGAACCTTCACG 

TGTATTCACAGCCTCAATGCCATAAGGAAACTCTTT 
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Sequence 2872 

CCCTTAGCGTGGTCGCGGCCGANGTACAAGGTGAACCCTGAACATTCTGTGTCAGAAAGC 
TAANGAAGTGCCCAGAGAATGATAAATCAATCAAAAGAACAAAGCGACAAGCATGAAAGG 
CCAAATA TGAGATMTTTGGGAATTTAAAAAATGAACATAATGGATCCGTAACACATTGA 
A l l n l l AAAATTCTTGAGCCTGTMTGATACTGGAGGAAGGAGTCTTTTGTTTAC 
Sequence 2873 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGTGGGGATATTTTGTCTCACAGATTG 

TAAGAAAGGGGTATGGAAATCCCCAGGCAACAGTGTGCTCCTCAGCTTGCTGAAACAGAC 

CAMGACTA TGTT CTAATCAMCCTTCCAGGMTCTCATGGAMTTTCATTTAATGCCTC 

TCCAGGCACTTTTCTGAAAGCCCCCACGTTAGGGATGTCTTGGCTAAGACATCTCTCATG 

GTATCCACAGCAACCCTGATGAAGCTCATTTCTGGAGAAGAAGAAAGTCTCTCAACACCT 

CTGCTAAGTCATCATTCTCCCATNCTCACTGCAGCAGTTCCTGGAATCTCATGAAGGGAA 

GCTTGGGACCCACCGCACACCCTNTGCAAATACCTCACAAGTAAACTTGGNAAATGCCG 

Sequence 2874 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTCCTAGCCACAGTAGTAATCACCACTGGCCT 

GACTGAGCCCTCACCCTTTATACAGTGTCTCCTGCCACCCTCCTGGGAGAGGCTGTTCTC 

GGCACAGCTGGCCTGGGGTCACACAGCTGGTAGGTGTAAAGCAGGCATTGGAGTCCAGGT 

AGTCTCACTCCGTAGCCTGTCTCTTTAGCCACTGGAAATGTAGAGCAAAGCGAGAATTGT 

CCAAAGAGATAAGCTAATAAAGAGGAAAACAGGCTGGGTGCAATGGCTCATGCCTGTAAT 

CCCAGCACTTTGGGAGGCCAAGGA 

Sequence 2875 

CCCTTAGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGGTCA 

CCACACTCTACAAAGGCAGTCANCTNCNTGACACATTCCGCTTCTGCCTGGTCACCAACT 

TGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCCA 

GCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGCT 

CCACCTACCAGTTGGTGGACATCCATGTGACAAGAAATGGGAGTCATNAGTTTATCAACC 

MCMGCAGCTCCAGCACCCMGCACTTCTACCTTGAATTTCACCATCACCAACCTACCA 

TATTCCCAGGACAAAAAGCCCAGCC 

Sequence 2876 

CCCTTAGCGTGGTCQCGGCCCGAGGTACAAATAGAGCTGGAATGATGATGATCACAATGA 

AGATGACAACAGTGGTGGCAATGACACGGCTGCCACTACTTGGAGTGGCCTCGGAGTTTA 

ATATCAAAGGCTGGCCTGGGTTTGAGCTGGTTGAGGGAGAAGTGAGTGTATTGCATACAT 

GAGMMCCTGAGCAAAGGAAAGATGAGACTCAAATGGCAGCACTAAGACCATTTTGCAC 

TGGAGTATCAGATATATTTTGAAAAGMGACTGTAGAGTGACTTTTTGAAAGACATGGGG 

GATTAAGCAGGTAAAGAGAAGCTTGMGTGGTATTTAGAGATGCTACATTTAGTGACTTC 

AGAAGGTATAAACTGAAGCAGCAATGTAGATTAATATTCAAGATATGGGGATAAAAGGGT 

GACTAAAGTGGGATCCAAAGCAAATGGGTGCCCGAAAAACCANTGAAAGAAGAACTTGAT 

GGAGGCTATGATCTAATTTACATTTAAGGGGAATGAGGAAAACCAGTGGGAAGAATTTGC 

CACCCTTGAAAMTTAAATACTGGAAGTTCTGACCCAAAGAAATTAGGGTTAAAAAAAAG 

AMTTAA AAN G CCTTTCTTTTTGG A 

Sequence 2877 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTCCTTCAGTGAGTTGTTGGCAACTAGGCAGTG 

AAATGAGGGAGTGACCGCTCATGGCTCATGAACTTTGCAAGTTGTGCAACTCAACTGTCC 

CACCTTGAAAGGGCTCAGGMGGTAMGAGTGTTGTTTTGGGACACAGGGATTGTTGCAT 

TTAMTCTTTGTGACTGTGGTTTGGTCTCAGACTGGTTTTTCATGCATAAGTTCTTCTTA 

CCCTCAGGCATCTGGGATTTTACCTGTTTTTCTTCCAGTGGTGTGCCCTCTCCGCCACAT 

TGTGGAAGCGTGCTTTCTCAATCCTTCAGGAAGAAATTCAAAGACTTTN I I I I M M 

Sequence 2878 

CCCTTAGCG TGGT CGCGGCCGAGGTACATTATTTAGTGTAATGCCTAATGTTAGTCCTCT 

TTATAGCATATTTTATACMTTTCTATTATTTAACAGGCCTTGGGAATTCAAAGATGAAA 

TGTGTGTCCCTCAAGTAACTCACTAATCTATTGTTATAAGTAGATTGTAATTCATGAGTG 
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CTGTAACAGCTATCCTATGACAGCACTGAAGAGGATGTGGACTTCCTAGATGAATATGTA 
TTGAGTTGTAGGGGAGTAAGAATTCACCTACCAGTGAACTAGTGGACCTGAAGTGGTATT 
CCAGATTG 
Sequence 2879 

CCCTTAGCGTGGTCG CGGCC GAGGTACAGTTCTCAGTTTTTCTTATAGGAGAAATATGGT 

ATATGTTTATMGAATCTTTTATGAGATTATAGATTTCAATGCTGTGGATAGTGTCTTGC 

ACCCAAACMGAAAGTCCATMTGGAATGATCTTCCCTCAGCTTCCTATCGATTTAGTTA 

CCTCTTGAMGCACAAAAATTAAAACATTGCCATATGTTGMTTTTTAAAAAGCACTTGG 

AGTGAGCGMCATTTCCTGATAMTGCCTTTTAGAGATAGGTTCTTGATATTCAGACATC 

TGCAGA MTGTT CTGGTTCCCAAAGTCATTTCACTTCGAAATAAAACACAGCTCCTTCAA 

ACAGCACTTTTTCCACATAAATCTAAGTTGCCTCTCCCTGTGGACATTCAGAACTGATAG 

MCAMCACTACTCTTTTGAATTTGATGGTTCGTGTCCTTTAAAGTGTTTGAGGACCTAT 

GCAGAGCCTGTACACTTGGGGTAGTACCTGCCCGGGCGGCCGCTCGAA 

Sequence 2880 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCAGGACACAGACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTT TMGA GGGATTATATTTCCTCCTTCCTCCCATTCTCCCTTCATTTCTTCCTAC 

CTCCCTCTTTTCATCCTCTTCTCCTTTTATCTTTTCATCATTCCT^ 

CTTAGTTACTACTATGTTCTAAGCACTG 

Sequence 2881 

CCCT TTGAGCGGCCGCCCGGGCAGGTACAAAATTGCTMTTTTATTCCAGTGATGATTAT 

TTTTMCTTTGGTGATTACGTATCAAATTAAGGTAATTCTATAAAAATCTACAGCCAGTA 

MCTGAAACTATAGCTAAAACTGTAGCTACTGTTTTTATATTTTCTTAGATGCATGTTGT 

TTTATTATMTAGTTTAATGT AGTGGT TGMTGGMTATGTGGTTTTCTTCTAGAATACC 

TTATGTGATTTATTCTATGACTTTTTTCCTTTGMCCCTTATGACTCTTAGTATGTAAAT 

TTTCTTAAATTATACTCTTTTGTTGTTTATTCTAAACTACTTGAGGCAGATCTCAAAAGT 

TGMTAATTACTGTGCATCTCAGAGAATTGTCTTGTGTCTTTCAMTATATTTAGTTGTC 

TTTAGCAGTMCTCTTACATGGTTATrTGCATGACTTAATAAACATTTCAATGCCCCCAA 

GTTACCTAGCAGAAAAAGTCCACTTTTAMGACATCTGTMGAGAGAACTTTATTTTTCT 

TAAATTGGAAGCACATCATTTAGTCCT 

Sequence 2882 

GTACACAGCAAATACAATCTTATGTCAGTTATAMGAATMTAAATMTTTTTTGAAAAG 

CAMGTGACTTGTCCATMCCACMTATTTTTCATTTTCAMTCTCATTm 

TGCCACATTGCCTATAATTATACAGCTAACAATTATCCCATGCTCAGAAGAAATAAAACT 

GACGTAGTTAGTAATCAGTNATATATCATTCCCTTAAAACAAGTATTGCTGACTTAATAT 

GCATAAAACTGCTGTCACATACCAAATATGACCAACAGGTAAAGGGTGAGAACAATTTCA 

CTGACTACAATATTTATAGCAAAATAAAGTGTCTATATAGTGGGAAAGAAATATGCTTAA 

AAAGAAAATAGGGAAAATGCCCAGTAGAAGAAATAAACTAAATTATGCAACCTGCCCTTA 

AAMAAAAGGCCTTTTATTGGTATTTCAACCTACCCAGAATT 

Sequence 2883 

CCCTTAGCGGCCGCCCGGGCAGGTAC I I I I I I I I I I II I II I I I I II I I I I 1 1 I GCGTGT 

GANACGMGTTTNNNTNTTGTTGCCCAGGCTGGAGTTTGACCCACAAAATMGGTTCCCA 

NATCACCATCAACTAAAGTTCAAAATAGACAAACCCTATCCATCCTANCCAGNGATTATT 

TAGTATTTGNGGAAAGCCTNTAAATTTAAAGGGTCCACGAACAAAGACNGAGACNAAAGA 

ATNAGAGCNACATGTAGGAAAAAGAGATTTACACAGACTNTAAAACATTAACAAAACCAA 

GATTTGTATTACCAGAAAAAGAGTTNTTTTGTNTMGAAACAAAMTAAGNTGNTNTNGG 

GAAAAAAAAAAAAAACAGNTGGAAGCTTAAAAGGAAACTTAAGGAAACCTNCCNGAAAGA 

AGAATNAAATGNGNGAAATNGAAATAGGGAGAGAAAACTTNAAAGCCCAAAAATAAAAAT 

TGAAATGACNCATCNTTTTNNGGAACCCAAAAAAAAAAGAAAAAAA 

Sequence 2884 



# 
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CCCTTAGCGTGGTCGCGACCGAGGTACAGCAGGCCTAATGATAAGACAGCCTACCTGGGC 

TCCAGCCTGCCTGTMCTGTGTGAACTTAGCMGTCCTTCACTCAGTCAGACTTTGCTTT 

CTACATCTGAAAAAGTCAAAAGAAAGAAAATAAAAACCTCAGCCTACCTCCCTCTCAGGG 

TTGTTACAAGGGGGAAAAGAGCATAAAACAGATGCAAGAAGTGCTTCCTTAAAGAGTTCA 

ATATGMTTGTTTTTCTGTTTAGGTTATGTTAMTCTATTTCATATTATTMTCCCTTCC 

AGATGACACCCATCAAACAAACACCCAAATGGTCCATGTCTGAAAGACCACAGACTTTTC 

TTTTGGGAAAAAAAAAAA 

Sequence 2885 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTN I I I 1 M M 1 I I I I I I I CI I 1 I I I GAGA 

CAGTGTCTC GCACT GTCACCCAGGCTGGAGTGCAGCGGTGTGACAGGCACATGCCACCAT 

GCCTGACTAATTTTATCGTTTTCI I I I I I I I I I CAGGTAGAGATGGGGTTTCACCTTGTT 

GTCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCCACCTGCCTTGGCCTCCTTACGT 

GCTGgGATTACAGGTGTGAGTCACCACGCCCAGCCATTGATCTGTTGCCTCTTACTAGAG 

CTTCTTGTTGGATTCCAGTTTTTTCCCACTCTATTT^ 

TAGATCTGTTCATTTCATTTACCCACCAAMAAGGATMGTTCTTACTCTTTGAAATAAA 

AATCAAACTCCTTAGTCTGCCTCTGAAATAAAAATC 

Sequence 2886 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTATTGTATCAGAAAAATGCTMTTAATTTTTT 

GCACATAAAGGGCATTTTAAACTTGGTTTTATTCTTTGTGATAAATATGGATGATGAATG 

GTMTGTTAMCAGAATTCAAAAGTTATCAGTTTGGCTAGCCAGACACAGTAGTATATGC 

CTATAGTCCTAGCTACCCAGGAGGCTGAGGCCAGAGGAGCCCGGAAGTTCACGTTTAGCC 

TGGGCAGCATAGTGAGACACTTGTCTTTTATAAAAACAACAGCAAAAAATGATCAGTTTT 

GGGGATAAGTAAAGACAAAATGGC 

Sequence 2887 

CCCTTGGCCGCCCGGGCAGGTACGCGGGGAAAAMATATTTTTATCTTTTAGAATATCAG 

TTTGTCCCTTACAGTMTGTAAAATTACATTTATTTTATCTTTAGGATATAACATCTNAA 

ACAGAGGCTATTMCTTTTACACTATGTGTAMGAMCAACTTGAGAGGCMCTTTATTA 

TGCAACCACGATCATGAAGGGGATGAATAAACCAACTACTTAGAAAAAAAAGTGCTCTAT 

TATCATGGMTGGTTTTCATTGATCCTTTCTTGTATTTCTGGAATCACTAGAACTAATCT 

CCAATCTATGCACTGMTTTTTGACATACGATAAMTGCCATGATTTNATTAATGGATTA 

AMTACTTGTCTGGAAGTATANCTTTTTTAMTMTTNCAGTGGATGATAATTTNATTGA 

CMGGTGACANACCTAAATTCTGTTTTCNTCCNTGCCTTGCAAMNGAAACTTTTAGGGG 

GATTAAAA 

Sequence 2888 

CCCTTAGCGTGGTCGCGGCCCGAGGTACAAATAAAGTATTCCAAGGGTTCAGAATAGAAA 

ATGATTTCTTCCAGCTTGGGGACATTTGGGAMTTGGGATATCCTTTGGGGAAATGTAGT 

AATCANTATATTCTGGGAAMCATAGTAGAAGAATGAATAAATAAATTCCATTGAATTTG 

GAATATGTTGCCATTCTCCCTGTAACTAATGCTATCANGATAAAGTAGAAATACCACATT 

TCANAAACAGCTGGAGTAGACAGGTCTTCATAGGCTAGCTTGGGAAACCTAATAACTATT 

MTAATGAAATTTTAATTATACTCTTGGATTCTMACAAATGMCACACAGTGATCTTTT 

TGACTTGCTGCTTGGTTAT 

Sequence 2889 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGATTGTTGAACTCCGAGTCACACTCATTATGA 

CCTGGGAGGCAGATGTCTAGCCTTCAGCTAATTATGATGGGCAGTATGGAGGAGAGTGTG 

AGTCAAACCTTCCATGGAGGAAAAGGAGTGAACACTCAGGGAGACTGGAGCCGTTTAAAA 

GGACATGGGGAGGATCTMTCAGGATCTTGAAMTTTAATGTTCTCAAAAGTTATTTCTA 

TCAGTCTTMGTAATAGATGAATTTCTTTTTGCCACTGCAAAGATGATACTGTCTTGATA 

ACCATTCTCTATCATCATGCC 

Sequence 2890 

CCCTTTCGAGCGGCNCGCCCGGGCAGGTACCACCTGACTCAGCTGGGTGCATTTGACCAG 
TTTTGCCCCCATCTGTAAATATCCCTCAGTCCCCATGGAGCTGTAGAGGTGAGTGATGAA 
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GCAGTGACAGCTCTGGGTAGGAACTCAGCTTAATTTCAAGTTCCAGACCCTCTCCATCCT 

CAGGTTCTTCTTCTGTAAAATGAAGATCCCTTTGCGGTGAAACGTTTTGCAGCTCTGGAA 

TCTGAGGGTGTAGGAGGAGCTACAGAAGAGGGGTATAGGACTCACCTGAATAAAGGTGGC 

ATCTGTGATTAC TGTGG GCGAGGGTGAGAGGGAAGAGGATACTGACCGCTAAGCCTTGCC 

TTCTTAGTCCAAATTTTGGTTGTGCATTAAATATGTCTTA A 1 1 1 1 1 I 1 1 ATAACTTAACC 

TCACC 

Sequence 2891 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGAAGGCTAAGTATCGCTAATTTCTATTCAGAG 

ACACAGGGACTCTACCCTTGTTCCTGGCTTTATAAAGCTGCTCAGTTCTAATGTGCACTG 

CACACAACAGCTGCCAGGGCCACAAACACCCCCAAGCAGACGTTAGGCTCTCAACCCCTA 

AGATACAGTTT TCCTC TCAATAGCTCATAAGACTCTGCCCTCTGACGTGAGAAAATAGAA 

TTCTGTTGTGCTTTTTAAGTATGCCTATGGCATCTTTTAAAATATACCATATATGATCCA 

GTATTTCTTTGTGTTTGGAGTGTGAATCTGCAGGTGAAAGAGTGTGTTCTACACGTGCCC 

GATCCATCCTCATTACACAAATTCACCCTCTACTTTAAAACAGGGAATTTGGGCCTTTGA 

CAGGAAACAAGCCGGACAGTCATGGGGCCAAGCTGGAATCAAATGTAAATATTTAATTTC 

CTCCTGTTAAAAGCAGAGTGGACCCACAGTCCCTTCCAAAGTAA 

Sequence 2892 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACCCAACTAGGAATTGGGGCTCAGCAAAGTGC 

CCTGGAGGAAGTGATACCCAAGCTGAGAGATGAAGAATAATGAGTGGGGTAGCCAGGGAT 

AGGTGAGAGGGAACATCCTCCAGGCAGAAGGAGAAGCAGACACAAGGGAGGGTGTGGCAA 

GAGATGAACCTGAAAGGCAGTCTTGGACCATATCAGAGAGGACCTTATAAGCTCATCATG 

AAGCTTAAAATGAATCCTTANGT CATTG GACAGCTGTGGAGAAAGAGTTTGTAAGGGGAG 

GGAAGTTATATGAATCACAGGAATTTTTTAAAGGTGGCTATGGTAGTAGTGTGAAAAGAG 

AAATGATTAACAGTANGCAAGCTCAGAGAAAGAAGCCCGCCAAGGAGGCTGTCACCATAA 

TCCAGCAGAAAGAAGATGGTGGCCTGGACTGGGGAAAGTGACTGTAGGGGATGGAGAGAA 

ATGGATGGGCTTGGGAAATGAGAAAGGGATAGGATCCACAGGGCTTAGGATTTGAGTGAA 

TAGAAAAGAGAGAGAAAAGTTAAACCAGACC 

Sequence 2893 

CCCTTAGCGTGGTCGCGGCCGAGGTACCACTGCACTCCAGCCTGGGCGACAAAATGAGAC 
CCTGTCTCAAAAGAAAGC CCTCT CCTTAGCTGAGCAGAGGAAGGGAAGGAGTGTGGCTAT 
GAGAATATGATTTATGCCATTTTCTGTTTTTAAATCTAGAAGATCTTCTAAGCACAAATA 
CA GCTACA A TGAA ATATTTTACAGACAAAATGTTAATAGACCATATTCTTTGAATTAAAT 
TTGTTTTTAATTTTC TCTACACA I I I I I I I 1 1 I CCTGGAGTCTCTTAGCTCTAAATATAT 
CAATCAGATTTATAA 1 1 I 1 1 I I I ACCTGATTCAGATGTCTTACATTTTTATATTAAATGA 

ACCTTAAGCATGATTCTTTTGGTAAGCCAGTATGAATGCCAGTGGTTGGGGGGCGGG 
Sequence 2894 

CCCTTAGTGTGGTCGCGGCCGAGGTACAGTCCACAAAATGGTTTGCTTCTGAAAAAGCAG 

TAGTAGTCTTACATCCAANATTATTCCTAAAATAAAGTTTCGACCCACATATATAATGAA 

ATTGAAACTGCCTACATTGGCAAAGCCAGAAAGAGAGTAAAAAAGATGGGGTAGGCCAGT 

TGCGGTGGCTCATGCCTGTAATCCCAGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 2895 

CCCTTAGC GTGGT CGCGGCCCGAGGTACTGTCTGTGACAAGAGCAAGAAATACATTTTGA 

AGTGGTTTGTTTTAGTAGCTAAAGTTATCCTGACTGATCATTTGATAAAGGATATTTACC 

AGCAGTTCACAATGAACATACTACTACATGTATATATTAAAGATCATAGCCTGCACAATA 

AAAAGAAGAAAATAAATAAATTAGAAAATAAATAGTTTAGAAACAAAGACACAGTATTAG 

TTCTAGTTACAGAATATGATTCTCCAAATAGTATATGCTATCAGCTTCATCAACAAGTAA 

CCAGAATACCAAAGAAGACATAGAAATGGCTAACAAGTATATAAAAACATGCTCAACATC 

ACTAATTATCAGAGAAATGCAAATCAAAGCCACAATGAGATATTAAGTCACACCTGACAG 

GATCACTATTATCAAAACTACAAAAGTCAATACTTATTGGCAAGGATGTANAGAAATTGA 

AACAGCATGGAGGGTTTCTCAAAATATTAAAAATAGAACTACCATATGATCTGCAACCAC 

TTCTG GGTATTTATCCAA 




WO 01/070979 



PCTAJS01/09126 



TABLE 1A 

5/599 



Sequence 2896 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I I 1 1 11 I I I I I I 1 1 I I I I 1 1 11 I I I I 1 I I I GGA 

GACGGGGTNTNACACTGTTGCCCAGGCTGGAGTGCAGTGGTGCAAACATGGCTCACTGCA 

GCCTCGACTTCCTGGGTCCAATNAATCCTCCTGCCTNANCCTNCAGANTAGCTGGGACTA 

CAGGCATGTGTCACCACTCTAGACTAATTAAAAACAAAAAATTGTAGAGACAGTGTCTCG 

CCATGTTGTCCAGGCTGGTCTNAAACTCCTGGCAATCCTTNTGCCTTGACCTCCCAATNT 

GCTGGGTTACAGGTNNTGAGGMCCTGCCTGGCACCCAGTTTTTTTAMTNAANTTTTCC 

TACTGTNCCCMCCAAMTGACTTCTTATGMGCTTNTCATCANTTAATGAGCTTTATTA 

ATGANANCTTCAACACTGGNCAAGGGGGGAAAAAT 

Sequence 2897 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGGCGTGAGCCACCATGCCCAGCCTGAATCCCC 

CTTCTCAMAGTAACTTCCATTCACTTTGTCCCTGGCAGTCTGCTGGTGACCATGATTTC 

TAAGGCTCACTCATGGGACCAAAGGCAAGAAGAAAGGGAGGAGGGTGGTAACCAGATAAA 

CmGCMCCCCTTTGATAGGACTTTTTGGGTGAGMTTTTTATG 

TTCTACTATAGATCTGCCTCTAGTTTCAACCTCTAGTTTTAGCACTG AI Mi l I CCCCTT 

MTTTCCAGAAGCATGATATMTAATCTTTTTCAAAGAAAGTGGG 

Sequence 2898 

CCCTTAGCGTGGTCG CGGCC GAGGTACCCTGGGGGTGTCATCAACCAAGATTAGAAATAT 

GGGMGATGAMTGGTTTTTAGGAAAGATAATGATTTTMTTTTAGAACTGTCAAATCTG 

AGGTGAATCCTTCAAAAAAGTCAAATGATAGTTCCAAAGAGTGACCATGAAATCTGACAA 

ATAAGAGTTATCTTAGCAAGGAGCAGTCTCATGGAAATAATAAAAGCTGAAACAAATAGT 

AGTGATTGAGGAGTAAATGAGAGGTGAAGCAGTGGAGATAATACAGGTTCTGTTTGGAGG 

AGACTTGGCAATAAAGGTGAAAAAATAGGATGGATGAGATAGCACTCAGCAATGAAATGT 

AACAGAATTGATGCAACACGGGTGAATCTCAAAATCATGCCAAGTGAAAGAAGTATAGTC 

CCTATAATCACCTATAGTCCCAATTACTCAGGAGGCTGAGGCAGGAGAACTGCCTGAGGC 

CAGGAGTTTMGACCAGTCTGGGCCAACACAAGCMGACCCCCATTCTTTTAAAAAAAAT 

TAAAAAACATTANCCAGGCATTGGTGTGGTGCCCCCCNTAGTTCCCCAGCTACTCAAGAG 

GCTTGGGTTGGGAGGACTACTTNGNNCTCNTGAGTTNAAGACCAGTCTNGGGNAATCAAG 

CCAGANCCTNNTTTCTTNAAAAAAACAANCCATAAACCTT 

Sequence 2899 

CCCTTCGAGCGGCCGCCCGGGCAGGTACTCCTATCAGTGATGTATGAGAGTTACAGNTGC 
TTCACATGTCTGT CAAC ATTTACACTGTCACTG I I I I I I ATATTAGCCATTTTAGCATAT 
ATGAMTAACTGATTTTMTTTACATTA 

CATATGCTTATTTTTCATTTGTATATCTTTCMGMGTGTCTTTCATGTCTTTTCTCA^ 

TTAMGTTGGGTGTCTTTTTATTGTAGGTGGTTATTCTTTGTATTT^ 

TTTGATAGATTATGCCTATGA I I I I I I AAAGACTCTATGTNTTCACTTATTGGGGATCTT 

TCG AAGGAA CAAAA I 11 I I I I I ATAAAAATCCAACTTTTATCAGG I I I I I I I AATNTTAA 

T GGNT TTrTATNGACCCTCAAAAGTCACCMTAGGMCATNATTTTTN 

GTTTTTATGGGTTTTGGG 

Sequence 2900 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCTCTCCATATGCAGCAGGMTGTCTTTTT 

TATTTAAGGGCACATTTACACAAAGAGCAAATGGTCCAGACTCAGTCTTCCATAACCTAG 

AGTCTGM TT GATCA CTCTATAAAATGGAAAAATGTTNCTTTTC I I 1 1 M I I I I I 1 1 1 11 

TTTTTTTNGGNTTTTAAC I Mill I GAGACAGNGTCTCACTTTGTCAGCTAGGCTAGAGT 

GCAGTGACACGATCACAGATCACTGCAGCCTTGACCTCCTGGGCTCAGGTGATCTTCCCA 

TTTCAGCCTGCTGAGTAGCTGGGACCACAGGGCGCGTGCCATCACACCTGCCTAATTTTT 

GTATT 

Sequence 2901 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGTAGTTATAAAACTGACACTGTTTACAATT 

MCAGTGTTCTAGAATCCANTTGTTTGGAGGGTATTTTACATTATGAAATATTGACTTCA 

GATGGTCACTGCTATTTTCGAGATCTATGACTATGTTTCAAGGAGATCCATTGGTCTGAC 
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AAAATAAGAGATGTATATTCCTGAAAATTGCATGTCCTCTGGAGTCTGTTGTCTGGATGG 

TATCAGAGAGGTATTAGMCTGTCCTCAGAGGTCATTACAATTTTCTCATAGTGTTTGTC 

AAGGAAAGAGTTGGGGTGTGAAAGGTCTTCTATAGTATTGGGTCCTAACCAAGTCAGGGA 

AGAGAAGCAAATGCC 

Sequence 2902 

CCCTTA GCGTGGTCGCGGCCGAGGTACTTAAGTAAAGAAATTAACCTCTAATAATACAAT 
ATTTTTATTAGTTAAAAATAAAGGACATATGTATCTACACACACATATATGCTATACATA 
AATACATATGAGAAAGACAAAAATTGAGAGATTAAGAGACAGAGAGAGTTTGAGATAGAG 
AGTTCAAAAAAAAAGCAAATGATATAAATTTACAAGAGTAAATAGAATAATCATGTGAAG 
G GGTATT AG G AGTTCTTTCTTTT ATCTTTT AAC ATTTCTGTMGTTT AAM 
AMTAAMGATCTAAATATTTTTCAAAAAATAAMCCAMTTTTTGAAAATCTAAAAAAT 
TTTCAAAAAATAAAAAC ATAAACATAAAAAC 
Sequence 2903 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAACAAGGCCAGAAGCCTTCTACCTTCAGTCTT 

GCTCTGGCAACCCCACCTCTTCTTTCATCTTCTCCCATCTAGGGAGATGTCTAATGGGCA 

TGATAGTTAGAGGTGGTGAGAGGATAGGGACAGGAAAGTCCTGCCTGTGGGACAACAGGG 

ATCAGGGACTCAACAAGTCTTAGGCCTCTCACCAGGACCAGCATGTTGGGAAGCTGCCTC 

TCCATCCATATTCAGTCTCAGAATGGGCAGCTGCTTCCCCTGCACCCTCATCTTGGTCTG 

GTCTTGGTGATAGGGTGGGG 

Sequence 2904 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAGTGGTGACAAGATCTCAGCTCACTGCAACC 

TCTGCCTCCCAGGTTCAAGTGATTCTCCTGCCTGAGCCTCTGAATAGCTAGGATTACAGG 

CACGTGCCACCATGCCTGGCTMTTTTTGTATTTTTAGTAGAGACAATGTTTCACCATGT 

TGCCCAGGCTAGTCTCAAACTCCTGACCTCAAATGATCCACCTGCCTCAGCCTCCCAAAG 

TGCTGGATTACAGGCGTGAGCCACTGTGCCTGCCCTMGCCTGTGTGTTTTATTCTTCTG 

ACTTGCAGGCTAAAGCGGCAGCTCTTCCATATCTCATTGCTATCTCCTAGGGCTTCCGCT 

AGGAGACTGATCTGGGGCTAGAGGCCTCCCTCTGTGTACACGAGAATGCTGGAAATGTCA 

CCTCTCAGGGCTCTGCCTGCCTCTCAGCCCTGAAAGCCATGGTGGAAAGGGGTGGCGCTT 

GACATAGACATCTGAGGAAAAGAAGTGAGGGAGGGTAAAGGGTGGTGCAGTAAGANGAAG 

GGGTNGGGAAGG 

Sequence 2905 

CCCTTGGCCCGCCCGGGCAGGTACATTATCATTATTTTATATCTTGCTCAATCTTTTAGG 

CGTCM TCT CACAT CAAAAAGTTTATTGGCCTATAACGGCACGAGCCAACTTCAAATTAT 

TTTTTAAMGTTTTGTGTATTTGTAGATTATGMTCTCTGGTMCTACTTGTAACATTTG 

GCTCTGTTTTCTGTGMTATTTATCCCTAGCTAAAAAAAATTCTTACATCTTCAGAAGAG 

CTATAATG TTGA CACCTAACTACACTGTAATACTACACAGTAATAGGACACTAAGCAGAT 

AGATAGGTTTTTAAAAMTATTTGTATTTTCAGCACCTGGTAATACAGTGGTCTA 

Sequence 2906 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTCGTTGGTGCCTCAGTCTTTAAGGATTAACTA 

GGMGAATTTTCTTTCTGCATAGMACTTTATAAAATMCGACATTGTTAATAATTCAGG 

CATMTATTACATTATACCTTTCTTGAATGCTGACGTTGCATACAAGGGTGATCTGTAAA 

CCTCCATCATTTCCTTGCTCTCACTTTTAGACCTATTTCCTGACATAANAACCTTGGTGG 

GTAGTTCTGGAI I I I I 1 1 I I I NCTTCTTCTTTTimATTTTTGAGACAGGGGTCTCACTG 

TGTCGCCCAGGCTGGCATGCANTGGNTGCTCAAGTGATCCTCCAGCCTGTGCCTTNCTGA 

GTNAGCTTGGGACAACAAAGCACMNGCCACCAGGTNCTGGCTMTTTTT 

Sequence 2907 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATCACCCTGCTGAGGGACATCCAGGACAAGG 

TCACCACACTCTACAAAGGCAGTCAACTACATGACACATTCCGCTTCTGCCTGGTCACCA 

ACTTGACGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACC 

CCAGCCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGG 

GCTCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAAGTTTATCAA 



# 
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CCAACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAACCTACCA 
TATTCCCAGGACAAAGCCCAGCCAGGCACCACCAATTACCAGAGGAACAAAAGGA 
Sequence 2908 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTTGCTAGTATTTGTGGGTTTTGTATTTTACAT 

TTAAATCTTMTGTAAGATTTAMTGTAAATCTTAAACATCTTAAACACATATAGTTTAA 

GGTGTGAGGGGAGATTTMCTMTCTGTTTTTTCCTTTTATTCCACTT ATT AMCC 

TGmMTCAGTTGTCCCTAATATTGTTTAGTGMTMTMTTGCAMTAATATGGTm 

TATATTTTGCAAGGCAATTTTCCTTCATCCTTTATTCTTCTTTTAAAAAAA^ 

CTATTTCTACATTATCATCATCCATCATCCATTATTATT 

Sequence 2909 

CCCTTAGCGTGGTCGCGGCCGAGGTACATATGGATTTGATACGTAGATATAGATTTAAGA 

CTAGAMTACTTATAGATATATTGGTATATAAATGGTATTCCAGTGCCCTTTTATAGGAT 

AAMTTCATTCTTATATTAGAAAAATCCATTCATACATMTTTTTGATTATTCAAGACAT 

AAATG7TGAACAACTAAGTGCTAGACAATATGGTAGGTGCAAAATTAACACTAGAAAATT 

CTTCCC CAGAG CACTGATTTATCATTACACAAGCACAAATCTTTAATTCTCTAATCACAT 

AAMTTATTTTC1TCTMTMCTTCATCTCAMTTCCCTTTGGTTTCTGTGAAAAGAATC 

TTATMTATGTCACTATACATACAAATTATGAAAAATTTATGCAAACATACATCCTTTTC 

AGGMGMCCACAAATCTTCCTGCTTGAGGATCATTAGAAGGCAAATTATTTAATC 

Sequence 2910 

GGTCGCGGCCGAGGTACTTAGAACACTGTTGATGGAGTATATCTGTTGTGAGCCTGGTTT 

CTGTGTATGTGTCCAATTCAGTGATAGGAGAAGAAAGCCTGTAATTACA7TTGCATAAAG 

TGGTTGGCTGCTAAAGAGATGTAATTCCATTATTCCAAACTTTTMTTCATATTTTAAAA 

TATAAAMCTAGTATATATATCAAGGMTATTTTAGGAAATCTTGTAGTTACCTGTAAGT 

GTCTMCTGTGT ATTCA ATCAGTAACCAATATGGAAATTGCAGAGTTTCTTCATCAGAAG 

ATATGAGTTTAMTTTTATATTTGGCATGCACAGCAATATATTAAATGCTGTGCTTAACT 

AGAMGTATTGTGCAGTGTGATGGCCTTTTMCCTCACAATATACATTTTATTTTTATTC 

CCTAGACTTGCACATGMGTGCATATGTTCATATTTACCCGTTTGGAGATGGGTANGATA 

CMCCAGCATAGCTCCATCTNCATGGNCCATCTTTGATTTCATGCCNCCTTTCCTTT 

Sequence 291 1 

ACC CNTAN CGCGGTQGCNGCCGAGGTACTATTTGGGGGATCAAGATNTGNAANATTTAAA 

CTGTTTTTGGTAAAAACTAGCACTGTTGGGAAGAATGCTGTGGAATACCACAAACTAGTT 

TNGCAAAGGGGMAAGMTGTTTNANMGTGCTGATTGTATTTTAAACATCAGTAGTGAT 

AATACAAGGAAAGCTTTTA 

Sequence 2912 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGTTCTCAGTTTTTCTTATAGGAGAAATATGGT 

ATATGTTTATMGAATCTTTTATGAGATTATAGATTTCAATGCTGTGGATAGTGTCTTGC 

ACCCMACMGAMGTCCATAATGGAATGATCTTCCCTCAGCTTCCTATCGATTTAGTTA • 

CCTCTTGAMGCACAAAMTTAAAACATTGCCATATGTTGAATTTTTAAAAAGCACTTGG 

AGTGAGCGMCATTTCCTGATAAATGCCTTTTAGAGATAGGTTCTTGATATTCAGACATC 

TGCAGAMTGTTCTGGTTCCCAAAGTCATTTCACTTCGAAATAAAACACAGCTCCTTCAA 

ACAGCACTTTTTCCACATAAATCTAGTTGCCTCTCCCTGTGGACATTCAGAACTGATAGA 

ACAMCACTACTCTTTTGAATTTGATGGTTCGTGTCCTTTAAAGTGTTTGAGGACCTATG 

CAGAGCCTGTAACACTTGGGTAGTACCTGCCCGGGCCGGCCGCTCGAAAGGGGC 

Sequence 2913 

CCCTTCGAGCGGCCGCCCGGGCAGGTACAATTGGGTAGGCCCAGAAGAGGAAAGAGAGC. 

AAATAGAGTAGTCAGGAAAAGAGGGCCCAGTTTGGGCGCCTGTTAGCTCCTGTGTGGTGT 

GAAAGGCACTCAGATGTTCCTGCATCTCTGATGGAAAACACAGAACCTGGCTCAGCTGAG 

AGACAACAGCTCTGTCATGGGGAGGTTGAAGTGACCCCACAGCAAAGTGGAGAGGAAGTT 

GTTGAACAGAAGCTGTGGCAGAGACTCAAAGAGGCCTATAATGCTGGTGAGAACTATGGT 

GGCCTGAGTCATC 

Sequence 2914 
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CCCTTAGCGTGGTCGCGGCCGAGGTACTCACCTGCCACATCCTAATCTCAGCTCCTCTCT 

CTCCCACCCTCATCCACCTTCCATAACATGCTTTAGTCCCACCCTACTGTCAGTTATTCC 

TGCAAAGAATACAACTTACAGCCGGGCGCGGTGGCTCACACCTGCAATCCCAGCACTTTG 

GGAGGCCGAGGTGGGTGGATCACGANGTCAGGAGATTGAAGACCATCCTGGCTAACACGG 

TGAAACCCCATCTCTACTNAAAATACAAAAAAAAAAAAATTAGCTGGGCATGGTGGCGGG 

TTGCC 

Sequence 2915 

CCCTTTGAGCGGCCGCCCGG GCAGG TACTTTGTTTATGGATCTGTTGAATCTGAGGAGAA 

GTGATTGCTTTGAGTTAGCTAGTTTTAAATTCATGACACTTTCCCAACATAATTAGGTAT 

TTGMCCTMGTGTGTGCTAGTGTATTGTATATTTTTCATTAATTACTTTATTGAAAGTT 

GCTTTGGGAATCAATTCACTTCAATATATnTACTGTTCTCAGTGAAGAATGTGACTAAT 

MGTATTTGGACTTTCAAAAAGTGATACAGTCTTTGTGATAGTTTTTTACCTTTACATTA 

ATGAGTTGGAAAATAAC 

Sequence 2916 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCACTTCACACCAGCCTGGGTGACAGAGTGAG 

ACTCTGTCTCGGAAAAAAAAAAAAAAAAACGAAAACAAACAAAAAAATAAAATGTTCACT 

TTTGGGCTCTATGCCCAACAAGTGAGGTTGCTGGACTGTGACTACAAGGACACCTGAGCT 

GTAACTGTGTCCATGCATCTCATTTTCTCTTGAGAAGACTAAAGAACTGACCTTTAAAAT 

AAACACAGCTCCATGGTGGCCCCTITCAAAGTCATCACCTCAGGAGGCCACACACGTGCC 

TGAAGGATGCTGCCTTGGCTTGCAAACCTCTCAAGCTCCCCCAGAGTAGCCTTCANGGTC 

CCCTCCCCGCTACACAACAGCAGCCAGCTTTGCTAGCAGTCACTTCTATGCTTCTCTCCC 

ACTGACAGTTACTGNGAGTGAGCCTCTGCAAGTATTAACAGCGCCTAATTAATGCAGTGA 

CCCA AGATACAAGCGGTGACAAACTCCTTACCCTTCAAGTANGGCCTNTGTGGCTGAAAA 

TTTTACAATGGGCTTGTGAAGGGGA 

Sequence 2917 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTACAGAAAGGCCACTGCAGCTGCATCATAG 

TTCTATAACTCTTCAAAGTCCCCAGGCACCCTAAGTAGATAGCTTGAAGCTTCACCTTTA 

CAAAGTCCAATCCATTGCATCAAAAGCCTACATTATCCCATTTCCTGTGGCCTAGAGATT 

ACAAATCACCTGAAGTAATACCAAAACCCAGTGAGTAAAGACGGAAGTAAATCTAGTGAG 

TTTCCAACCACCCACCGCCCCCGAGATCTTAAAGGAAGCAGTAACAAGCAACTAGAACTC 

ACATGAGAATTTATAGGCCAAGAGATTTTAATGTAAGCATTCTTATAATTAAATACCTCT 

TACAAAAAATGCCATTTAATCAGACAAGGGGAAAAAAGTANCATCAAAAGAGATGCTGAC 

AAGGGAGACAATGAAATCAAAT 

Sequence 2918 

CCCmCGAGCGGCCGCCCGGGCAGGtACTTTTAGTAGAGACGGGGTTTCACCGTGTTAG 

CCAGGATGGTCTCAGTCCCCTGACCTTGTGATCCACCCACCTAGGCCTCCCAAAGTGCTG 

GGATTACAGACGTGAGTCACCGCGCGCAGCCCGCTTTTGTGGTTTCAAAAGTTCTCACTT 

GCTAATGTCATGTTAGAGATACAAATTTAAAGTATGCTAAATCAGTGGAGAGAACAGATG 

TTTAATCTATAGACACTTATACATGCACTAAGAAAGTTGACTACTATACTGAAGTCCCTT 

ATAAGAAGGGAGTTGTTAAGGATAAGGAATGTGGATTGGGCTCTTTCAGAGACTCCATAC 

AGGTATATTTAATTAGCAAAGGAGTCAAAAAAACACCAGTTGTTATCAACAGCACTTTGC 

ACATGTTGGGATATGATTTGCCAACTGGTGAGACTATTTCAGGTATTAACATANAATATA 

TGTCAGCTTTTGGGGAGAAAAAAGTAGTCTAGGACAGTAAAATAGTCTTATTTGCTTAAA 

TGAACCCATAAACNTAGGCAGCATTTCTTCAAAGTGGTATTNCATGGATCACCAATCCTT 

ATAAAA 

Sequence 2919 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGAGTATTTTAATCTTTAGGGGATCAAGATGTC 

AGATGCAAACAAAGCTGCCATTGCAGCAGAAAAGGAAGCTCTGAACTTGAAGTTACCCCC 

CATTGTCCATCTCCCAGAAAACATAGGCGTTGATACACCAACACAAAGTAAGCTGCTAAA 

ATACAGAANATCCAAGGAGCAGCAGCAGAAAATTAATCAGGTTAGTAATTGATGGAGCCC 

AAAAGAAATTTAGACAGAACACTGGGTAAAAAGAACACCTCTATTACCACCACCCTGATT 
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ATNCCTCAANACTATGACCAGTGAAAATGAAAAAAAAAAG 
Sequence 2920 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGAGTTTGTGTTGTTAAGAGTTAAGGCCTTAAA 

CTTGGCTCAAMTAAATTTMCTGTGATTTCTCTMTTTTTAGTGCCTACTGTTTCACCA 

GACACCTAGAGTCAATAGTGTCCTAGAATCTACGGTGTCCAGGGCTTGAGCAAGATACTT 

TAGTGAATATAAACACTAACGAGACCTTGCCTTTTCTCAAGCCTACAGGATATCAGCCTA 

TTTGAA AAGTGACACTGGCTTTACACCCAGAAGCCCACAAAATAGAATGATGTAACTGAA 

i 1 1 I i iGGTAGCTCTCTGGGTAATTTATGTGGTGTTAGGACTCTTTGGGTTTGAAAGNAA 

AGAGACTCGTGATCCCATTCTGGCATTGACAGTAAACTTCATTGGA 

Sequence 2921 

CCCTTAGCGTGGTCGCGGCCGAGGTACAMTTATMTACACATACCAGACAMTTTTATT 

AMTAATAAAGAAGGGAGAGAGAAAGAGCMGACACTTT7TAAGCACCATGTGCCAGGCC 

AGTGCTTG MTCT CTTCCTACGTTACTTCTCCTTATTATTTMCCGTTATTTAAATGTTA 

CTACTCCCATTTTACAGATGGAGAAATTTAGCTTAAGGTAATGACAAGATTGAACTTCAA 

GTTTTCTGACTCCAAAATATGCCCTCTTTCGACCACACCGTATTATCTCTAAATATGGAA 

AACATTTCAAAAGTAGCAAGCGTNTTCAGAACAAAAACNAAAACAATCTAAGGGGGCTTA 

CACATTATAAACATGGTTTCGGGGTCAT 

Sequence 2922 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGGAGAAAATGCACAAAATCTGACTAAGGAGC 

CATA AGAC TCT GGGT GCAMTAATTGAAAAAAATATTTTAGAAGCATAATATCAATTCAT 

CTAGTTTTAAAGTTTTCACAAATTATAATAGGGMTTCTGCTTATTTCACAAAATMTTA 

TCTGACCTTMGAGG ATTTT TCCTAACATCACATAAACCTGATGGCTTTACACAGTGATG 

ATGAGCTATTATTTCATTTTCATCCTGATAAACTACATAAGMTTGCMTTTTTATATGC 

MTTATTCATMTGTATGTTTGGATTTCCCTGTTTAAATTCTATTAGAGTCCTACTTTAT 

GAAAAATCTATCATTTAATATTCTATTTAATACTGAGATGAACATACATAAAAGCAAACA 

GATGAAGAAGATGAGTAATATCTCAGGATCTGTAGCCATCAGACTATGAGCAAATCATAA 

TATATTATTATAAGTCCCTTCTGTTACAGTCAGATATTTCAGAAAAATAACAATAATGTA 

TTAAAAACGGAACTTCTCTGGTCTTTAAATAGCACAAGTAATTCAACTGATAAATCTAGA 

TMTCTTATAGCTCTCTATACCTATNGGTTGGGTTTGCCTTTTCTA 

Sequence 2923 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGCAAMTTGCAAGGGCAAACTTTGTGTCTTAC 

CACTATGATGGGAGAAGAGTTAAATAAACAACATAACAACAAACTAACTTATATACTCTG 

GTGMCACCTCGATTTCATGATNAMGGGMTATACATTTATTTTCMGAACTGANTTAC 

ACTTTTACAMGTTGACCATATATTAGTTAATAAAGCAAATCTGANAACATATTAAAACA 

ATTATCATATAAACTACATTCTTTGATCAGAATGAAATTAAATTAGAACTCAGTATAAAT 

AAAGAACACCAAATAATGTGCAAGTCTGGAAACATCCTTCAAAGTAACCTATAGGTCCAC 

CNNNNNGGANANAAACCAAANNCAANTNATGTTCCTGCCCGGGCGGC 

Sequence 2924 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCATAGGAAGGAAGCTGTCCAAGCCCTCCATTG 

CTCTTTGAGCTCATCATGTGATTATAGGGAAGGGGAAAGGGAATGATTTCCTCAGAAAAA 

TATACTGATCACCATATATTCTTTCGGGATGCTAAACAAGTATCCAGAAAAACATTTACA 

TGTAATACATCTACAAATCCTCTTTTCCAGTAATGCTTATGTCCTGTCATCTGAGATATA 

AAATTCTTTCTG G GTTACTM7TTCTTMTTTGG CC AATTTATTT AAG ATGCTCTTC AAA 

GTTTTAAATTAATTGACT 

Sequence 2925 

CCCTTAGCGTGGTCGCGGCCGAGGTACMTTTCCACTCTTAAAMTTTACTMGGTTTGC. 

TTTGTGGCTTAAGATATGGTTAATTTTTATGMTGTTCTCTCTGTCACACTATTATTTAG 

CATTGTTAAATATCTAATGTCAATTAGCTGAGAAAGAGAAATGAGGCAAACAAATTAGAA 

AGGAGAAGGTAACATTATTACTATCAGATGTTATAAAAGAACCAACAGGACAATCTATAT 

AAACATCCCCGCGTACCTGCCCGGGCGGCCGAAGGG 

Sequence 2926 
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. CCCTTNTTGTGGTCGCGGCCGAGGTACAGTCCACAAAATGGTTTGCTTCTGAAAAAGCNG 
TAGTAGTCTTACATCCAAGATTATTCCTAAAATAAAGTTTCGACCCACATATATAATGAA 
ATTGAAACTGCCTACATTGGCAAAGCCAGAAAGAGAGTAAAAAAGATGGGGTAGGCCAGT 

TGCGGTGGCTCATGCCTGTAATCCCAGTACCTGCCCGGGCGGCCGCTCGAAAGGG 
Sequence 2927 

CCCTTAGC GTGG TCGCGG CCCGA GGTACTTTTMTGTTTCTTATACTTGTTTCTTCAGAA 

TTTMTTAGTTTTAMGGAATTTTTCTCATTATAAMT^ 

TGTTTTGTTTTGTTTTTTTGATGTMGTTCT^ 

TGAGCCGAGATCACACTACTGCACTCCAGCCTGGTGAC 

Sequence 2928 

CCCTTAGCTTAGCGTGGGCCGGGGCCGNAGTACATTTACAGAAAGGCCACTGCAGCTGCA 

TCATAGTTCTATAACTCTTCAAAGTCCCCAGGCACCCTAAGTAGATAGCTTGAAGCTTCA 

CCTTTACAAAGTCCAATCCATTGCATCAAAAGCCTACATTATCCCATTTCCTGTGGCCTA 

GAGATTACAAATCACCTGAAGTAATACCAAAACCCAGTGAGTAAAGACGGAAGTAAATCT 

AGTGAGTTTTCAACCAC CCAC CGNCCCCGAGATCTTAAAGGAAGCAGTAACAAGCCACCT 

AGAACTNCCATGANGAATTTT^^^AGGCCNAGAGATTTTTAATGGTAAGCATTNTTATAAT 

TAAATACCTTTTACAAA 

Sequence 2929 

CCCTTAGCGTGGTCGCGGCCCGAGGTACCAGCCAGTTTTGAGATATACCTGGTTTGCTTT 
MTGTTATTAMTTGCTTTGGTGAGTTTTGCAMTGGACAATTGMCTTTGTTCTGCCAA 
ATTCI I I II rA TTGCATTTAAACTTCAGGGTTTTTTATTTCCGGCTTATTATGAATGGAT 
TAGTCATTTTGTGMTAACTCTTCATTGAGTTTAGGGAGAAATAGGACGTTAGATCCGAA 
GATGCTATTTGTTTAATCACCAGAATGGACAAAATTATACTTCATTTACTATCAGGGAGG 
CA TGGGGAGMTAGAGG CCTAGGGGTGCCGTCTTGTGGCTCTTACTGTTTTTGTGTATCT 
GG 1 1 1 1 1 1 I I I I I I I 1 1 1 IGAGAGGGCTTCCATTATGTTGCCCGTGCTGGACCCCAACTC 
CTG 

Sequence 2930 

CCCTTCGAGCGGCCGCCCGGGCAGGTACGCGGGCAAGTCTGCATGGCAAAAAAGGTGTCC 

ACAAGTGTGMTGGCTCATTTTTAATGACTAGCCAACAGCCACTTGTAAAAGATGTAATT 

GMATTGTTCAGAGATTAGGTTCCGTCTGCTTTGNCCTCCTTCTCAAAAGTTTTCATGGA 

TCCMGCTATTCTTAAGCATTGTTTMTCTAGGAMTAGCCTTTATGGACTAATNATAAG 

AMGAAAAAGGCANATTTTTAAMGATATTCCAAATTCAAAAGAAAATAAGTTTGGGCAG 

CCTGACTT TTGA TTATCTGTA AAAAAT CANGGCCAATAGTTTTGAACGGGAAGAGTAAAT 

ATGMN GGTTTTCT AAACCAGTTTTTTCTTGGAAGAAGAAGGGGTTGGNAGGTTAAGAAA 

GTTNTTCTTTTTTTNATCNAGCCATGTGCCCTACCAAAAAACATAAMTGGTGGTTTCCT 

GTGAACCCTCAAAAAAGTAAAATCAGTGAGTGNGATGNAGGGAATTNGGG 

Sequence 2931 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTGAAAACTCTGATTCCCAGAGGGAGGGTAACA 

TAGAG CAAAG CCCCCTACTTACCCACAAAGAATATTCGGTGTGTGGCAAAAAGGGGAAAC 

TTGTTATTT TAAGT CACTAATTNAAAAACCTAGCTCAATCTAATATACTAGCTCAATCTA 

ATACA CTMTTTTGT TTTTAAATTAGTGTAAACTMTTTAAACCTAGCTCAGTCTAATAC 

ACTMTTTTGTTTTTAAATGATAATACAGTAATGAATACCTAGAAGACAAAGGTTAAGGG 

AAAGGAAAAGAGAAAGAGGTGCTGAGAAAAAAAACAAGGGGAGTTTGAACTGAAAGTTAA 

AGAGCTCATTAAAATTTTGACAGATCATCCATCTTTTCATCTGAACTCCGTATCTATAAA 

TCCTGAT 

Sequence 2932 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTCCCCATCAGTGACATTAACATTACATATTAA 

TTCAACAGGACATCTCCTATATGCCAGGTAGTGTTCTAAGTGCTGAGTGAGTTATAACTG 

TAACAAAACATGCCTGCCACTTAAGAAACTTACAGGGTAATGAGTTAAAAATGAACACAT 

AAATAAGGTAACCTGNTGATAATGGGGAAATGCTNGGAGGAAAATAAATTGGGGAGGCNG 

GGCGGGGGGGT 
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Sequence 2933 

CCCTTTGGCCGCCCGGGCAGGTACTGAAACATCAAAACACCAGAATGAACATCAGGAGGG 

AAACATGAAAGTAGCCACTCCAAGTAAGGCTCCAGAACATACTCAAATGAAGGAACCAGG 

GCTTTACTGCGTGACCTAGATCAACAACAGAGCTGTTCTCCTTCTTTCAGCTMTCAGGA 

GCATTTAGTTGGGATTGGG AAAGA GGCAAAATATACATAGAGGTAGAAAATGAAGATTTA 

TATGGATAACTTCACAGCTGTTTTCCTGTGAACCACATTTCTAGCAATAAACTAATCCAA 

TTGGGAGGGAAAACAATATGTGAAGAA 

Sequence 2934 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTGTTGTTTGAATGGTATTTTGAAAGCAA 

TTGCATAGTTACTATTGATTCCTCAAAATGCATAATGGGGATGCTTCTGACCCGACTTGC 

CTTTTAGCAAGTTGAAGGTAGGTATGCACCGACAGGGCTCTGACTTCACCAGAACTAATA 

AGTTAGATTGCAATAGTAGGAGACAGATATAAATAATGAAGGAAGCTAGAGTTATGACGG 

CTTTCTGGAAAAATGTTGGGAATTGGAAAGAAMTGTGGAAAGAGAGTTGAGTGGAAGAA 

AGACTAATAGAGTAGGAGTATTAGTTTTATGAGAATGTGAGTAAATCTAATATTGGATAA 

ATGMTCTAAGGCTACTMTACTACATTTTTGCAACTTACATTTTTGAGTATTTGGGTAG 

TAAAATCACTCCGATAAAAACTTTTAGAGTGGTTCACCTNTCAAGTCACATAATnTGTG 

TTTTATCNACATATACTTGGTTTTTTTAGTCCAAMTAhn'GGTTNGGANTTTCTTGGNTA 

CNCCT NACTTrGGAAAN ACANTAAAACTAGCCATTTAAAAATTAATTCCAATTAATTTGC 

ATGTTAI 1 1 1 1 1 I 1 1 I 1 1 AAAGCATTNTCCTNGGGGGATNTATNGNCTTTGCAAAATTT 

Sequence 2935 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGGCAAGCAGGTGTTCCATGAGCTGAGCCA 

GCAGACCCATGGCATCACCCGGCTGGGCCCCTACTCTCTGGACAAAGACAGCCTCTACCT 

TAACGGATGGACTCCGTGTTGGTCACTGTCAAGGCATTGTTCTCCTCCAATTTGGACCCC 

AGTCTGGTGGAGCAAGTCTTTCTAGATAAGACCCTGAATGCCTCATTCCATTGGCTGGGC 

TCCACCTACCAGTTGGTGGACATCCATGTGACAGAAATGGAGTCATCAGTTTATCAACCA 

ACAAGCAGCTCCAGCACCCAGCACTTCTACCTGAATTTCACCATCACCAAC 

Sequence 2936 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTCCCAGGCTATTAGTATTATCTAATGAGT 

GGCTTAAAGATACCAGAAACACTGACTGAGGTCTCCACCTGCTTCTGGCTGAATGAATGG 

GAGC TTCTA GCCATTGAATAAGAGGCAGTAGCCAATGTTTCTTATTCTGAATGGTCCAGG 

TTGACTTTTATAAGAGAAACACTCANCCTTTTAATTGCTGAANGTCTGGCCCATAGACTA 

AGATTGCACATTCTTAGTCTAGGTTCACTTGAACTAGACTGCACCCACTGACGATATATT 

CAGTTCAGGTGTTAACTCACTTGGATAAGCATNTGGCCTGTCCT 

Sequence 2937 

CCCTTAGCGTGGTCGCNGCCGAGGTACTAAANACTAACATTTATTGAGCACTNACTAGAT 

NCCATAAACTGANATTGNCNTTGATCTNTGATTAATTAATCTCACAATCCTTGCTTTTCT 

GAAGTATNTNATTCAATCCTTTGAATGTCCCTAATGCACAGATAAGGAAATAGGCTGCTC 

TNCANATAAGGAATACANATCCTTGTGGATTCAACCCAGATCTGACTTGAGGCTAAACTC 

CTAACCA 

Sequence 2938 

CCCTTTGCCGCCCGGGCAGGTACGTTTGTTTATCAAGCTAGAAAATTCAGATAATAAAGT 

CAGCATCTGGATGAGGTAACACATTTTTTCTGCCATTCCTAACTTCCTCTCCTGGTATTC 

AAGTCATAGATACCGGCCTTTATCCCCAMGAGTCMTTCTATTGTTCTTTCTTTCATAT 

AGATGAGAGTCAGAGAAACAAAGTAAGGTTTGGGAGAAATGGACATGGTATAGTTTTAGC 

TAMCATTCTMTATACAAMAAAAATTTAAGTTGTTTTGAAACAAATAACACCCAAAGT 

CTACACATTTATAGTAATAAATAATTATCCAGCTAATTCAGATAAAAATTTGTTTCTAAC 

TGGGAAGGCCAGAAGTCTGTATTTCTACAGACAATTTAAAATCGAGCCTGGAAGCAAATA 

TCCAGATMTTTGGATCCCCTTCCTCCAGTTTACCCAAANGAACAAGTTGTTTCAAGGTA 

GGGCACAGTTATACCATGTTTGGATCCACATGGATGAAAATTTACTACT 

Sequence 2939 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCCAGAGGACCAGATGATTGATGGGTTCTACC 
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ACGAGCATCTAATAGCGACTGCTGCAGAACTGGTGTGGAAAAGGTTTCTATTTTATAAAA 

GAAAGTATCCAGAAGGCTAAAGGCAGTTGTGGAATTTATTTCTTCCCAGTGCTCTGATTG 

TCAAAATTTGAATAAAGCTCAGTGAAAGGAGGACAGTAATTAATGACTTGATAGCCAAAC 

ATCGTAATGATGATTTAGTGATATACTAGACTGGATGAAAGAGAGGTCACTGTCTTTTCA 

CTATGGATTCATCATAAACAGTATCATT 

Sequence 2940 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAACTCATTATAAGCAGATCACAGTCATATAGT 

GAGAGAGAAGGAAAAGAAAAACCTATGTAGATCTAGAGGAATTACCTAAGAATTCCTAAT 

TACATA GGATTMGCT CTGGGCTAACATCAAATTTTAAGTAATTACTAAAATAATTTCTT 

TTCTTTCI 1 1 I 1 1 1 1 1 IGAGACAGAGTCTCATTTTGTCACCAAGGCTAGAGTGTAGTGGC 

TTGATCTCAGCTCACTGCAACCTCCACCTCCCGGGTNCAGGTGATTCTCCTGTCTCCAAG 

TAGCTGGGATTACAGGCATGCACCACCACGTCCAGCTCCTTTTTTTATTTTTAGGT 

Sequence 2941 

CCCTTTAGCGGCCGCCCGGGCAGGTACAGTTCTGCTCAGAATGCCCTATCTCCTNANTAA 

ANNNNM NGNN GGAGANANGNCCTATTTTGGGAGAACTAATTTTGCAATTCCTGCCAATT 

TTTATGCATTTTGCCCTTGTTTCCATAATTCAGCACCTCCATCATCTCTTCATCACCTCC 

ATCMGATACTTT CACTTC CTTTTAAAAGCAAAGATCTCTATTTAATGTAATGCTTCTCA 

GTTTATTATTTACATTTTTATTTTTTCACCATGCTATCATTTTTACTGGGATTGTCACTG 

CTrTGGTTAATGCTCTGGTTACAAAGTGGCATTGCTGTGGGGCAGTCAGCTGTAACCC 

Sequence 2942 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAGGTGATACATTCTAGCCTTCTACAAAGTGCA 

CTTAGACCANGATATTCTTTCATGATGGCAGACAAATGAGGAGTTTC7TTTCATTATCTA 

ATGAAAAAAGAGTTGGTCTATATTTTCTCAGTGTATTCCTATAGGGAAGAGAGTCTTATT 

GTTGATAACCTTTCAGACAAGTAAGATGCTTCTGCATCCTATGAGGTATTCCCAAAGCCC 

CTCTTTAAT CTCC ACCATTGTGCATAATTTTGTGATAATGAAGCAGTTATTTTCTTCTCT 

TGATAGACATTTTATGAAGGCAGAGACTATGCCACTCTTGCTTACAGTTGTAGCTCCAGC 

ATCTAACATGGTGGCTTAACACATAGAAGATGCTCAGTAATTACTTGTTAAATGAATGGG 

ATGGGTGGGTGGATATGCCCTAATATGATGCAGCTTGATTATGCTNGAANTCTGGCTGGA 

TTAAGACATCATTCCAAGAAGTAAATACCAGTANTCTTTAAACTGTTGCCCTGTATCTTA 

CACACCTGAAAGAGACATCTTTCCAAGGGTGATNAAAAAAAAAGGTCATACCCTTTGAAA 

AA 

Sequence 2943 

CCCTTAGCGGCCGCCCGGGCAGGTACTTAGATTTGCTTAGATACCTGTAAAAGTGGGATT 

GGTA TGGGATAAAACGTTCTTGTGCTGCTTGCAAAAGTCTTTGGAATATTCTTTGGGATA 

TTTTGGTGCTTAGGAAGTGCTTTTCAGGTTATATTAATATATGCTTAGTCTTCATGCTTT 

CMGTCCTGTGGGACGTTMTGTTAGTCrrTTMGTTGAGTCCTTTTTGGTTATATTAAG 

AGGTAGTTCTTGATGTTTCAAAGGCCATCCAGAAATAGGAATGCCTGAACAGGAATTTCC 

AATTAAGTCGGTCAGAATCCTGAA 

Sequence 2944 

CCCTTGAGCGGCCGCCCGGGCAGGTACGCGGGACACAAACTAACCTCTAATGTCAATATA 
CTGTGAAAAACACTGTGATACAGATATGTGGAGGGTAGTATGAGGACACAAAAAAGAGTG 
ACAGCTAACCCAACCAAGGTGGTCTGGAGAGGCTTCCTGGGAGACTGATTTAGACTTAAG 
TGTTATGGACTAAGTAAGAGCTACCTGAGTGAAGAAGGAGGGAAAGGAGACTCCATACAG 
AAGAGCTGAGGAATGAGGAGTGTCCTAGGGTAGTCATGGTTTC CI I 1 1 1 1 I I I 1 1 I ATGA 
Sequence 2945 

CCCTTNCGAGCGGCCGCCCGGGCAGGTACCAGTGATTACCTGCCCCTCCTACCCTGGGCC 

TGCCCCATCCCCCACTTAAGAACTACAAACTAAACTACTGATGAAAACATTTATATATGT 

TGATACTGAAAGANTCCCTGTAGAGGAATAGCAACTGGATANGTAAGCNACTGCAGAAAA 

AAAGGTGGGNAAAATTTGNTTAGGGCATATCCTTTANMTTTAMCTTGMCTTTCTTCCN 

ACNACCNAAATTATAGNACCCCAAGGNACCTTTGGCTTAACCTGCACNCTTAATGGTTNT 

NCAAAATTAAAAAAGNGGAAAACCAAGGTTNGGCATTMTTGGTTTTTACNAAAAAATTT 
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TTAAAAAAATTTGGGGGNATATGNAACCAACNANAATTGGTNANCTTAANATAACNNACC 

NMATTNG^GGGNGGAAAATTCNCC^NGAAMTTTTGGGGTMTACCCCmGGGGG 

AAACCCANNGAAAAAAAAAAGGGGGAACCAACNTTTCNAACCAAAAAAAAAGNTTTGGGG 

GAAAAAANAACCTTTTGGGGNTTAAAAAAAATT 

Sequence 2946 

ccctttgag cggcc gccc gggcagg tactatcttttatttagatatgctgagataattac 

agatgaaatg i i i i igggg i i i 1 1 i i i ctacataatctgttgtaataactggtgggagaa 

agtgggtgtataaacacattgctgcaattaagtgatggagacatgagtgttcattgcact 

acmttacattttattgaamtttcctcactcaaatttatggaataggtatctg™ 

actattgatcattctctctaaatttgaaaaaagaagctgtatttgactcttatgtcccaa 

taaaacttaaaaaatgctgcccgagaattgtttggtgtgaaaaaaaaggaaactacaaag 

TCTTTT 
Sequence 2947 

CCCTTTCGAGCGGCCCGCCCGGGCAGGTACTACCCAAACCTAAATTTAGAAACAGGATTA 

GAAAMTATCATTCCTATTCTTCTTGGCTATAAATTTACCCATCAAAGTTAAATTTAATT 

ATAC ATTCTTA ATAGTCTTGGATAGACAGGGCTTCACAGATAAATTGTAATTTAAGCAAG 

AATTAi ii i i iCCTGTTTAACCCTAAACACAATAATGTGGTCTTTACTAGCTCATCTCAA 

TTGMTATCATGCCTTAGTATTTGGGCTTTATATAATTTAGTGTGTATATTAATTTCTGC 

TCCTGAGCTCTTATAAATGAAACTGCTGGGAGCTGTGCACAAGGCTTAAACTAAAGAAAC 

CTCTTAMGACAG TGTG GATATCACTGAMGTATTTTGGGGGAAMCTAAATTTTAATGT 

AAMGAGMCMCTTTTAGAAAGAAGAGAACCATAAAAATAAAATGCAGCACTCCTTTGA 

CTTTGNCTAATTCCTTGNGTCTTTCTGGGTGGAATTCTGTAGTCAATGNGGTTTAAAAAT 

ACATCAGCACCCTTTGCTTCTNCCTATMTGGAAAMTTTAAATTTAAAACAAAA 

Sequence 2948 

CCCTTAGCGTGGTCGCGGCCCGAGGTACTTGTTTTTGGCCACAGGAATCTTAAGTGACTT 

AATTAACAGGTGAGTGGCATTGCTCCTGAACAGAGCTACCTAGTTATAATCTGTGTTACC 

TTTGCTTTAAAATGTTCTCTTTACTCCATTTCCTTAGTTGATATGCTTGAGGCATCTTG^ 

TGMTCCATCATTTATTCACCCACTCATTCATTTCACATGTATTTGCAGTTGCAAAGCAG 

GCAGTAGGAATAAAATTAGGATCAAAATAAGGTAGTAAGTAACGAAATTTGCTGAGTATG 

TTCTGAGAGAATAGCTGAATATACATTTCTAMTGGACGACAMTCCATTTGGAAGTCTT 

TCTCCTCCTGTTATTTTGGTAA 

Sequence 2949 

CCCTTAGCGTGGTCGCGGCCGAGGTACCGTAAATGGAAATTATCCCGCAAATTACGGCAT 

CTCCAAATGTAGCGTGGAAACTAATATTGACATGTCAACACAGGATTAATACATAGTAAT 

CAAAGAGTGCTTAAAAATTCATTAAAAATGAAGTCCCACCTCTAAAAGTGAAATGTGCAA 

ATTAMGTAGCAGTTAGATMTGTTTTGTCATTTAGGTAAAAATTACAAAAGAATCGGCC 

GGGCGCGGTGGCTCACGCCTGTAATCCCAGCACTCTGGGAGGCCGAGGCCGGCAGATCAC 

CTGANGTCANGAGTTCAAGACCAGCCTGGCCATGGTGAAACCCCGTCTCTACTAAATATA 

CCANGAAATTAGTTGGGGCCGTGGTG 

Sequence 2950 

CCCTTAGCGTGGTCGCGGCCGANGTACANCAAGATAAGTGACCTACTGAGGGACAGAGTT 

CMMTAAMTAGGTTTCAGTCACTCAATGAAGTGCTTTGCTGGGAGATCTCGCCTGGTC 

CTGTGGTCCTTTAGTGAAATATGGGGAAACCAGCAGGGGACGTAGGTCAGATAGTTTATA 

ATGGTGAGTTATAGGTTGTATATTATMGGAGCATTTCTCAGCACTGAAGAAATTTCACT 

AGATTCCAGCTCACCTTTCCAMCTATTAMAGCAAMCGGGACTGTTTTTCAAAAATTG 

GMAGTAMCCTGGCTTCTTGTTAGAMTTTGGAAMTATTGAGMGTTTTAGACCCAGT 

CCCTTTAACCTTAAGGAGCTCACATGCAATAAGATGGAAGGGCAGGGGAGATCANGACAG 

AAMC TACCGA TATGGCCGTGAGAACATGGAAGCTTATATCAAGTGAATTCTTCTTTGAA 

TTTACCTTTTTATAMTCTTTTAAAAACMATTTGACTTTAAAAAAGCC 

TCGGAAATGCTAATTAGCACCTTTAAAGTTAGATGAAACATATCAGTGGAATCAAAGCCT 

ACGCCTCAAATGGATGTATTTTAATGGTTT 
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Sequence 2951 

CCCTTAGCGT GGTCG CGGCCGAGGTNCAAACATTAAGTGTGTAATTCAATGAGTTTTGAC 

AAA TAATGCA GTTTTGTAAAAACTTTGGTGAAATTGGAAAATAACCGCACCCCAGTCACC 

TTGI I I 1 1 1 IGGCTACTACTMATtGACTCTGATCACATGGAAGTGTAATAATTTTCTTT 

AGTGTGAATGCCCTGAGCCATGAGGTAACCCTGTTCTGAAGTGGTTTCCTTGCTTGTAAA 

TAA ATAA TGTTGACATGTAAAGCAGGACCACCAAACTGTAAGTCTTGTAGTAATTTTAAT 

GTATmCCA CTCTG GACAAAAAGATGTGTCTTTATTCCTGATAAATATTAATAGTTCTA 

GMGAGGACATTTTCCAAATTAAAGAATTAGGTATTTNTTCATAATTCCCGGGTAGTA 
Sequence 2952 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCANGACACANACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTT TMGA GGGATTATATTTCCTCCTTCCTCCCATTCTCCCTTCATTTCTTCCTAC 

CTCCCTCTTTTCATCCTCTTCTCCTTTTATCTTTTCATCATTCCTTTCTTMGCATTTTA 

CTTAGTTACTACTATG 

Sequence 2953 

CCC7TAGCGTGGTCGCGGCCGAGGTACAAGCCGTGTCTAAGGAGCTGGTTGGAGAGTTTT 

TGCAATTTGTTCAACTTGATAAAGAGGCCTCTGATCCTTTCAGCCTAAATGAATTACGAG 

ATGAATTATCAAGGAAACAGAAAGAAGAATTATGGCAAAGGCTGAAGAATTTATTGACAG 

ATGTGTTGTTAGAAAGCCCAGTGGATGGGTGGCAGGTAGTGGAAGCCCAGGGTGAAGACA 

ATATGGAAACCGAACATGGCTCAAAAATGAGAAAAAGCATAGAAATAATTTATGCAATTA 

CATCTGTGATTCTTGCTTCTGTGTCTGTAATAAATGAAAGTGAGAACTACGAAGCCCTAC 

TGGAATGTGTTATTATATTAAATGGTATTTTATATGCA 

Sequence 2954 

C 'CCTTA GCGTGGTCGCGGCCGAGGTACAACGTGGGCAAGCATCTGTGCATATGAAATGCT 

GT1TTTCTAAMGATTTCTCACATTGGCAMCTGAGAGATTTTCTTCTGGAACACAGGAA 

AGATTATATTAATGCTNATAGCCATACCATGTCTGAATATGGGAGGATGACAGACACNGA 

ACGAGACCAGATAGACCAGGATGCCCAGATATTCATGAGGACCTGTTCAGAAGCAAAGGG 
Sequence 2955 

CCCTTGAGCGGCCGCCCGGGCAGGTCCGGGCAGGTACACTGCATAAAGCCAGAGTTAAAA 

C7TCACTGCCAGCCTCTGAACAGAAGGCTGTTCTATCCACACTATCACAAGACCTGGTGG 

AGTTGAGGCAACTGCTGAATTACCATACAGGGAAGAATGAATTCAAGAAAATTCCCATGC 

AAGATAGGCTCTTAAAAAATAAATTTACACAAGAAAATCAGCACTGTAAAGGTAA7TGAT 

AAGCCCAATAGAAGGGAAACCTATACAAAGAAATAGAAATAACTAAGCAATCTGAAATGG 

AC7TTAMTMTGATGTTTACAATTCTCTMGAGGAAAAGGAGCA7TAGCATCAGTGAAA 

CAAAAGTAGGGCTATAGAAAAAACAATACTTATGAAAAAAACCAATTGGAAATTT7TAGA 

TGGAAAAGCGCGAAGTAAAAA ATTCAA CCCCATGGTCTAAAAGAATAAAACTGCACCCAG 

CTGGAANGGGAAM TTANT TMTTTTTACGAAGAAACCAATTAAATCTTACCAGAATGTA 

MGGAG ATMA GATNTTTTMATAMTCNGGAGTTTAGGAGATNTAAACTTTTTTCCATA 

TGGAGTNTTTTAAATATCCATATGGTTATTATGTATTCCATATATATCCCNAAAGGGGAA 
AGGG 

Sequence 2956 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAACTATTCTCTTTCCTATTGCATTTTCCCTTG 

AGAACATGAACTGACAAGGAACAAGGTGAAGGAGTTTGACTGGACTTTAGGGATAATATA 

ACTCCCTTAAAAGAAGTGTTAAAATGAAGGAGAGATTTTACAAATGGCAAGTAGCATAAA 

AAATGGGAATTGTAAAGAATTAGTATAGATGATTTTTGACTGGGGAAACTTAAACACTAA 

TTACTTTCAGAAAGAATTTGAGATGATTTTCTTTAGTAGGGGTGATAGTGGTAAGGCCCT 

TCTATGAAGT ATGTG ATGGATTGGACTACTTCTGAGAGCCCTTTCTTCTCTTAAAATCTT 

ACGGTAAAAG l 1 1 1 1 GAAGAATCATA7TTCAAG AACTTTTTTC7TCTTCCACATTAGCCA 

TCTTCAGATAATAAGAAACCAGATACCTA 

Sequence 2957 
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CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGATGGCTC 

TMGMGTCTGTCTTTTMCTGACTACATTATTTATATGAMTAGMTTCTAGTATATTT 

TGACTCAGTTTTCCCMCmTTTCGTATTTTCTTTTT 

CATCMCCCCCTTCTTTTGGGTTGGTTGTGTTTTACGCACCTCCTCCTCTTTTCCTC 
CCACTGGTTTGCMGTTATACATTTGATTTCTTTTCTTTCAGTGCCATGCTCTGATAAAA 
TAAAGGACATTTATTACAGAAAA 
Sequence 2958 

CCCTTAGCGTGGTCGCGGCCGAGGTACACAAGTCACGATTATACAGTTTAATGAGTAAAC 

ACTGATCACACCCATATAAACCACTGCCAGGCCAATAAGTAAAATCTTACAAGCAACCCC 

AAAMAACTCTAGGCCTCTACCTTTAGMTGCGTTTGCAGTATTTAGGACTATTGTAGTT 

CC AAMTTA CTAAAAGTMTGMTCATTMTGTATGTGTATTACATGTTAGAAATTTTAT 

TTATTTTTTGAMGCATGCATGTTTATCATTCTTAATAT^ 

TATAMCCTTAAACTGTAGTGACTNTTAGGAGAGTGAAATTCATGAATATAAAGGTAGTC 
CATAAMCCGTGATTTGTAGATTTTAAATCTGAAAAACATTCTCATATGCACCATAAATT 
TTGTCACTGTTCTATTTGATTTGCNTATCTCTACAGGTGMTGMTTTGTTTNNTTTGAA 
GAGGATGGGGGAAGCNTTTNCCATTTTAAACAGGGATAAAAACTGAGGGAAANTGA 
Sequence 2959 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATTTATTAAGCTCTAGGTTCATCAGATGACC 

AGGAAAACATAGACTATCAGGACACAGACAAAGCAGAACTGCAGAATTATAGAAAATTCA 

AGGTCTCATGGAGGAAAAGCAGTATCTACAATAAGAAAGAGACCCCAAACAGCACTCTAG 

ATTTCTT TMGA GGGATTATATTTCCTCCTTCCTCCCATTCTCCCtTCATTTCTTCCTAC 

CTCCCTCTTTTCATCCTCTTCT^ 

CTTAGTTACTACTATGTT 

Sequence 2960 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAATTGCTTGAGTCTGAAGAACCTGCTANGGA 

GCATATACATCTTCAATTAACCTACAACTGGTCTTCAGTAAAACCTCTGTCCCTGNCACA 

CTGAATNTGGTGTAAGTCATTTTTAAAACTTCTGGCCAAAACCCAACCATAATATGCTTT 

CTTGTAAGCCANCTTCATATACCTAAAGAAAACATGTGAAAGGCCGGGCACCGATGGCTC 

ACTCCTGTAATCCCAGCACTTTGGGAAGACGAGGGGG 

Sequence 2961 

CCCTTAGCGTGGTCGCGGCCGAGGTACCCACTGATGTCGTCATGGTGGCAAGTCAGTGTT 

GGCCCTGTCCTAACCACAGGGAAGGGGTTCTCTACAGAAAAGTAGAGTTTAGATCTGAGA 

AGGTAGTTGAGMTGAGTGCCGTATAGTGMTMTAGTAGATTTTTACCCCATTTTACCC 

CTTTGTCTTTGGTTGTATATATATTCACATACCATTCTAATTATACATATAGTTTCAAAA 

ATGCCTCTGCTTCTCCAMCACMCTATACTTTATATATATTTTCAAAAACAGTTCTACC 

TCCTTCTTCCAAAGATAAATATCTGTGTTTTGTGTCCTCCATGGCCACCCCTAAGAGAGG 

TGCAAGGAGGATTTCATTCCTGGGAATTACCTACTTTAATAGCCT 

Sequence 2962 

CCCTTAGCGTGGTCGCGGCCGAGGTAC I I I I 1 1 I I I I I I I I I I I I II If I I GGAGATGGA 

GTCTCCCTCTATCGCCCAGGCTGGAGTGCAGTAGCACAGTCTCGGCTGCCAGGCCTCCCC 

AATCTCTTCTTCTCACAGGGCAGGGCTGAACAGTATCTCCTCCTGGCTGCATCCGAGCCC 

ANAAGTCAAAACCTAAGGGGTCCCTGCTTGGCTGGGACAGAGGCCGCANACCTAGGTCCA 

GCTNTGACTCGACAGNGGCACATCCCTGTGAAGTAGGGCTTGGGGGCTTCCCATGAACAG 

CTTTCANA GTGA GTGCAGTAGAGTGGGGCTGGAGGACAGTTTTGACAGATCTTGAGCAGG 

TGCTTGMTTTTAACCCAACATCCACCCTTACCCATGACCCAGGTGACATTTCAGAGGAG 

GGTTCACCTGCCCTGCAAAGTAAGAGCTGCAAAATCA 

Sequence 2963 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAATGAGGAAGAGAGTTTTGTAAAAGCAAGCAG 
GCTGGCTGTCATTAAAAGTTATCTTAAATAACTTGTCGCAGCCCACTCCCACCAGTCCTG 
AAAATAGTTCATTCAAGTGGTAAAACAAGACAAGGCAGAGGACGATGTCTCTAACTTCCC 
TGATGTGAGATCTAAAGCCCCATTCCTAACTCTTGGTTTTAGAATTGAGGAAAGTGGAAT 
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AAATGCATTTGGAAAGGATCTGTTNTCTTCCCAGGTTTCCTGCCTGGGTTGAAAATAAAG 

GGTTCAGGGGATGGAMCAACTTTGAGGAACATAAAGAGGTATTGGGGG7TCATCM7TC 

ATCTTTGTTTCAAGATGGGTCCCCCCCACCCTCCNCAATGCAAGTTAATTAGGGAGATAA 

TTTTAGCCTACCTTTGGGAATTNATNTNTAMNATAMGTTATCTTTTTTTAANGCTTTG 

TCACTTTMTTGNCCNCCCCTGNATTTGATMNGAAAAGTAGTTTTTTTCTTATTTGACC 

NCTNTTTTNGANTGNCTATTNGGAANNCNTTTTATTTTGGGNACTTTTTTGGGGGGTGGG 

AAAGGAAAGTTAAATT 

Sequence 2964 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGAAGCTGAGTGATTTGTTGAGGTCCCAGTG 

GAGTGTCAGACCTAAACTTCCAGTTTCCCAATTACTTAAATAAGGTTCTTCTCATCAGAC 

CCTCTTTCCTCATCTCTCTACTAAAAAACTACAGTGAATAACTTCCACTGTCATCAAATA 

GAMCTTTCTGTGCTGGTTTCCCCTACCCCACCTTTCTGTGCTTGATAACAGAAGCTGTT 

GGGAGCCCCACCTTCAGTCTTATTTGTCCTTCCTTGCNGG I I 1 1 I CTTTG 

Sequence 2965 

CCCTTGGCCGCCCGGGCAGGTACAATAATGGGA7TGTGGTTATAAGGGAGAATGCCTCTA 

TTCTTCAGAGAGGCACACAGATTTACTTGCTTTTAAATGGTCTAGCAAAAATAAAAACAA 

ATCCAC TCATA AAACTAATATGACAAAGTAGATGGTTCTATAGGTGTTCAATGCATCATC 

CTTTCMTTTTCTAMTATTTGAAATTTTCTTAATAAAAAGATAGGGAAACAAAGGTGGC 

TTGGAGGAAACAAAAACAAAAACAAMCAAGAAGATCCAGTCTAATGGAAAGATGCAATG 

GACTCAGGGATCCTGCAGCCAAAG 

Sequence 2966 

CCCTTAGCGTGGTCGCGGCCGAGGTACTGTTGTTCCAGACTGTATGATTGAAATAGCTGT 

TATTTTCCCAGTTTCTGTAGATCACATATAGGAAGTTCTGCATAATCATAGTGATGAAAA 

CTCATGTT TAAA AAATCCATATATAATTAGCGTGATACAACACAGCATCTTAACACTGAG 

GCTTTAAGTTTTAATAATTCTGTTATTCTCAGTAACACTGAAAGTTGCCTGTGCTCTTTC 

TGTCACACAATTGATTCCAATGTATmAAMTGTGTTTTTCTGAGTCATTTGTTGCCTT 

GTTACTTTTTAAA 

Sequence 2967 

GGNACTTAGCTGGGMTCTGCATCTTTAGAGNTCAAAACTGTGCAGTTTTTCCTCTGATT 

AGGTATTATNATGTTGAACTCCAAAGGGTCATTTACATTCCTTTATAACACTGGAGCTCT 

GGTG TATCG AATTGNGTTATGAGATAAGCCACTNGCAGGGACTTGAACATTCCATTTTCT 

TTAGAT7TTGT NGTA TCAGCATGTGAATATGCTGAATACAACTTGTATCCTAAAGCATAC 

AANCTATAACATTTTCACGTTGGACTCAAATTTC 

Sequence 2968 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTGGAAGGCCCCACTGTTACTTCTTCATACAGG 

CAAAAGCTAATCAACTATTTCACCTTGCAGCTAAGTCAGGACAGAGTGACTTGGAGAAGT 

GCAGATGAACTCCCGTGGCTTTTTCAGCAGCAGGGMGTAAACAGAAGCTGCATGATTGC 

CTTCTTMTCTCTTTGTGTCTCAAAACCTTTATAAAAGGGGACACTTTGCTGAGTTGCTG 

AGTTATTGGCAGTTTGTTGGCAAAGACAAAAGTGCAATGGCAACAGAATACTTCGATTCA 

TTGAAGCAGTATGAGAAAAACTGCGAAGGCGAGGACAACATGAGTTGCTTAGCTGATCTT 

TATGAAACCTTGGGGCGATTTCTCAAGGATCTAGGCCTTCTCAGTCAGGTAAGCTCAGCA 

GATCMTGGCAAGCCAGAGACAGGTGGGATTTTTGTGTTCACTCTTCTCTTTTTCCAAGA 

TGTTMTGTGGTTTTCATTTGTTGTGCTATAATCCTTAATATTTAATCAGATGACAGATC 

TTGCAGCATTTCACTCACTTTAACTGGCCTTTCACCCCTTT 

Sequence 2969 

CCCTTCGACGGCCGCCGGGCAGGTACTTGAGTCTCTTAACTCTGTAAAATCAAAACCTAA 

AGAAAACAAAMGATTATCCCTTGCATTCGMTACTGTTTAGTCAGTGTATATCAATTTT 

TTTTTACTGTATCACTGCATTCAGAGGTATCAAATTTTGCTTGCAGCGCTGTAAAAGTTT 

ACCAGTCAGATTTGTCGGCCCGGCATGGTGGCTCACACCTGTAATCCCAGCACTTTGAGA 

GGCCAAGGCGGGAGGATTGCTTGAGGCTGAAAGTTTGAGACCAGCCTGGGCGACATGATG 

AAACTCCATCTCTACTAAAAACACAAA^ATTGGCCAGGCTTGGTGCCATGCACCCGTGGT 
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CTCAGCTACTTGGGAGGTTGAGTGGGGAGGATCGTTTGAGCCCAGGAGATTGAGGCTTCA 
GTGAG 

Sequence 2970 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTGTGTTGTTTGAATGGTATTTTGAAAGCAA 

TTGCATAGTTACTATTGATTCCTCAAAATGCATAATGGGGATGCTTCTGACCGACTTGCC 

TTTTAGCAAGTTGAAGGTAGGTATGCACGACAGGGCTCTGACTTCACCAGAACTAATAAG 

TTAGATTGCAATAGTAGGAGACAGATATAAATAATGAAGGAAGCTAGAGTTATGACGGCT 

TTCTGGAAAAATGTTGGAAT TGAA AGAAAATGTGGAAAGAGAGTTGAGTGGAAGAAAGAC 

TAATAGAGTAGGAGTATTAGTTTTATGAGAATGTGAGTAAATCTAATATTGGATAAATGA 

ATCTMGGCTACTAATACTACATTTTTGCMCTTACATTTTTGAGTATTTGGGTAGTAAA 

TCACTCCGATAAA AACTTT TAGAGTGGTTCACCTCTCAGTCACATATTTTGTGGTTTTAT 

CTACATATACTTGGTTTTTAAGTCCAAAAATATTGGTTGGATTTCTGGTATACCCTACNT 

TTGT AAAAAG CATAA ACTAGCATTTAAAATTAATCNAATTAATTGCATGTT AI I I I 1 1 1 1 

TTTrAAAGCCATTTTCCTGGNGATATATGGCTTTTGNAATTTTA 

Sequence 2971 

CCCTTAGCGGCCGCCCGGGCAGGTACGTGTCTGTGGTCCCGACTACTGGGGAGGCCAAGG 

TGGGAGAATTGC7TACGACCAGGAGTTTGAGTCCAGCCTGGGCAACATAGCAACACCCCA 

TATCT TATTT AAAAAAAAGCACTCTATGTTTGTTTTGGGCCTCTTTGAGAACATTTACTA 

TGAAATTTTATTTATCTTTGTTTCCTTTCAGTTCTTCTTTTGTTTATTGTGATTATCTTT 

CATATAACAATGTGCTTAAATAGCTGGTGGTTCTTGATAGGTCACTCAACAAGTGAGCAA 

ACTGAT GGAT CCTGTGTTTTGTGAGCAGTGTTTATTGGCTGTGATAGAGGGTGGGGACCT 

AGATGCTTTTAGGGGATCCTCCTGTTATCCATTTCTG 

Sequence 2972 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTGGAAGAATGAGATAATTTATTTGGCAAAAT 

TCATATGTATATGTTATGCCAAACACTTGGCTAATATCTGGATGAGATGGGCAATTTACA 

GACTATGTAATCAACTAGGGTTCTATCTGTAGTTGTGTTATGAGAAACAATGGTTCATTA 

ATTCACGGAAATGTGAATTAAAAATGTCATTAAGCGAAAGACTCTGTTAGCTTGAATTCC 

TATTGGGATATGCMGACACATMCATTTATGCATATTTGTGATGTTCTAATCCATTTTT 

AGGGAATAGGATATTATATAAAGCACCCTAGTTTAAACAAATACATGATTACGGTTGTCA 

ATAGAGATAGATATAAATAGTGACCTGTGTMTTATTTTGCTAATGCTGAMTTTTTTTT 

CAGACAAGTGGATCCAAAA TCTACC AATGATCTTGCATTACATTAAGGGGAAAAGTATCT 

CCTTAATGGGGAATCTTTAC I I 1 1 I I I 1 1 1 CCAAATTGTGCCTATCACATGATCTTATGG 

ATAGATAATGGTTGGTATTCCTGGTGGATTCTTATGNGCAAAAAAAAGAGAACTTGATAC 

ATTGGTTAGTTACATTGCTCTCTTTCTGNGCATCAAGCTTCAACTATCAAGATTTCTAAG 

TGGCAAAAAAGTTTGGGGGCTGNGTTATAGAAGTGGAATAAAATNTTACTGNACATCATT 

CTTTTAATTAAAATCCCN 

Sequence 2973 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACAACAGCTTCCTAACTGGTGTCTCCCTTCCTC 

TNAGCCNTGCTGTGCATCATTCTCGCACCCTGCAGAGAGTTATGTCTTCACAATGCAATT 

TTGGTGTTACCCCTTAACTTAAAAACTTCCAGTGCACTTAGGACAAAATCCAATATTCTT 

AAGCCCCCAAAGACTCTTCATGATCTGACCCTATCTTGCTTTCAACATTCATCCCCTCCA 

ATGTTCTCTCTCAGGTTCTAAGCTTTCCACAAATTGCTACATCTGCAAAGATTACTGCCC 

CCAGCCTTACTCCATCTTCTTTACCTAAATCAATCTACTCAGTTTTCAATTCATTCTTGG 

ATTAAACTTCAAGTTCTTTAGGGGAAGCCCTTT 

Sequence 2974 

CCCTTTCGAG CGGCCGCCCG GGCAGGTACTG ATCATTCAATGAGAG AATATGTCAAAAAA 

TAGTTGGTGCATAGTAGATGCTCAAGAMTGTCAAATTCTCCTGCCTTTTCACTTAGATT 

CTTAAAAGTATGAATGGTTTCATATCCTCATTGACAAAGTATTTCTCTGAAAAATTCTAA 

GAATAAAAAATCCTACATCTCTGAATAATTACATTCATGACTCAGGGTGAATTAAAGCTT 

CACTTTGCCTGGAAAGACGCAAAAGAAACTTGAAGACAAAACAAGAAGCAGATCACAAAG 

ACATAAAGTATAGTG 
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Sequence 2975 

CCCTTAGCGTGGTCGCGGCCGAGGTACACACTTCTGTAACTTAATATGTCATTTGGATAC 

CTTTTCTCAAGMGACTCAMTAAATMTTTTATTATGCCTTAAATATTACCAAGTCAGA 

ATACTCACAGATGGATAC TCTG TCCAAATGAGTATTTGTGATTGCGCATTTACACACACA 

TATAATTTGTCATTGGTAATTTTGTTGCAGMATTATTGTGAGAAMTAAATTTTAAAAG 

AAMTACGATTCTATGMTACATTTTATATTTTACCTTTTTCTGGCACATACTGTTG^ 

CAMCTGATTMTTMTGTMTTATGTTAAACTTCTCCAGTTAAAAAAGAGAATATACAC 

TGCAAAGCTGAATGTGTATACTACTTGTTGAA 

Sequence 2976 

CCCTTAGCGTGGTCGCGGCCGAGGTACTTAAAGAGCTCGTGGTGTGTGCCAGGAGTGTGT 

TTTCCATGCATTCTTCCTGAGAAGTAGGTTTGATTATACAGTATCTATTTTACAGATAAA 

GGTTGAGGTCTTTTCTTTCTAAGGTGTTATTAGACAGCTATAGACATGGACACCAAAACT 

NGNCGAGCCTNCGGCATTGTAAAGGCTTGGCTTACCTTCCATCACCCTCAGCAGTTCTGC 

TGGAAGGGAAAAATGGAAACAGGGCCCTTAATTTTTTCTCATTATTGGATTAATTGGAAG 

CACTAGGCATGGTNTGCTT 

Sequence 2977 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATG 

TTTMGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 

ATGGCAACGTGACCMTTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 

TGGATTTAT GTCT ATATGA CAAA TCTTGATACTGCATCCACMCATTACTGGCGTGCTTT 

TTGCTTTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTAATACGTCCGCTACTTTTGTCT 

GTGTCTACATA CACA TCATTAATACTATTGGTATTACTATTGTGGATAACAGCAGCCTCT 

GCGTTTAGGTGTTTTATTGGATATATTGGATTTGNTTATATACCATTATTTT 

Sequence 2978 

CCCTTAGCGTGGTCGCGGCCGAGGTACATAACACATTTATGTGTCTATGTGTATATACTA 

TTTATGTAAACATATAATATATGCAAAACATATATGCATATACACATACCGATATATGCA 

MTCTGTATA TAGA AAMCTTAGTTTACTCAAGATTATTAAAATTGATCCTGGGAACAAA 

ACAATACACCTTTTCTAGGCATTTATCCCTACTATAGAGACAGAGAAAGAAAGATTGTGA 

GAGGGAGACACACAQATTCAGATGAGCCTTCAGGGAATTGTATGMCTTTTGAACCAGAG 

ATGGAGAAGCCAGAGCCCAGCCtGTTCCCACTTTAATAMCATTTTTTTCTTCMTTAAT 

TAAGGTTTGGGAAAGAAAAAAAAACCCCCAGAAGAGAAAGTAATTTAACATGCCATGCAT 

ACTTTACAGATGGATCCTCCNTCNGATGCCTATTAAATCTATGCATGATTGCNCTGGAAA 

AMTTAATTGCATTTGGTTCTCACTATTGGGAAAATAGGGTGCCAACAGCTCAAAAATAA 

AAANGGAAAATGAAACCANGGTGGNTGNTGGTGGTTTA 

Sequence 2979 

ACGCGGGCTCCMTTTAATTCTTTTAGTTGMTGAGCTTCTTTCTCGCTTGTTACAGACT 

CACTTAGGGCTCATTCCTCCCTGCCTTAGTCCCAGTATGGTCTCAGTAATGACATGCAGT 

ATGGGGACTTTGGACTCCTGTTTGMCAMCCAGCTTTGAGATGATAAAAATTTGAATAT 

GMTTGAGTTCTTGMGATMGAAATTGTTAATTTTGTCAGGTTTGTTAATGGCATGGTA 

GTTACTCTAAAAACAAATTTCTGGGCGAGTGTAGTGGCTCATGCCTGCAATCCTAGTACC 

TCGGCCGCGACCACCGCTAAGGGCGAATTCC 

Sequence 2980 

CCCGCGTCCGCTTTCTTATTTGGAAAGAAAACAACACTCACCACGGCATTCACAATGCTG 

GGGACCTTATGAATGGGAATGGAGAGGCTGCACTAATGTAAGTGGGGTCAACATCCCTGC 

TGGAGGGC AGAT GCAAGTGGGGTCAACATCCCTGCTGGAGGGCAGATGCAATTTTGGCAA 

ACTMTGACTTTTGGMGCCATTTTCTAGACTGCATTTTTCTCTTCTTTCTTAGM 

TTTCTCAAGGGGAAAAAGAAAGCAGAGGTGTAAATGGTATCTGGCCGTTCGAAGAGCCCG 

TCTGGGAGACTCTCTGCTGACCACAGCACTGTCTCTCCCCAGGGCCCAGTCCCACAGTGT 

CACTGAGTCCAGAGTCGGGACTCCAACAAGCCCTCCGCATTCCAGCATGACTGTGGTAAA 

MCAAAAAAAATCTATTTTATTCTAGTCCATCACATTTTCTATATTTGCTAAAGAAATCT 
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CNATCACTCCATAAATGATCGGTTAAAATNAAAAATATAGGCTGGGTACAGCAGCTTATG 

CCCTGTAATCCCATGCTTTGGGACACTGAGGCTGGAAGATCACTTTGAGGCCANGAGTTC 

NAGACCAGCCTGGGCAACATAAAAAAGACCCATNTTTTCCAAAATTAAAAATAATTNGCT 

GGGGGGAGCCTAATTCCACTACTTGGGAAGCTGAAGGCTGGAGGATTGCTTGACCCTGGG 

AGTGAGANGCTACANTGAGCTTTTATTGCCCCACTGCNCTTCANCTTTGGCCAACAAAAT 

GANAACCCTNTCCGTAAAAAATCNNCNCNNNTCAAAAANTTTTAAAAAAAAAAAAAAAA 

Sequence 2981 

CCCTTAGCGTGGTCGCGGCCGAGGTACMTMTCGTTTTTGGAATGATGGAGTAAATGGT 

GGATTTCATGCCTCAACTTGAATACACTCTCTGGTGGCTGAAACCGAGGAAGAATGTTGC 

TCCAMATATTMCGGGTGATAACTTGCTTTTCTATTTTAAMTAGAAAAATTAATTCTC 

CCATTCTC TTGCATG GAAAGAACCACTCAAACGGAMTTATTTTGTATTTAGTCACTATT 

^^^^T^ZZ^"^^^^^^^^^"^^^^ ^ CTC ATGCTCAG AAGCT ATTTTTCTTAATATGTT 

TGAGTTTTTAAAAATTGGAAAAAATATCTCTTCCACCMGMTTTGGCAAAAGAAAGCTA 

GAATTAAACACCGCAATAATAAATTTCACAACACTATGAAAAG 

Sequence 2982 

CCCTTAGCGT GGTCG CGGCCGAGGTACACTNACAGCCCTGTCCTGAGCCTGTGTTAAAGC 

CGGCACTCTATTTTTCTGGTGTCATGTCAGCTGCTCACATGTCTGTCCCTCACTCCATGC 

CCCACATCCCCTTGTTCCAGGGTCCGGCTCGCAGTCAGCACTTTTACCATGGGAATGCAT 

GCCTTAGTTTAGCCCACCTTCCCTGGGAAAAAGANCTGGATATACACACAACATACCCAT 

GTCTCM7TGTCTGGTATGCMGTGATATGACATATTTGTGTTMTAAATTTGNGNCTTA 

TATGCATTTGTGTGCAGGCAGTCACTCAAACTGGGACACTTTTGATCTAAATAAATACTA 

AGGNGGATCCTTGGACAATTGGCAAAGCCAGGACTGTCAGATAAATGGTCCCCTTGG 

Sequence 2983 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAAAAACTAGTAATGCAAAGGCAGCAATGTTAG 

CMAATTTAAAGAGTTATACCGGGGTGAGTTTTACAGAATTAGTAAGACCATTTAAAAGT 

MTAAATCMCGTGTTGCGATTGGTGTATTGCTGCATTTGGACTTACACCCAGTATAGCT 

GACAGTATAAAMCACTATTACMCAATATTGTTTATATTTACACATTCAAAGTTTAGCA 

TGTTCATGGGGAATGGTTGTGTTACTATTAGTAAGATATAAATGTGGAAAAAATAGAGAA 

ACAATTGAAAAATTGCTGTCTA 

Sequence 2984 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACGCGGGGAGCGCGCCGGAAGAAAAACCAGCAA 

GMGGCGGCGGGGGMGATGGCGGTCCTGGGGTAGAGTTTGCMGCTTTCTGACTAGGCT 

AGTCGAGCAACTATTCGGGTCATGGCGTCAAACTCAACTAAGTCTTTCCTGGCAGATGCC 

GGCTATGGCGAACAGGAACTGGATGCCAACTCTGCCCTTATGGMTTGGACAAAGGCCTA 

AGATCTGGCAAACTTGGTGMCAGTGTGAAGCAGTTGTTCGCTTTCCCAGACTTTTTCAG 

MGTATCCATTCCCTATTCTTATCAATTCTGCATTCCTAMGTTAGCTGATATTTTCAGA 

GTTGGAAATAATTTCCTGAGGCTATGTGTTCTTAAAGTTACCCAACAAAGTGAGAAACAT 

TTGGAGMGATTCTAAATGTGGATGAATTTGNGAAGAGMTTTTTTCTGGGATTCATAGT 

MTGATCCTGTGGCAAGAGCCATCACCCTCCGGAT 

Sequence 2985 

CCCTTTCGAGCGGCCGCCCGGGCAGGTAC I I I < I I I i I I M i I I I I I I I II I GGTGGACT 

CAGGGTCTTAC TCTGT CACCCAGGCTGGAGTGCAGTGGTGCAATCTCAGTTTAAACCCCC 

MGTCTGTGGTA7TTTGTTATAGTCCCCCTGGAAATTAATACAGGATTCATGGTCTGCCA 

AAGTCTCATTCCCTGCTCANACCGACAATGGATAGTGGGATCCTTTGTGATATGTGCTCT 

GTGAGTTANAAGTAGTAACTTANAGCTGTTGAGGACAATACCTTTGCACCCCATCCACTG 

MTTTTGTTTTCAMGATACAGTCTGAMCTCCCAGAGGGTGAGTGCCTTTCCAGTGTGT 

CAGAGCCATTAGCGGCTGAGCAGACCGCTGAGCATGTANATTTCTAGGTCAGGGCCCTCT 

CCCCAMGMTCATACTCCCTGGTTGAACTGAACCTAAATCAGGTTTAAAACTAGACAGA 

TGCCAACCAACTTAAAATTCCAAAGATATNCAAGACCTGCCCATGGC 

Sequence 2986 

CCCTTAGCGTGGTCGCGGCCGAGGTACCAACAGCAGTGAACCAGGGCCTANTTGTCAGCA 
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ACCAANACC TNTGCTGACCCA NAAGCANACAGGTGACCCTTTCACCAAAACTTCAATCAT 

CAGTCTTCTCTTTTCTTTTTTTGAAACANAGTCTTGCTCTGTCGCCCATGCTGGAGTGCA 

GTGGCGCGATCTCNACTCACTGCAAGCTCTGCCTTCCAAGTTCACGCCATTCTCCTGCCT 

CAGCC TCCCGAGTMCTGGGACTACAGGCGCCCGCCACCATGCCCGGCTMTTTTTTGTA 

TTTTTCATANANACATGGTTTCACCATGTTAGCCATGATGGTCTCGATCTCCTGACCTCG 

TGATCCACATGCCTTGGCCTCCCAAAGTGCTGGGATTACAGGTGTGAGCCACTGCGCCCG 

GCTAGTTTTCTC 

Sequence 2987 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAACG 

TTTAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 

ATGGCMCGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 

TGGATTTAT GTCT ATATGACAAATCTTGATACTGCATCCACAACATTATTGGCGTGCTTT 

TTGCTTTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTMTACGTCCGCTGCTTTTGTCT 

GTGTCTACATA CACA TCATTAATACTATTGGTATTACTATTGTGGATAACAGCAGCCTCT 

GCGTTTAGGTGTTTTATTGNATATATTGGATTTGNTTATATACCATTATTTTTM 

ACACATGCACGCTTTT 

Sequence 2988 

GGTACMGAGTTGGAAAAACAGAGTTGAAAGGAAAAATGAGGATAATCTTATTTAAAAGA 

TTATTGMTGCCTMGTGCCTTTAATAACTCCACAAGGACCCTCATGTAAACAAAATTCA 

GCAATAGCAACAAAACAAAGCCTCTGAAATGCTCATTGTTCCCCAGTGAGATAAGCTTTA 

MGAGCANAAATACGAAAAACAAMCCAMTNTCAAGTNAMCTTAAAAATGTTACTTTA 

TGGGAGAGGACAGGCAAGGCGGCAAGAACATAGAAGCAGTGCCACCTGAAGCTGGCATGA 

TMGGTTGATTTMCTGCACTGGAAGTGACAGMTCTGAAAGAACGACTTTTCAAATGTN 

AGATAMTCCATTTCAAANNATNGTTAAACTTGTGAATTCAACAGAAAAACTGTAAAACT 

TTCNGNGGAGGCTGTTAAGAGCAAAATTGTTCCCTCAGAGCANAGGACTCATTGATGCCG 

AAGATAGAGGAACATGGCCAACCCAATTAAAA 

Sequence 2989 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAAGAGAAGAACTCATGTCAATGGAAAGTCTGT 

TTATCCTTAMGCCACATTTGTAGACTAGACAGTTCCAAAATATCCCTTGGCAATCCCTT 

TTGCAMGTMTT GGAA MTGCTGTGCCMGTGATAAAMGCATTTCAAGTTTCTGCCTA 

ATGTCTGTTATAGTTTTCAAAGCCCTGAATTGGAAAACAAGATTAAAACTAGTTCTGAAA 

TAMCMCMGTATTTACTGAAGAACAACTATGAGGCACTCTGCTGGAACTTTGGGGATT 

TGAAGATGTGTAAGACTGAGCTTACAATTTAGTTGGGGAGATCAGACGTGTGCACAGCCA 

AGGTTAAGCTCTAATACAGCATGAAAATATCTGAAATGTCAAAAAACACCAGCAACTGTT 

TCCATCTCANGGAGTGAGGCTACTATCCTCCAAAGGCAAGTCCTAGCCCCATCAAAA 

Sequence 2990 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATATACATTATGTAATTAAAAAGCGTGCATG 

TGTATGTATTAAAAATAATGGTATATAAACAAATACAATATATACAATAAAACACCTAAA 

CGCAGAGGCTGCTGTTATCCACAATAGTAATACTAATAGTATTAATGATGTGTATGTAGA 

CACAGACAAAAGCAGCGGACGTATTAATAGGCAGACACACAAAAGCACACAAAAGCAAAG 

CAAAAAGCACGCCAGTAATGTTGTGGATGCAGTATCAAGATTTGTCATATAGACATAAAT 

CCAGTAGACACTGTMTAGTTTTTGGTATTTTMCTTGAGACMAAATTGGTCACGTTGC 

CATTCACTATCATATGTAAGTGTMCAATTGCACTTTTATGTTTTACATTATGTCCTGTC 

CMTGCCATGTAGACGACACTGCAGTATACMTTTACMTGCTTTTTAAATCTATATCTT 

AAACATTTTAAAGGATTAGTATCACCTT 

Sequence 2991 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACCAGGACTGTCCTAGTGCGGGGAATAAAAGTG 
AAATTATAGTCCTTTCTCCTGAAGAAATCAGGGTCTAGTGGAAAGGAAGGTGGGACCGGA 
AGGAAGCTGGCCGGTGGAGCTTGGCCCTGTGCTAAGCATTCCTATCCACTTAACTGTGGG 
TAATCGTCTGAAACTCCCTACAGACCTGGGAGTAAGCTATTGTATTGAAGGGGCTGTATG 
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GAGTCAGGGAGAGAAGCCAGGCTTTANAGTCAGATGGCTCTAATCGTGCATCTCACCCTC 

TCTGTGTCCCCAAGCTCTCCCTCTGTAAGATGGGGACAATGACAGCACCCACC 
Sequence 2992 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTGTATACTGCAGTGTCGTCTACATGGCATTG 

GACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAATG 

GCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTGG 

ATTTATGTCTATATGACAMTCTTGATACTGCATCCACAACATTATTGGCGTGCTTTTTG 

CTTTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTAATACGTCCGCTGCTTTTGTCTGTG 

TCTACATACACATCAT 

Sequence 2993 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACTTTTCTCCAAGTCAATGAATCCAATCTTTAA 

TGGATCTGGGATGCTCTCTGACCAGGTCCTGTTATGAGTTTCCTGTGTTATAATTAAAAA 

CCCCAAACTGATTAAGC ATGAG AGGATACTTCCTGCTCTCAGATAAATTATTTATTATAT 

TCT GTAT TATCCTTGTAATTTTTCTGGATATGAATAAATAACAATGTATCTCAGAAAAAC 

ACATTTTGAAMGCCTTATAMGTTAAMCTTTAAATGCTTTTCATCAAAATGTTATGAC 

CAATAAAAATGCCATTCATGTATCTCATCACACTTTCTATACATTTTGTAGTGATATGTA 

TT TATAC ATACTGCAGTTGGAAATAGGGMTGCTTTTTTGGTTTCCCACGGTAGGCTGCA 

GGTTTTAAAAGCAGAAGTCCTGAACCTCTGAAGGCAACATAAGAAAAGTATTCATTTGAA 
ACCTG 

Sequence 2994 

CCC7TAGCGTGGTCGCGGCCGAGGTACTTGTGTGGGTGTCTTACCTTGCATTAGAATATC 

CTAGGCCTTTCTAATTCTTCCTCTTGAATTCCCTGTGTCACTCATCTGAGGTCATCTCTT 

GGATTTAAATACATTTGGTCTGAAATTCTCAAACTGCTGAATAAAATGCCATANTAAGTC 

CAACTATATGGTTTCATGTAGGGTAAAGTAACCATTGTGGAAGGATTACGATAAAACTGT 

GTGACAATAAATAGAATAAAAGATGGCAAAACAATCTGTAAATTACAAAGTGTTATAAAA 

ATGCATTGMTTTCTAGTGTTCTTACAC7TTCATATTTCTCATTGGCACATCAATCANAC 

CTTGCCTTCCTAAAGTGGGAGGAAGAAAAAGGCTGCACCTGTCTGGGGCTGAAGGTCTGG 

GCAGAAACACTAGCCAAAACTTTTACTAAATCTTCATGTCACCCCCCCAAACTNATTAGG 

GGCCCAMCTTATTTCATTATCTTCACTATCCCCCAGGGTTTCCCTCTCCNTAT7TTTTC 

ATGCCCCACCTTTCAGCACTTTTCATGGTTNCTTNGTCTTTAAAAAACCCCCAA 

Sequence 2995 

CCCTTCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATGTTTAA 

GATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTGCATGGCATTGGA 

CAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAATGGC 

AACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTGGAT 

TTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTTTTTGCT 

TTGCTTTTGTGTGCTTTTGT 

Sequence 2996 

CCCTTAGCGTGG TCGCG GCCGAGGTACCATTTGGGATTATGGTTTTTAAAAGCTTGAGAA 

TATTACTTATTTACTTTTGCAGCTTTCCATGCAGCTG I I 1 1 1 1 1 1 I CTCTCTGTTTTTGA 

CTTTGGANATAAAGTTACTGATTTTTATCT7TACTTTTCTCTTTATTTTGTGCCTTGGTC 

TTGAAAAAGTAATAAATATTTCTTACTGAGGCCCCCTCCTGCTTTGTTCTGCCATCTAAG 

GTGATTACAATGCTCAAGTCTCTTTGTTGGTTGCCCTTGACTAATGCTTTTCAGCAGCCC 

CGGGCTCAGCTCCTGGTTCCCAGTATGGCACAATGACCAGGCAGATATCTCGACACAACT 

CTACTACTTCTTCGACATCTTCTGGTGGATACAGACAAACTCCCTCTGTGACTGCTCAAT 

TTTCTGCTCAGCCTCATGTTAATGGGAGGTCCAC7TTATTCTCAAAATTCAAGTAAGCTT 

GT GCTTT GCAGGCATACAGCAACAGGGGGAGCTCCTTTTGGAAGGAAATTATTTGAAACT 

GGTT TTT CTTG CCAGGGGATAGACAACCAATTGAAAAACCATGAAGGNTNCAAGGGTTAT 

TTTTGGGT TTTANG AAATTAAGNCCCTTGAAATCATACCNATTTTTAAANANGTAATTGN 

TTTACCACCTTTT7TIMTTCCCA 

Sequence 2997 
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CCCTTTCGAGCGGCCGCCCGGGCAGGTACATGTTCCTTCCCTTACTTTGGGGAGTCTAGG 

TTGTGAATTTGAAACMAATCAGATTCTTTTCATCTTCCCTTTCCCCTCAAATTCCTGAG 

AAAACCTCCAACCTTCTAMTTTATAGCAAATCAACTATMTTATGTGTTTCCATTTGAA 

ATTCMGCTAMATMCATACTTTAAAAAGTGTATCTTAAAAATCATATTTCGCTTCAAA 

AAMCTTGTTAAMGTAATTTGCATCAGATCCTGGAGTCGACTTGAAGAATTCTCCTACA 

TCTGACACCCAGTTAGGCCCTTTGAGAAAGAGAGAAMGAGAATTTTTTMTGCATGTTT 

GATTATGGCCATCTCTTTTCTTAGAAGGTAGAAGATAGCACCATGCCGATTCGTCGAACT 

GTGAATTCTACCCGGGAAACTCC 

Sequence 2998 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATGTT 

TMGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCATT 

GGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAAT 

GGCMCGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACTG 

GATTTAT GTCT ATATGACAAATCTTGATACTGCATCCACAACATTATTGGCGTGCTTTTT 

GCTTTGCTTTTGTGTGCTTTTGTG 

Sequence 2999 

ACATCTGCACATTGTGCAGGTTAGTTACATATGTATACATGTGCCATGCTGGTGCACTGC 

ACCCACTMATCGTCATCTAGAAGTGTTTTCTAATTTTATTTGTMTTTCTTCTTTGTTC 

CATTGGCTGTTGAAGAATGTGTTGTTTGGCCGAGCACAGTGGCCGACGCCTGTGATCCCA 

GCACTTTGGGAGGCCGAGGCAGGTGGATCACGAGGTCAGGAGATCCCCGCGTACATCTAT 

TTAAAAGTCAATACACAGGAAAAAGGTCTGGAAGAATATATACTAGAAAAGAATGGTTAC 

CTCTAGGGGATGGAGTGTCAAGGTGAATTTTGGTTTATTCATACTGTCTAAATGATTGCT 

TAATA ACTC AGGTATTGCTTTCAAAAAGTAAGAGGAAGGGTGGGAAATACCCTGTTCTCT 

CANAATTTTAAGTCCAGAGTGCTGGGGAGAAAAANATGGCAGCACTTGANCGGGACTGAA 

AAAGT 

Sequence 3000 

CCCTTAGCGTGGTCGCGGCCGAGGTACGCGGGTAGGAGCAGGAGGAGCAGTGGGGGAAG 

TGGTATTATTTATACCACTCTTTGTATITTATCACTTTTTATMGTCTGAMTTAT^ 

AM TGAAA ATGTAAAMGMCAAGGAGTTTCTCACTATTCTTTGTCCCCCTAAAAAGACC 

ATCl 1 1 1 1 AAAAAGATTAAGGAATTCATACAAAATAAGTAAGAATTAACAATTATAGGAA 

GTTTGGGCCAAGCACAGTGGCTCATGCCTGTAATCCCAGCGCTTTGGGAAGCTGAGGTGG 

GTGGATTGCTTGGGCCTAAGAGTTCAAGACTANCCCTGGTAACAGCGAGGACCCTTGTCT 

CTACAAAAAATAAAAAATACTAAC 

Sequence 3001 

CCCTTAGCGTGGTCGCGGCCGAGGTACAGATGGGGTTTCACTGTGTTAGCCAGGATGGTC 

TTGATCTCCTGACCTCGTGATCTGCCCGCCTCGGTCTCCCAAAGTGCTGCGATTACAGGC 

GTGAGCCACTGTGCCCAGGCCTATTGGGGGTTATGCAGGTTCCTTTGCTTGGCCATGGTA 

GATGGCTMGTCTTTGCTTCCAAGTTCTTTTGACCATGAGACATGTAGAGCCAGGGAGGA 

AAGAGAAAACCCAAACCAGGTCACTGTTACCTGTGAATAACATCTTGATCGAGTTCTGAG 

A 

Sequence 3002 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTTCATATTCTCCACCTTTGTTCCTTGCC 

CTTTGCCTTCAAAGATATGTAGATGTTCCTTAACTCCAAAAATTATAAAAACAAACAAAA 

AMCTTTACCACTGTTCATTATCAAAGTTACTATTCATTTATCCATTCMTATTTAGAGT 

AGCTAACAAGACAAAACAAAATAATTGCTGCTTTCGTGGCTTATAGTGGGGAAGGGTGAT 

AAACACCATGATGAAGGAGAGAAGAGTGCTATTGTGGGGGCACAAATTATGGGGATCTGA 

TCCTTTTTTTTAAGGGAGGCAT 

Sequence 3003 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATG 

TTTAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 
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ATGGCAACGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 
TGGATTTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTTT 
TTG CTTTGCTTTTGTGTG C 
Sequence 3004 

CCCTTAGCGTGGTCGCGGCCGAGGTACCATGTTCTAAAGGCAATCGAGTCATAATACACT 

GAAAGCAGTCCAAGAGCAGCAAGAGACTTTGCTGCAGCTGTAGATCACTGGATCGGTAAC 

TCGTAGTTAAAGGAAATCCCTACCCACTCCCATCCAGCCCCTACCCCCTAATTATGGGAC 

AAMTAGCCMTTCAATGAGATTTAAACAGGAAGTTCACAAGAGGAACTCTTATACCTAT 

GAGGACCCATTAACCAGGGATTCCACAACCAGTAGAATATTATCTACTGGTAGCTATTTA 

AGCCTTACCTGACAGGTCTCCAAGCCAGCCGCAGCAGCCACATCCCTCTCTGCGCCATAA 

ACGTCTCGCCTCAGTTTCCTGAAATTCCCAACTGAGAANGGAATATAGCCAGCAGGCACT 

CCCAACCCTGTTATGATTCTACAGTTCTACTGTTAGTAAATTTCCCAGGGGCATCACAAT 

CTGTGTCCTTACACATGCATACACAGAGGAGCTTCCTTTTTANGGATGAGCTCCTCATTA 

CAGGGC 

Sequence 3005 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAAGTCTATCGCCATGCTGTGTCACGTGGGCAG 

GTCTTCTTGGCA TTCATC TTGAACATGACCCAAATCCTGGCTCCATCTTCCCTCTTACTT 

TTCCTTTGCCCCATTTTTCTTCCTTATTTTTTATGTATGTTTACCT^ 

ATTTGTAAGCTGGCTCAMTCCTTTTGGGAACAAGGTGGGTTATAAATGCACACGTTCCT 

ACACAGGATGTTGTTAGTCATTTCATAAGGTCTCTTGTGGAGGAGATTAGCCAATAGCTT 

CTTTTGTAACCTTAGGCA 

Sequence 3006 

C CCTTA GCGTGGTCGTGGCCGAGGTACMAACACATTTGGTCCTGAGATTTTTCAGAAGC 

AATTTTAAGTGTAATAGGAATCCTCAGAAGGAAGAAAGTAAACATGTTCTGAGCAACTCC 

TCCTCCAGGCACTACAATAAAAGAGAAGGGAACTGGACCTGGAAAGACAGACAGTCCATA 

GAGAAGTGGAGAGGGAAGAGGAAGAGAGCATGCTGCTTAATGCAGGTTGACATTTCTGTT 

TGTGMGCTMTTAMTTCCAGMGGATTAAATACTTAATACTTTACCGTCTGTATGTAT 

GTATGTATTTATTTATTTATATTTTTTTGAG 

AGTGCAAAGGCCCAATCTTGGCTCACTGCAACCTCCACCTTCCAGGTTCAAGCGATTCTT 

CTG ACTCAC CTCCAAGTAGCTGGGATTACAGGCACACACCACCATGCNGAGCTMTTTTT 

TGNATTTTTANTAAAANACANGGGTTCCCCGGNTACCCGGATGGGCTCAATCTCCTGACC 

TCNGGATCTGGCTGTNTCGGGCCTCCCAANTGCTTGGATANAGGCGTGAGCCCCTGGGCC 

CCACTTATACCC7TNATMNTAAATATTTMTCCCCMGNTTTAMTTTTNNTTATCCCT 

TGTCGGGGTAAACCTTTCCCNAAAAAGG 

Sequence 3007 

CCCTTAGCGTGGTCGCGGCCGAGGTACATGATTCTACACTGAATCTGCATTTCACTCCCA 

TATCTATACCAGAAGGTTATCAGTGGAAGAGAAATTCAGTTATCTTGAATGGACATGATC 

TTCTCAGGAGCAGTCAGTGGTTAATTGGGACAAGAAAACACAAGTCATTATCATTGAGAA 

ATCTGAAGCAMTTGAGGCAGGTTGTCACCTTTCACCAGGAAACAAATTAGCCCTGTCTT 

TAAAAAGACTTCTTTCTCTCTCTGCTGACTGGGGATTCCTGTCAACTGCCAATGAAATAG 

GGAGAGTGTAGATTAATGGC 

Sequence 3008 

CCCTTAGCGTGGTCGCGGCCGAGGTACTAACCATATGAAAATAGTTGCTACCTTGTTGGA 

CAGGACAGATATACAMGCATTTCCATTTTTACAGAAAATGTGTATGGATACCATGGAGG 

TTTGAAATAAATGCTGAGTTCATTCAAACTATAGTGAAATACAGCATGTGAAAAAGTAAT 

TACTTATTGACTATAATAAATGCTTCATAAAATATAATAGATGCTGATTCTGGCCAAGAT 

TGACAAAATCTAATGCAGCCTACATGTCTCAGTGGTTAATACTAAGACTCTGAATAAAAT 

ACAAGCAAATAAACA 

Sequence 3009 

CCCTTAGCGTGGTCGCGGCCGAGGTACCTTTGCCAGTTTCTGGCATATTAACAGCTGAGA 
TCAGATAAGTGAACAGCGATAGAGGTAGTGAGGTTTAGGATTCAGGTCTGCCACCGGGAG 
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CGCACCTCTGGGCTCAGGTCTCTCATTTGTAAAATGGGACATTTAGGTCAGATCAGCATT 

TTCAAAAGTGTGACATGTTAGTAMTAAGATGATTTTATGTGGTTGACAGATGAATGTAT 

TACCTTTAGTGGTTATATGTTTATTTTCATGGTTACTT^ 

CTAGCTAAGMTCMTTATCTCTTTAAMTATATTTAAAGTTTAAAATGTGGGCCAACTT 
AG 

Sequence 3010 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATG 

TTTMGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCA 

TTGGACAGGACATAAT GTAAA ACATAAAAGTGCAATTGTTACACTTACATATGATAGTGA 

ATGGCMCGTGGCCMTTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTAC 

TGGATTTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTACCTGGCGTGCTT 

TTTGCTTTGCTTTTGTGTGCTT 

Sequence 3011 

CCCTTC GAGCG GCCGCCCGGGCAGGTACTGAATGGAAGAGAGAGTTGGAACAGTGAACAT 

ATTCTACTTTTTCAAGMTTTCTCTTTTAAAGGTATGGGAAAMGATGGCACAAMTTC^ 

TTTGAGGATAGCATAGATCTAAAGGCATTTAAMCCTTGAGTCTTGAACATATTTATAGG 

CACGGAAAAAATTCTACAGAAAGGAAAGATTGTAATTATAGGAGAGAGAAAAACAGTGAT 

ATTTCTGAGGATATTGTGGGGAGTGGGTGCTAAAGGACCATTGGATATATCATTCTTAAA 

CCAGAAAGGGGAAAGGATTGGGTACCTTCGGNCGCGACCACGCTAANGG 

Sequence 3012 

CCCTTTGAGCGGCCGCCCGGGCAGGTACATTTAAMGGTGATGCTAATACTTTAAAATGT 

TTAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGAGTGTCCGTCTACATGGCAT 

TGGACAGGACATAAT GTAA AACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAA 

TGGCAACGTGACCMTTTTCGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTACT 

GGATTTATGTCTATATGACAAATCTTGATACTGCATCCACMCATTACTGGCGTGCTTTT 

TGC7TTGCTTTTGTGTGCTTTTGTGTGTCTGCCTATTMTACCGTCCGCTGCTTTTGTCT 

GTGTCTACATACACATCATTAATACTATTGGTATTACTATTGTGGATAACAGCAGCCTCT 

GCGTTTAGGTGTTTTATTGNATATATTGNATTTGGTTATATACCATTATTTT^ 

ACACATGCACGCTTTTT 

Sequence 3013 

CCCTTAGCGTGGTCGCGGCCGAGGTACAAATAGGGTCACCTACTCTCAAATTCTGCGGTT 

CAMTTGTGGTTGGAGATGAGATGTTCTATTTCTGCATCTCTCCTGGACCATCTTAGAGC 

AGTGCAAAGTTTACTGGATCTTCCAGTGCATTATTGGCCCTTGGTGTGATGACGAGAGAG 

CAGGMGGAGGTCTCCAGCTCCTTTTCTGAGTTCAGATGTTCCCTAGCCCAAGAAGACTA 

ATCATGCACCTGGTTTCCTGCTCTGAMMTMGAAGTGATTTTGGCAGCCTTTGAATCT 

GGATGGAAGGTGGT 

Sequence 3014 

CCCTTTCGAGCGGCCGCCCGGGCAGGTACATTTAMAGGTGATGCTMTACTTTAAAATG 

TTTMGATATAGATTTAAAAAGCATTGTAAATTGTATACTAGCAGTGTCGTCTACATGGC 

ATTGGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACACTTACATATGATAGTG 

MTGGCAACGTGACCMTTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGTGTCTA 

CTGGATTTATGTCTATATGACAAATCTTGATACTGCATCCACAACATTACTGGCGTGCTT 

TTTGCTTTGCTTTTGT 

Sequence 3015 

CCCTTAGCGTGGTCGCGGCCGAGGTACATTTAAAAGGTGATGCTAATACTTTAAAATGTT 

TAAGATATAGATTTAAAAAGCATTGTAAATTGTATACTGCAGTGTCGTCTACATGGCATT 

GGACAGGACATAATGTAAAACATAAAAGTGCAATTGTTACACTTACATATGATAGTGAAT 

GGCMCGTGACCAATTTTTGTCTCAAGTTAAAATACCAAAAACTATTACAGCGTCTACTG 

GATTTATGTCTATATGACAAATCTTGATACTTGCATCCACAACATTACTGGCGTGCTTTT 

TGCTTTGCTTTTGTGTGCTT 

Sequence 3016 



